# Most alphabetic
Marcin Ciura
Twitter: @mciura
## Judges' comments:
### To use:
make
./getwords.sh en | grep .. | ./prog aeiouvwxyz
### Try:
./getwords.sh en | grep .. | ./prog aeiouhjklmnvwxyz
./getwords.sh en | grep .. | ./prog mrjocktvquizphdbagsfewlynx
./en.sh
./fr.sh
./ru.sh
### Selected Judges Remarks:
A few letters, one at a time, with no repeats.
How many different ways can this be done?
While a quick brown fox might jump over the lazy dog,
it has too many repeat letters to allow this entry to repeat.
It is better that Mr Jock, TV quiz PhD, bags few lynx.
Speaking of jumping, can you rewrite the code to
remote all of the goto jumps in this code?
## Author's comments:
### What is this?
This entry reads from standard input a list of words in any language
written with an alphabet and outputs perfect pangrams composed of these
words. A perfect pangram is a series of words that contains every letter
of the alphabet exactly once.
Example execution:
$ grep .. /usr/share/dict/words | ./prog abcdefghijklmnopqrstuvwxyz
qualm fjord wiz pyx vs beg kc nth
quartz jinx vs fed kc womb glyph
quid jamb fez vs pyx kc growl nth
quiz fjord pyx vs bag kc mewl nth
quiz fjord pyx vs gab kc mewl nth
quiz fjord pyx vs gem bawl kc nth
quiz fjord pyx vs meg bawl kc nth
quiz fjord vex bawl kc gyp ms nth
quiz jamb flex vs kc gyp word nth
quiz jamb pyx vs kc flog drew nth
quiz jamb pyx vs kc frog lewd nth
quiz jamb pyx vs kc frog weld nth
quiz jamb pyx vs kc golf drew nth
quiz jamb pyx vs kc grew fold nth
quiz jamb pyx vs kc grow nth fled
quiz jamb pyx vs kc growl fed nth
Rearranging the words into more or less meaningful expressions is the
user's duty.
This entry implements Algorithm X (_Exact cover via dancing links_)
from Section 7.2.2.1 in fascicle 5C of _The Art of Computer
Programming_ by Donald E. Knuth. As of June 2019, you can download an
incomplete draft of the fascicle from [Knuth's
website](https://www-cs-faculty.stanford.edu/~knuth/fasc5c.ps.gz).
### Usage
**`prog LETTERS [N]`**
reads allowed words from standard input and writes perfect pangrams
with at most _N_ words composed of _LETTERS_ to standard output, one
per line. If _N_ is not given, the number of words in the pangrams is
unlimited. The characters in _LETTERS_ must not repeat. The length of
_LETTERS_ must not exceed 97 characters.
The exit status is 0 if no error occurred, 1 if the input word list
is too long, and 2 if the command-line arguments are missing.
### Helper scripts
**`getwords.sh LANGUAGE_CODE`**
outputs a sorted list of unique words in a given language, one per
line. It requires **`aspell`** with a dictionary for _LANGUAGE_CODE_.
**`wrapprog.py LETTERS [N]`**
works similarly to **`prog LETTERS [N]`** except that it merges
anagrams before passing them to **`prog`** and expands them in
**`prog`**'s output. This way, it finishes the job faster than
**`prog`** when there are millions of perfect pangrams. It requires a
UTF-8 locale.
**`de.sh`**, **`en.sh`**, **`fr.sh`**, **`it.sh`**, **`pl.sh`**,
and **`ru.sh`** output German, English, French, Italian, Polish,
and Russian perfect pangrams, respectively. They require a UTF-8
locale.
### Miscellaneous remarks
I dare submit this entry to categories _algorithms_,
_internationalization_ (I have not found active use of **``**
among the programs that won IOCCC in the past), and _obscure bugs_ (see
the questions about Linux core dumps below).
To a casual eye, this entry may look similar to
[2005 klausler](https://www.ioccc.org/years.html#2005_klausler).
However, **`prog`** is internationalized and way faster. On my machine,
**`./klausler abcdefghijklmnopqrstuvwxyz`** printed only 3,965 perfect
pangrams until I killed it after 144 hours running, while the
equivalent
**`tr A-Z a-z < /usr/share/dict/words | egrep -v '^[^ais]$' | ./prog abcdefghijklmnopqrstuvwxyz | wc -l`**
finished in 62 minutes, reporting 1,640,569,315 perfect pangrams, and
the example from section [What is this?](#wit) finished in 1.35
seconds. On the other hand, **`prog`** can only output series of words
with nonrepeating characters, unlike **`klausler`**.
In contrast to **`prog.orig.c`**, **`prog.c`** does not call
**`fflush(stdout)`** after outputting each line, thus running faster.
I am grateful to Witold Jarnicki for suggesting this change.
The original **`prog`** finished the example above in 100 minutes
instead of 62 minutes.
If your word list contains meaningless one-letter words, pipe
**`grep .. [FILE]`** through **`prog`** to eliminate the pangrams that
contain them.
According to **`prog`** and **`aspell`**, some languages, e.g. Spanish,
have no perfect pangrams.
Why does **`yes | ./prog y`** hang on Linux? Change **`static int`** to
**`int`** and **`static wchar_t`** to **`wchar_t`**, and recompile
**`prog`**. Why does **`yes | ./prog y`** dump core now? Change the line
**`#define P 9<<23^1`** to **`#define P 9<<23`** and recompile
**`prog`**. Why does **`yes | ./prog y`** exit gracefully?
Answer: Nf bs Whar 2019, guhf znavsrfg ohtf va **`tyvop`** 2.22--2.29:
[1](https://sourceware.org/bugzilla/show_bug.cgi?id=20568) naq
[2](https://sourceware.org/bugzilla/show_bug.cgi?id=20632).
The source code uses **`goto`** three times. While it could easily get
by without **`goto M`** and **`goto H`**, I challenge the adherents of
structured programming to refactor **`goto T`**, which jumps back into a
nested **`if`** block.
With the supplied **`Makefile`**, both **`gcc`** and **`clang`** compile
**`prog.c`** without warnings in C11 and C99 mode. For a clean
compilation with **`gcc -std=c90`**, add **`-Wno-format`** to
**`CSILENCE`** in **`Makefile`**.
--------------------------------------------------------------------------------