The word lists compiled here are based on the words lists available at http://www.isc.ro/en/commands/lists.html.
To create the TWL06 lists, for example, the complete list was unzipped, and then the following commands were used to extract the two, three, and four letter words:
grep -E '^.{2}\W$' TWL06.txt > 2_TWL06.php
grep -E '^.{3}\W$' TWL06.txt > 3_TWL06.php
grep -E '^.{4}\W$' TWL06.txt > 4_TWL06.php
The php files were opened using a text editor that provides search-and-replace using regular expressions. The following replacements were made in each file:
Find: (\w)
Replace with: <span class="letter $1">$1</span>
Find: \n
Replace with: </div>\n<div class="word">
The resulting markup was cleaned up, and header and footer includes were added to each file.
You can view the source code.
Chris Snyder
New York City
December 25, 2007