Girish Managoli thinks this is interesting: nltk.corpus.treebank.words() From Regular Expressions for Detecting Word Patterns from Natural Language Processing with Python by Edward Loper, Steven Bird, Ewan Klein Publisher: O'Reilly Media, Inc. Released: June 2009 Note Treebank corpus Share this highlight http://www.safaribooksonline.com/a/natural-language-processing/4936960/ Twitter Facebook Google Plus Email Get Instant Access Now Start a Free Trial Learn about Safari for Business Have an account? Sign in. Minimise Unlock the rest of Natural Language Processing with Python and 30,000 other books and videos By clicking this box, you confirm that you have read and agree to the terms and conditions of our Membership Agreement, and you understand that when your trial period ends, you will be required to provide billing information if you wish to continue using the service. Unlock the rest of this book Start a Free 10-Day Trial loading Learn about Safari for Business Have an account? Sign in.