Since programmers often build task-specific tools, one way to make them more productive is to give them better tool-making tools. When tools take the form of program generators, this idea leads to libraries for creating languages that are directly extensible. Programmers may even be encouraged to think about a problem in terms of a language that would better support the task. This approach is sometimes called language-oriented programming
Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
Advanced Email Parser (AEP) is an email data extraction utility. It offers extensive features to help you get every bit of information from your emails easily and painlessly. No more copy and paste or drag and drop between the many opened applications. Th
ANTLR (ANother Tool for Language Recognition) is a parser and translator generator tool that lets one define language grammars in either ANTLR syntax (which is YACC and EBNF(Extended Backus-Naur Form) like) or a special AST(Abstract Syntax Tree) syntax. ANTLR can create lexers, parsers and AST's. ANTLR is more than just a grammar definition language however, the tools provided allow one to implement the ANTLR defined grammar by automatically generating lexers and parsers (and tree parsers) in either Java (http://java.sun.com/, C++ (http://anubis.dkuug.dk/jtc1/sc22/wg21/ or Sather (http://www.icsi.berkeley.edu/~sather/.
R. Kate, and R. Mooney. Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers, page 81--84. Rochester, New York, Association for Computational Linguistics, (April 2007)
E. Charniak. Proceedings of the First Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2000), April 29 - May 04, 2000, Seattle, Washington, page 132--139. Morgan Kaufmann Publishers, San Francisco, CA, USA, (2000)
D. Reis, P. Golgher, A. Silva, and A. Laender. WWW '04: Proceedings of the 13th international conference on World Wide Web, page 502--511. New York, NY, USA, ACM Press, (2004)
E. Charniak. Proceedings of the First Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2000), page 132--139. Seattle, Washington, Morgan Kaufmann Publishers, San Francisco, CA, USA, (April 2000)
M. Collins. Proceedings of the 34th annual meeting on Association for Computational Linguistics, page 184--191. Stroudsburg, PA, USA, Association for Computational Linguistics, (1996)
R. Swanson, and A. Gordon. Proceedings of the Joint Conference of the International Committee on Computational Linguistics and the Association for Computational Linguistics, page 17-21. Sydney, Australia, (July 2006)
S. Clark, and J. Curran. ACL '04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, page 103. Morristown, NJ, USA, Association for Computational Linguistics, (2004)
R. Ge, and R. Mooney. Proceedings of the Ninth Conference on Computational Natural Language Learning (CoNLL-2005), page 9--16. Ann Arbor, Michigan, Association for Computational Linguistics, (June 2005)