- Watij (pronounced wattage) stands for Web Application Testing in Java. It is a pure Java API created to allow for the automation of web applications.
- Tom Mitchell (2009): self-supervised KBP, only NPs without Entity Linking
- Approach to convert any Web data into RSS format.
- Webstemmer is a web crawler and HTML layout analyzer that automatically extracts main text of a news site without having banners, ads and/or navigation lin...Webstemmer is a web crawler and HTML layout analyzer that automatically extracts main text of a news site without having banners, ads and/or navigation links mixed up
- Towards Automatic Data Extraction from Large Web Sites
- Knowledge and Information Systems 17(1):17-33 (2008)
- 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD 2006, page 712--717. New York, NY, USA, ACM, (2006)
- KDD, page 601-606. ACM, (2003)
- Proc. of the Twentieth International Joint Conference on Artificial Intelligence IJCAI'07, Hyderabad, India, (January 2007)
- ECML/PKDD 1, volume 5211 of Lecture Notes in Computer Science, page 195-210. Springer, (2008)
- Max-Planck-Institut für Informatik, Saarbrücken, Germany, (April 2006)
- Artificial Intelligence 165(1):91 - 134 (2005)
- CIKM, page 1355-1356. ACM, (2008)
- Proc. of the 9th International Conference on Web Engineering, (2009)
- J. ACM 51(5):731--779 (2004)
- WWW '09: Proceedings of the 18th international conference on World wide web, page 971--980. New York, NY, USA, ACM, (2009)


user