NLTK — the Natural Language Toolkit — is a suite of open source Python modules, data and documentation for research and development in natural language processing. NLTK contains Code supporting dozens of NLP tasks, along with 40 popular Corpora and extensive Documentation including a 375-page online Book. Distributions for Windows, Mac OSX and Linux are available.
Screengrab saves entire webpages as images.
It will save what you can see in the window, the entire page, just a selection, a particular frame... basically it saves webpages as images.
AWStats is a free powerful and featureful tool that generates advanced web, streaming, ftp or mail server statistics, graphically. This log analyzer works as a CGI or from command line and shows you all possible information your log contains, in few graphical web pages. It uses a partial information file to be able to process large log files, often and quickly. It can analyze log files from all major server tools like Apache log files (NCSA combined/XLF/ELF log format or common/CLF log format), WebStar, IIS (W3C log format) and a lot of other web, proxy, wap, streaming servers, mail servers and some ftp servers.
"Mod_bandwidth" is a module for the Apache webserver that enable the setting of server-wide or per connection bandwidth limits, based on the directory, size of files and remote IP/domain.
M. Granitzer, M. Hristakeva, R. Knight, K. Jack, und R. Kern. Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics, Seite 19:1--19:8. New York, NY, USA, ACM, (2012)
D. Milne, und I. Witten. Proceeding of AAAI Workshop on Wikipedia and Artificial Intelligence: an Evolving Synergy, Seite 25--30. AAAI Press, (Juli 2008)
G. Cheng, S. Gong, und Y. Qu. Proceedings of the 10th international conference on The semantic web - Volume Part I, Seite 98--113. Berlin, Heidelberg, Springer-Verlag, (2011)
T. Elsayed, J. Lin, und D. Oard. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, Seite 265--268. Stroudsburg, PA, USA, Association for Computational Linguistics, (2008)
E. Garbin, und I. Mani. Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, Seite 363--370. Stroudsburg, PA, USA, Association for Computational Linguistics, (2005)