Web 0.1: Weather, live football ticker, TV listings: 16 million Germans press their remote control every day to get information from teletext (Videotext). (tags: tv text)
Deutscher Wortschatz contains data generated from newspapers and web resources that are publicly available. The data were collected per language and encompass statistics about co-occurrences of words in randomly selected sentences.
A. Razavi, S. Matwin, D. Inkpen, and A. Kouznetsov. ICDMW '09: Proceedings of the 2009 IEEE International Conference on Data Mining Workshops, page 471--476. Washington, DC, USA, IEEE Computer Society, (2009)
M. Richardson, A. Prakash, and E. Brill. Proceedings of the 15th international conference on World Wide Web, page 707--715. Edinburgh, Scotland, ACM Press, (May 2006)