The Datawrangling blog was put on the back burner last May while I focused on my startup. Now that I have some bandwidth again, I am getting back to work on several pet projects (including the Amazon EC2 Cluster).
(2000) Sun Le, Jin Youbing, Du Lin, & Sun Yufang: Automatic extraction of English-Chinese term lexicons from noisy bilingual corpora. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 751-755. [PDF, 128KB]
As the use of a Bayesian probability calculation on a simple co-occurrence frequency table created from the same data has similar disambiguation capabilities, the paper also incorporates comparison of LSA with the Bayesian model.
H. Halpin, V. Robu, и H. Shepherd. WWW '07: Proceedings of the 16th international conference on World Wide Web, стр. 211--220. New York, NY, USA, ACM, (2007)
S. Oldenburg, M. Garbe, и C. Cap. SSM '08: Proceeding of the 2008 ACM Workshop on Search in Social Media, стр. 11--18. New York, NY, USA, ACM, (октября 2008)
F. Sánchez-Martínez, M. Forcada, и A. Way. Proceedings of the 3rd Workshop on Example-Based Machine Translation, стр. 11--18. Dublin, Ireland, Centre for Next Generation Localisation (CNGL), (2009)
S. Noël, и R. Beale. Proceedings of the 22nd British CHI Group Annual Conference on HCI 2008: People and Computers XXII: Culture, Creativity, Interaction, 2, стр. 71-74. (2008)
E. Rader, и R. Wash. CSCW '08: Proceedings of the ACM 2008 conference on Computer supported cooperative work, стр. 239--248. New York, NY, USA, ACM, (2008)
P. Heymann, G. Koutrika, и H. Garcia-Molina. WSDM '08: Proceedings of the international conference on Web search and web data mining, стр. 195--206. New York, NY, USA, ACM, (2008)
M. Ames, и M. Naaman. CHI '07: Proceedings of the SIGCHI conference on Human factors in computing systems, стр. 971--980. New York, NY, USA, ACM, (2007)
C. Marlow, M. Naaman, D. Boyd, и M. Davis. HYPERTEXT '06: Proceedings of the seventeenth conference on Hypertext and hypermedia, стр. 31--40. New York, NY, USA, ACM, (2006)
M. Carman, M. Baillie, R. Gwadera, и F. Crestani. SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, стр. 123--130. New York, NY, USA, ACM, (2009)
S. Bao, G. Xue, X. Wu, Y. Yu, B. Fei, и Z. Su. WWW '07: Proceedings of the 16th international conference on World Wide Web, стр. 501--510. New York, NY, USA, ACM, (2007)
L. Muñoz, S. Rojas, и M. Rosell. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, стр. 75--82. Alicante, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, (2009)
M. Zubizarreta, F. Tyers, и G. Ramírez-Sánchez. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, стр. 3--10. Alicante, Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante, (2009)
D. Oard, D. Doermann, B. Dorr, D. He, P. Resnik, A. Weinberg, W. Byrne, S. Khudanpur, D. Yarowsky, A. Leuski и 2 other автор(ы). NAACL '03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, стр. 76--78. Morristown, NJ, USA, Association for Computational Linguistics, (2003)
C. Callison-Burch, M. Osborne, и P. Koehn. Proceedings the Eleventh Conference of the European Chapter of the Association for Computational Linguistics, стр. 249--256. Trento, Italia, (2006)
D. Marcu, и W. Wong. EMNLP '02: Proceedings of the ACL-02 conference on Empirical methods in natural language processing, стр. 133--139. Morristown, NJ, USA, Association for Computational Linguistics, (2002)
S. McNaught. Proceedings of the 3rd International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Language, (1990)