Using the Web as an Implicit Training Set: Application to Noun Compound Syntax and Semantics

P. Nakov (2019). cite arxiv:1912.01113. Comment: noun compounds, paraphrasing verbs, semantic interpretation, syntax, multi-word expressions, MWEs, noun compound interpretation, noun compound bracketing, prepositional phrase attachment, noun phrase coordination, machine translation.

Abstract

An important characteristic of English written text is the abundance of noun compounds - sequences of nouns acting as a single noun, e.g., colon cancer tumor suppressor protein. While eventually mastered by domain experts, their interpretation poses a major challenge for automated analysis. Understanding noun compounds' syntax and semantics is important for many natural language applications, including question answering, machine translation, information retrieval, and information extraction. I address the problem of noun compounds syntax by means of novel, highly accurate unsupervised and lightly supervised algorithms using the Web as a corpus and search engines as interfaces to that corpus. Traditionally the Web has been viewed as a source of page hit counts, used as an estimate for n-gram word frequencies. I extend this approach by introducing novel surface features and paraphrases, which yield state-of-the-art results for the task of noun compound bracketing. I also show how these kinds of features can be applied to other structural ambiguity problems, like prepositional phrase attachment and noun phrase coordination. I address noun compound semantics by automatically generating paraphrasing verbs and prepositions that make explicit the hidden semantic relations between the nouns in a noun compound. I also demonstrate how these paraphrasing verbs can be used to solve various relational similarity problems, and how paraphrasing noun compounds can improve machine translation.
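To make the idea of bracketing from hit counts concrete, here is a minimal sketch of the classic adjacency heuristic for a three-word noun compound: compare how often the first two words co-occur with how often the last two do. This illustrates the traditional n-gram frequency approach the abstract mentions, not the author's surface features or paraphrases. The function name `bracket_adjacency`, the `counts` mapping, and all numbers are hypothetical; in practice the counts would come from a corpus or a search engine.

```python
from typing import Mapping, Tuple

Bigram = Tuple[str, str]


def bracket_adjacency(w1: str, w2: str, w3: str,
                      counts: Mapping[Bigram, int]) -> str:
    """Choose a bracketing for the compound "w1 w2 w3".

    Returns "left" for [[w1 w2] w3] and "right" for [w1 [w2 w3]].
    `counts` maps a bigram to its frequency estimate (hypothetical data).
    """
    left_score = counts.get((w1, w2), 0)   # evidence that w1 and w2 form a unit
    right_score = counts.get((w2, w3), 0)  # evidence that w2 and w3 form a unit
    # Ties fall back to "left"; this is an arbitrary choice for the sketch.
    return "left" if left_score >= right_score else "right"


# Invented counts, for illustration only.
toy_counts = {
    ("colon", "cancer"): 900,
    ("cancer", "tumor"): 150,
}

print(bracket_adjacency("colon", "cancer", "tumor", toy_counts))
```

With these invented counts the sketch picks the left bracketing, [[colon cancer] tumor], because the first bigram is the more frequent one.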

Description

[1912.01113] Using the Web as an Implicit Training Set: Application to Noun Compound Syntax and Semantics

Links and resources

BibTeX key: nakov2019using
Entry type: misc
Year: 2019
URL: http://arxiv.org/abs/1912.01113
Note: cite arxiv:1912.01113. Comment: noun compounds, paraphrasing verbs, semantic interpretation, syntax, multi-word expressions, MWEs, noun compound interpretation, noun compound bracketing, prepositional phrase attachment, noun phrase coordination, machine translation

Cite this publication

@misc{nakov2019using,
  abstract = {An important characteristic of English written text is the abundance of noun compounds - sequences of nouns acting as a single noun, e.g., colon cancer tumor suppressor protein. While eventually mastered by domain experts, their interpretation poses a major challenge for automated analysis. Understanding noun compounds' syntax and semantics is important for many natural language applications, including question answering, machine translation, information retrieval, and information extraction. I address the problem of noun compounds syntax by means of novel, highly accurate unsupervised and lightly supervised algorithms using the Web as a corpus and search engines as interfaces to that corpus. Traditionally the Web has been viewed as a source of page hit counts, used as an estimate for n-gram word frequencies. I extend this approach by introducing novel surface features and paraphrases, which yield state-of-the-art results for the task of noun compound bracketing. I also show how these kinds of features can be applied to other structural ambiguity problems, like prepositional phrase attachment and noun phrase coordination. I address noun compound semantics by automatically generating paraphrasing verbs and prepositions that make explicit the hidden semantic relations between the nouns in a noun compound. I also demonstrate how these paraphrasing verbs can be used to solve various relational similarity problems, and how paraphrasing noun compounds can improve machine translation.},
  added-at = {2021-01-19T12:33:45.000+0100},
  author = {Nakov, Preslav},
  biburl = {https://www.bibsonomy.org/bibtex/2284e411632c6140a11e32b0f66ef51f2/parismic},
  description = {[1912.01113] Using the Web as an Implicit Training Set: Application to Noun Compound Syntax and Semantics},
  interhash = {38a9b09dce9d4432fa32f1f56680a04a},
  intrahash = {284e411632c6140a11e32b0f66ef51f2},
  keywords = {dataset semantic training web},
  note = {cite arxiv:1912.01113. Comment: noun compounds, paraphrasing verbs, semantic interpretation, syntax, multi-word expressions, MWEs, noun compound interpretation, noun compound bracketing, prepositional phrase attachment, noun phrase coordination, machine translation},
  timestamp = {2021-01-19T12:33:45.000+0100},
  title = {Using the Web as an Implicit Training Set: Application to Noun Compound Syntax and Semantics},
  url = {http://arxiv.org/abs/1912.01113},
  year = 2019
}

BibSonomy
