Article,

Rule-based Information Extraction for Airplane Crashes Reports

S. H.Alkadi.
International Journal of Computational Linguistics (IJCL), 8 (1): 1-36 (April 2017)

Abstract

Over the last two decades, the internet has gained a widespread use in various aspects of everyday living. The amount of generated data in both structured and unstructured forms has increased rapidly, posing a number of challenges. Unstructured data are hard to manage, assess, and analyse in view of decision making. Extracting information from these large volumes of data is time-consuming and requires complex analysis. Information extraction (IE) technology is part of a text-mining framework for extracting useful knowledge for further analysis. Various competitions, conferences and research projects have accelerated the development phases of IE. This project presents in detail the main aspects of the information extraction field. It focused on specific domain: airplane crash reports. Set of reports were used from 1001 Crash website to perform the extraction tasks such as: crash site, crash date and time, departure, destination, etc. As such, the common structures and textual expressions are considered in designing the extraction rules. The evaluation framework used to examine the system's performance is executed for both working and test texts. It shows that the system's performance in extracting entities and relations is more accurate than for events. Generally, the good results reflect the high quality and good design of the extraction rules. It can be concluded that the rule-based approach has proved its efficiency of delivering reliable results. However, this approach does require an intensive work and a cycle process of rules testing and modification.

BibTeX key: halkadi2017rulebased
entry type: article
year: 2017
month: April
journal: International Journal of Computational Linguistics (IJCL)
number: 1
pages: 1-36
volume: 8
language: English
issn: 2180-1266
url: http://www.cscjournals.org/library/manuscriptinfo.php?mc=IJCL-78

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{halkadi2017rulebased, abstract = {Over the last two decades, the internet has gained a widespread use in various aspects of everyday living. The amount of generated data in both structured and unstructured forms has increased rapidly, posing a number of challenges. Unstructured data are hard to manage, assess, and analyse in view of decision making. Extracting information from these large volumes of data is time-consuming and requires complex analysis. Information extraction (IE) technology is part of a text-mining framework for extracting useful knowledge for further analysis. Various competitions, conferences and research projects have accelerated the development phases of IE. This project presents in detail the main aspects of the information extraction field. It focused on specific domain: airplane crash reports. Set of reports were used from 1001 Crash website to perform the extraction tasks such as: crash site, crash date and time, departure, destination, etc. As such, the common structures and textual expressions are considered in designing the extraction rules. The evaluation framework used to examine the system's performance is executed for both working and test texts. It shows that the system's performance in extracting entities and relations is more accurate than for events. Generally, the good results reflect the high quality and good design of the extraction rules. It can be concluded that the rule-based approach has proved its efficiency of delivering reliable results. However, this approach does require an intensive work and a cycle process of rules testing and modification.}, added-at = {2018-12-14T08:25:47.000+0100}, author = {H.Alkadi, Sarah}, biburl = {https://www.bibsonomy.org/bibtex/25c3cf29fc22a05d3d41b0702c6c180d9/cscjournals}, interhash = {8ac3576153ff48ecd6ddec9394e9a3fa}, intrahash = {5c3cf29fc22a05d3d41b0702c6c180d9}, issn = {2180-1266}, journal = {International Journal of Computational Linguistics (IJCL)}, keywords = {Airplane Crashes, Extraction, Information Mining, NLP, Rule-Based Text}, language = {English}, month = {April}, number = 1, pages = {1-36}, timestamp = {2018-12-14T08:25:47.000+0100}, title = {Rule-based Information Extraction for Airplane Crashes Reports}, url = {http://www.cscjournals.org/library/manuscriptinfo.php?mc=IJCL-78}, volume = 8, year = 2017 }

BibSonomy

Rule-based Information Extraction for Airplane Crashes Reports

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on