Access the full text.
Sign up today, get DeepDyve free for 14 days.
[Extracting information from both Web and natural language documents is the central step in knowledge graph construction, since it is the first line of attack in going from a corpus that is not machine-understandable or queryable to a semi-structured corpus that can be queried and reasoned over. Wrapper induction techniques were developed early in the Web community to deal with the special problem of extracting information from webpages and web templates. However, wrapper induction is not enough. Many key attributes need to be extracted directly from text using information extraction algorithms developed in the natural language processing community. This is also true in cases where the raw data is not from the Web, but is a corpus of natural language documents to begin with. Therefore, we also cover some established research on information extraction, including named entity recognition, relation extraction and event extraction. While the first of these has been around for quite some time, the last is a relatively novel research area where improving quality continues to be a challenge.]
Published: Mar 5, 2019
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.