
Machine learning and natural language processing on the patent corpus: Data, tools, and new measures




Publisher: Wiley
Copyright: © 2018 Wiley Periodicals, Inc.
ISSN: 1058-6407
eISSN: 1530-9134
DOI: 10.1111/jems.12259

Abstract

Drawing upon recent advances in machine learning and natural language processing, we introduce new tools that automatically ingest, parse, disambiguate, and build an updated database using U.S. patent data. The tools identify unique inventor, assignee, and location entities mentioned on each granted U.S. patent from 1976 to 2016. We describe data flow, algorithms, user interfaces, descriptive statistics, and a novelty measure based on the first appearance of a word in the patent corpus. We illustrate an automated coinventor network mapping tool and visualize trends in patenting over the last 40 years. Data and documentation can be found at https://console.cloud.google.com/launcher/partners/patents-public-data.
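The abstract mentions a novelty measure based on the first appearance of a word in the patent corpus. One way to read that idea is sketched below: process patents in date order and count, for each patent, the words that no earlier patent has used. The tokenizer, the `(patent_id, date, text)` record layout, the function names, and the toy data are all assumptions made for illustration; the abstract does not specify the authors' actual tokenization, tie-breaking, or normalization.

```python
import re
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical record layout: (patent_id, ISO date string, patent text).
Patent = Tuple[str, str, str]


def tokenize(text: str) -> List[str]:
    """Lowercase the text and keep runs of ASCII letters (an assumed, simplistic tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def new_word_counts(patents: Iterable[Patent]) -> Dict[str, int]:
    """Count, for each patent, the distinct words that appear in no earlier patent.

    Patents are processed in (date, patent_id) order, so ties on date are
    broken by id. That tie-break is an assumption of this sketch.
    """
    seen: Set[str] = set()
    counts: Dict[str, int] = {}
    for patent_id, _date, text in sorted(patents, key=lambda p: (p[1], p[0])):
        fresh = set(tokenize(text)) - seen  # words first seen in this patent
        counts[patent_id] = len(fresh)
        seen |= fresh
    return counts


# Toy corpus, deliberately listed out of date order.
toy = [
    ("P2", "1980-05-01", "A laser diode with a cooling fin"),
    ("P1", "1976-01-06", "A diode with a metal fin"),
    ("P3", "1990-03-15", "A laser diode"),
]
counts = new_word_counts(toy)
print(counts)
```

In this toy corpus the earliest patent contributes every one of its distinct words, while the last one contributes none because all of its words were already used. On a real corpus the earliest patents would look artificially novel for the same reason, so a burn-in period or some normalization would likely be needed.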

Journal

Journal of Economics & Management Strategy, Wiley

Published: Jan 1, 2018

