Downloads: 109 | Views: 309
Research Paper | Engineering Applications of Artificial Intelligence | India | Volume 8 Issue 6, June 2019 | Popularity: 6.8 / 10
Free Form Document Based Extraction Using ML
Mona Deshmukh, Shruti Maheshwari
Abstract: Information extraction is concerned with applying natural language processing to automatically extract required information from free form based text documents. Several machine learning techniques have been applied in order to facilitate the portability of the information extraction systems. The challenge is not just to extract data from scanned documents but also to extract it accurately. This paper describes a general method for building an information extraction system using properties such as tokenization, POS tagging, entity detection and dependency parsing along with supervised learning algorithms. In this method, the extraction decisions are lead by a set of classifiers instead of sophisticated linguistic analyses. A major problem incurred by many businesses today is insufficiency to leverage data from scanned documents and images. Whenever a business makes use of data which is to be captured from paper documents, manually entering data can impact the efficiency, system vulnerability and speed of carrying out of business. In such business cases, we need data entry automation that helps to extract data from scanned documents and automate document based business processes.
Keywords: spaCy, POS tagging, tokenization, OCR engine, open NLP
Edition: Volume 8 Issue 6, June 2019
Pages: 2165 - 2169
Make Sure to Disable the Pop-Up Blocker of Web Browser
Downloads: 200 | Views: 325
Engineering Applications of Artificial Intelligence, Morocco, Volume 8 Issue 9, September 2019
Pages: 564 - 579Heart Disease Prediction: Artificial Intelligence / Machine Learning
Dr Yasin Bouanani
Downloads: 134 | Views: 448
Engineering Applications of Artificial Intelligence, Nigeria, Volume 9 Issue 10, October 2020
Pages: 38 - 43Study on Forging and Forming, Intelligent Production Robot System for an Automotive Component
Orelaja Oluseyi.A, Michael Bola Adeleke, Ishola A. Afiz, Odutayo Oladipo, Olufemi Peter Kehinde, Abiodun Olakunle Israel
Downloads: 111 | Views: 263
Engineering Applications of Artificial Intelligence, China, Volume 6 Issue 3, March 2017
Pages: 1743 - 1745Design and Implementation of a Centralized Air Conditioning Energy Saving System based on Intelligent Control System
Lisha Gao
Downloads: 108 | Views: 353
Engineering Applications of Artificial Intelligence, India, Volume 8 Issue 11, November 2019
Pages: 1100 - 1106An Evaluation for Various Text Summarization Algorithms on Blog Summarization Dataset
Shakshi Neha, Amanpreet Singh, Ishika Raj, Saveta Kumari