International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064

Downloads: 0 | Views: 56

Research Paper | Computer Science and Information Technology | India | Volume 8 Issue 4, April 2019 | Rating: 4.1 / 10


Punctuation and Capitalization Restoration using Bi-LSTM Network

Ashish Bansal [2]


Abstract: In this paper we present Bi-Directional Long short-term memory (BLSTM) network with linear Conditional Random Field (CRF) to restore punctuation and capitalization in an unsegmented speech transcript. Our approach will restore both punctuation and capitalization using a single model. This is purely lexical based approach with pre trained glove embeddings as an input. The task was treated as a sequence tagging problem where the input is sequence of un- punctuated and un-capitalized words, and the output is a corresponding sequence of punctuated and capitalized words. We demonstrated the accuracy of proposed model are competitive with the state-of-the-art models and can do both the tasks in a single model.


Keywords: Punctuation prediction, Capitalization restoration, Neural network, true-casing, sentence segmentation


Edition: Volume 8 Issue 4, April 2019,


Pages: 2020 - 2025

Rate this Article


Select Rating (Lowest: 1, Highest: 10)

5

Your Comments

Characters: 0

Your Full Name:


Your Valid Email Address:


Verification Code will appear in 2 Seconds ... Wait

Top