International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 0 | Views: 17

Research Paper | Computer Science and Information Technology | India | Volume 8 Issue 4, April 2019 | Rating: 3.2 / 10


Punctuation and Capitalization Restoration using Bi-LSTM Network

Ashish Bansal


Abstract: In this paper we present Bi-Directional Long short-term memory (BLSTM) network with linear Conditional Random Field (CRF) to restore punctuation and capitalization in an unsegmented speech transcript. Our approach will restore both punctuation and capitalization using a single model. This is purely lexical based approach with pre trained glove embeddings as an input. The task was treated as a sequence tagging problem where the input is sequence of un- punctuated and un-capitalized words, and the output is a corresponding sequence of punctuated and capitalized words. We demonstrated the accuracy of proposed model are competitive with the state-of-the-art models and can do both the tasks in a single model.


Keywords: Punctuation prediction, Capitalization restoration, Neural network, true-casing, sentence segmentation


Edition: Volume 8 Issue 4, April 2019


Pages: 2020 - 2025



How to Download this Article?

Type Your Valid Email Address below to Receive the Article PDF Link


Verification Code will appear in 2 Seconds ... Wait

Top