Research Paper | Computer Science and Information Technology | India | Volume 8 Issue 4, April 2019
Punctuation and Capitalization Restoration using Bi-LSTM Network
Ashish Bansal
Abstract: In this paper we present a bidirectional long short-term memory (BLSTM) network with a linear conditional random field (CRF) layer to restore punctuation and capitalization in unsegmented speech transcripts. Our approach restores both punctuation and capitalization using a single model. It is a purely lexical approach that takes pre-trained GloVe embeddings as input. We treat the task as a sequence tagging problem in which the input is a sequence of unpunctuated, uncapitalized words and the output is the corresponding sequence of punctuated, capitalized words. We demonstrate that the accuracy of the proposed model is competitive with state-of-the-art models while handling both tasks in a single model.
Keywords: Punctuation prediction, Capitalization restoration, Neural network, true-casing, sentence segmentation
Edition: Volume 8 Issue 4, April 2019
Pages: 2020 - 2025
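Illustrative sketch: the sequence tagging formulation described in the abstract can be pictured as pairing each lowercased, unpunctuated word with one label that encodes both its casing and the punctuation mark that follows it. The Python below is a minimal sketch of that idea only. The label names (`CAP`, `LOWER`, `ALLCAPS`, `COMMA`, `PERIOD`, `QUESTION`, `O`), the joint `case+punct` scheme, and the helper names `to_tagged_sequence` and `restore` are assumptions made for illustration; the abstract does not specify the paper's actual label set or preprocessing.

```python
import re

# Hypothetical punctuation label set; the paper's actual labels are not given.
PUNCT = {",": "COMMA", ".": "PERIOD", "?": "QUESTION"}
INV_PUNCT = {label: mark for mark, label in PUNCT.items()}


def to_tagged_sequence(text):
    """Turn punctuated, cased text into (lowercased word, label) pairs."""
    pairs = []
    for token in text.split():
        # Split off at most one trailing punctuation mark.
        match = re.match(r"^(.*?)([,.?]?)$", token)
        word, mark = match.group(1), match.group(2)
        if not word:
            continue  # skip tokens that are punctuation only
        if word.isupper() and len(word) > 1:
            case = "ALLCAPS"
        elif word[0].isupper():
            case = "CAP"
        else:
            case = "LOWER"
        pairs.append((word.lower(), f"{case}+{PUNCT.get(mark, 'O')}"))
    return pairs


def restore(pairs):
    """Apply labels to lowercased words to rebuild cased, punctuated text."""
    out = []
    for word, label in pairs:
        case, punct = label.split("+")
        if case == "ALLCAPS":
            word = word.upper()
        elif case == "CAP":
            word = word[0].upper() + word[1:]
        out.append(word + INV_PUNCT.get(punct, ""))
    return " ".join(out)


pairs = to_tagged_sequence("Hello world, how are you?")
print(pairs)
print(restore(pairs))
```

In a setup like this, a tagger would be trained to predict the label column from the word column, and `restore` would be applied to its predictions. Mixed-case words such as "iPhone" are not representable in this simplified label scheme.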