Journals
2014 EN
Shafali Goyal · Ashok Kumar Bathla
days, a vast research is going in Optical Character Recognition (OCR) of handwritten Documents in Indian scripts. A lot of handwritten data is existed in Devanagari script which is still to be recognized. Segmentation is the key step of OCR process. Segmentation is the process of extracting the valuable segments from the text document which are used in the process of recognition of characters. Line segmentation is the process of segmenting the text document into lines. Afterwards, word segmentation and character segmentation is carried out. This paper only deals with the Line segmentation of handwritten documents in Hindi. Devanagari script is the basic script to write Hindi, Marathi, Sanskrit and Nepali languages. In this paper the brief introduction of various existing techniques for segmentation of handwritten text is discussed. Also, develops an algorithm for segmentation of skewed lines, touching lines present in the text document and broken parts in upper modifiers or space present between the upper modifiers. This algorithm is implemented on large database collected from various writers. The proposed algorithm integrated the Projection based method, gap detection between text lines and neighbor pixel analysis method.
Foundation of Computer Science
Journals
2014 EN
Nemir AhmedAl-Azzawi
Foundation of Computer Science
Journals
2014 EN
Nikita Munot · Sharvari Govilkar
Text summarization is one of application of natural language processing and is becoming more popular for information condensation. Text summarization is a process of reducing the size of original document and producing a summary by retaining important information of original document. This paper gives comparative study of various text summarization methods based on different types of application. The paper discusses in detail two main categories of text summarization methods these are extractive and abstractive summarization methods. The paper also presents taxonomy of summarization systems and statistical and linguistic approaches for summarization.
Foundation of Computer Science
Journals
2014 EN
Deepthi Maryala · J. Krishna Chaithanya · T. Ramashri
Foundation of Computer Science
Journals
2014 EN
K. Banu · N.Rama N.Rama
Foundation of Computer Science
Journals
2014 EN
Khushboo Gulati · NARENDER NARENDER
Foundation of Computer Science
Journals
2014 EN
Snehdeep Snehdeep · Manoj Kumar
Text line segmentation is extremely important phase of OCR. Overlapped lines, skewed lines and connected components make the problem of line segmentation more complicated in Gurumukhi handwritten documents. The existence of these problems in handwritten text documents declines the performance of OCR system. In this paper, we present a technique to solve these problems. The proposed algorithm is based on mid-point detection. The algorithm deals with these problems and gives effective results 90% in case of overlapped lines and 94% accurate results for segmentation of connected components between neighboring lines. This paper also provides a review on major problems in line segmentation that decreases the accuracy of recognition system. The proposed method has achieved 93.05% accuracy in text line segmentation.
Foundation of Computer Science
Journals
2014 EN
Sharon Dominick · T. Abdul Razak
Foundation of Computer Science
Journals
2014 EN
Anjali Varshney · Dinesh Goyal
Foundation of Computer Science
Journals
2014 EN
Gurjot Kaur · Ishpreet Singh Virk
Foundation of Computer Science