Access the full text.
Sign up today, get DeepDyve free for 14 days.
S. Jennett (1951)
The Making of Books
M. Gorman, Paul Winkler (1967)
Anglo-American Cataloguing Rules
Neil Merwe (1993)
The integration of document image processing and text retrieval principlesThe Electronic Library, 11
S. Ranganathan, A. Neelameghan (2006)
Classified Catalogue Code: With Additional Rules for Dictionary Catalogue Code
This paper presents a methodology to capture bibliographic data from the verso of the title pages of documents. A survey has been undertaken to identify the syntactic and semantic features of bibliographic elements on the verso of title pages. These features include the font size, line numbers and appearence of certain string of characters. Emphasis is given to the study of “cataloguing‐in‐publication” data. The results of the survey are used to develop heuristics which can help in developing a program to automatically identify the various bibliogaphic data elements. The back of the title pages are scanned and stored as HTML pages using optical recognition software. The heuristics are then applied on the HTML pages. Few samples of input and the output generated are presented. Finally, the problems related to OCR and the heuristics are enumerated.
Library Hi Tech – Emerald Publishing
Published: Dec 1, 2004
Keywords: Bibliographic systems; Information operations; Data handling; Cataloguing; Classification schemes
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.