Automatic extraction and processing of document references: A CRF-based approach

Kathrin Eichler



Master's thesis from 2007 in the subject Computer Science - Applied, grade: 1.0, University of Sunderland (School of Computing and Technology), language: English, comment: The thesis was awarded the grade "with distinction".

Abstract: While reading documents, you often encounter passages advising you to refer to other documents for more information about a specific topic. Such references are particularly common in technical documents, which are written to give the reader as much relevant information as possible without rephrasing information that can be found elsewhere. Knowing how the documents in a system are interrelated, i.e. which other documents a document refers to or is referred to by, can be extremely helpful when trying to access relevant information.

A typical example of such a "knowledge net" providing information about document relations is CiteSeer, a digital library of academic literature. For each document in the library, CiteSeer displays lists of related documents, such as the documents that the current document cites and the documents that cite it. The assumption that inspired this thesis is that such lists are helpful not only when reading academic literature but could also assist a reader of technical documents stored in a company's document management system.

The idea was thus to extend an existing document management system by displaying, for each document stored in the system, a list of links to the documents it refers to. As information about how the documents in this system are interrelated was not available, the project underlying this thesis focused on the first step towards solving this task: automatically analyzing documents in order to extract the names of related documents. Once all document names mentioned in a document have been extracted, the next step would then be to search for these
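The "refers to" and "referred to by" lists described in the abstract can be pictured as a small two-way index. The following is a minimal Python sketch, not the thesis's implementation: it assumes the document names have already been extracted (the CRF-based extraction step itself is not shown), and the function name `build_reference_index` and the sample document names are hypothetical.

```python
from collections import defaultdict


def build_reference_index(extracted):
    """Build forward and reverse reference lists.

    `extracted` maps each document name to the list of document names
    that were found in its text (hypothetical input format).
    """
    refers_to = {}
    referred_by = defaultdict(set)
    for doc, mentions in extracted.items():
        # Ignore self-references and collapse repeated mentions.
        targets = {name for name in mentions if name != doc}
        refers_to[doc] = targets
        for target in targets:
            referred_by[target].add(doc)
    return refers_to, dict(referred_by)


# Hypothetical sample data standing in for the extraction output.
extracted = {
    "Installation Guide": ["Release Notes", "Admin Manual", "Release Notes"],
    "Admin Manual": ["Release Notes"],
    "Release Notes": [],
}

refers_to, referred_by = build_reference_index(extracted)
for doc in sorted(extracted):
    print(doc, "->", sorted(refers_to[doc]))
```

Keeping both directions in one pass means a document page can show its outgoing links and its incoming links without scanning the whole collection again.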


Media	Books Paperback Book (Book with soft cover and glued back)
Released	October 25, 2010
ISBN13	9783640723164
Publishers	Grin Verlag
Pages	72
Dimensions	148 × 210 × 4 mm · 118 g
Language	English
