Abstract
We describe a novel search engine for scientific literature. The system supports sentence-level search starting from Portable Document Format (PDF) files and integrates text and image search, thus facilitating the retrieval of information present in tables and figures. It lets the user intuitively build complex queries for search terms that are related through particular grammatical (and thus implicitly semantic) relations. The system uses grid processing to parallelise the analysis of large numbers of scientific papers. It is currently undergoing user evaluation, but we report a preliminary evaluation and a comparison with Google Scholar that demonstrate its utility. Finally, we discuss future work and the system's potential for, and complementarity to, patent search.
Original language | English |
---|---|
Title of host publication | Current Challenges in Patent Information Retrieval |
Editors | K Mayer, J Tait |
Publisher | Springer |
Pages | 329-342 |
Number of pages | 14 |
ISBN (Print) | 3642192300, 978-3642192302 |
Publication status | Published - 2011 |