Open Access News

News from the open access movement


Monday, June 15, 2009

CiteSeerX now searches within tables

Lee Giles, CiteSeerX indexes tables, posted to American Scientist Open Access Forum, June 11, 2009.

CiteSeerX now provides indexing and ranking of tables in documents. This new feature will soon be released in open source as part of the CiteSeerX open source project. Currently, nearly a million tables are indexed.

In addition, a demo of the data extraction from tables in pdf files can be found at [link] ...

See also this blog post by Pradeep Teregowda:
... Table search allows users to search embedded tables of documents in the CiteSeerx collection. Table caption, reference text and footnotes are indexed for each table. Ranking of table search results can be based on relevance, year and the number of citations to the corresponding document in CiteSeerx.