Indexing a lot of text/pdf files

  • Thread starter Thread starter ice109
  • Start date Start date
  • Tags Tags
    files
Click For Summary
SUMMARY

This discussion focuses on indexing approximately 10 gigabytes of PDF and other article formats for efficient word and phrase searching. Key tools mentioned include Spotlight for Mac OS X, Google Desktop available across Mac OS X, Linux, and Windows, and Windows Desktop Search, which may require an IFilter installation. The conversation highlights the necessity of using appropriate desktop search engines to achieve effective indexing of large text files.

PREREQUISITES
  • Familiarity with PDF file formats
  • Understanding of desktop search engines
  • Knowledge of IFilters for Windows
  • Basic navigation of Mac OS X and Windows operating systems
NEXT STEPS
  • Research the capabilities of Google Desktop for indexing various file formats
  • Explore the installation and configuration of IFilters for Windows Desktop Search
  • Investigate alternative desktop search engines that support extensive file indexing
  • Learn about the performance and limitations of Spotlight in Mac OS X
USEFUL FOR

This discussion is beneficial for individuals looking to optimize file searching capabilities, including software developers, data analysts, and anyone managing large collections of documents in various formats.

ice109
Messages
1,708
Reaction score
6
i have about 10gigs of pdfs and other format articles i would like to index so i can search for a word or a phrase in them. anyone know of a program that does this?
 
Computer science news on Phys.org

Similar threads

  • · Replies 16 ·
Replies
16
Views
6K
  • · Replies 36 ·
2
Replies
36
Views
5K
  • · Replies 15 ·
Replies
15
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 27 ·
Replies
27
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 8 ·
Replies
8
Views
8K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 14 ·
Replies
14
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K