Comp Sci Creating index vs adding document to index differences

shivajikobardan · Jul 16, 2022

These are the steps of indexing in Lucene given in our syllabus:

The first step says that it is creating an index, whereas the last step says that it's adding a document to the index.
What's the difference between these two? Can I get an example?

Here's what I think should happen:
1) Collect all the words from each document and list them like this:

doc1=>word1,word2,WORD3….wordn
doc2=>word1,WORD2,word3….wordn
And so on.

2) Analyse the words: remove various kinds of words and process the rest, as the analyzer dictates.

Say what remains now is:
doc1=>word1,word3,...word(n-1)
doc2=>word2,...word(n-3)

3) Done. Now you can also build an inverted index by inverting this mapping.
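The three steps above can be sketched in a few lines of plain Python. This is only a toy model of the idea, not how Lucene implements it: the stop-word list, the `analyze` function, and the sample documents are all made up for illustration, and a real analyzer does considerably more.

```python
from collections import defaultdict

# Hypothetical stop-word list; a real analyzer is configurable.
STOP_WORDS = {"the", "a", "is", "of"}


def analyze(text):
    """Step 2: lowercase, split on whitespace, and drop stop words."""
    return [word for word in text.lower().split() if word not in STOP_WORDS]


def build_inverted_index(docs):
    """Step 3: map each surviving term to the sorted ids of documents containing it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in analyze(text):
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


# Step 1: the raw documents (made-up sample text).
docs = {
    "doc1": "The quick brown fox",
    "doc2": "A quick look at the index",
}

index = build_inverted_index(docs)
print(index["quick"])
```

The point of the inversion is that a lookup goes from a term to the documents that contain it, rather than from a document to its terms.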

But it's actually done a bit differently, and I'm not 100% clear how.

pbuk · Jul 16, 2022

shivajikobardan said:

whereas the last step says that it's adding a document to the index.

No, it doesn't. What you are calling the "last" step simply creates a document; adding it to the index is another step.
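To make the distinction concrete, here is a minimal sketch in plain Python that keeps the three actions separate: creating an (empty) index, creating a document, and adding that document to the index. `ToyIndex` and its methods are hypothetical names invented for this illustration. In Lucene itself the corresponding roles are played, roughly, by `IndexWriter`, `Document`, and `IndexWriter.addDocument`; see the API documentation for the real signatures.

```python
class ToyIndex:
    """A toy stand-in for an index: stored documents plus term postings."""

    def __init__(self):
        self.documents = []  # stored documents, position = document id
        self.postings = {}   # term -> set of document ids

    def add_document(self, doc):
        """Store the document and record each of its terms in the postings."""
        doc_id = len(self.documents)
        self.documents.append(doc)
        for field_text in doc.values():
            for term in field_text.lower().split():
                self.postings.setdefault(term, set()).add(doc_id)
        return doc_id


# Creating the index: it exists now, but it is empty.
index = ToyIndex()

# Creating a document: the index is untouched by this.
doc = {"title": "Lucene basics", "body": "Lucene builds an inverted index"}
count_before_add = len(index.documents)

# Adding the document to the index: only now does the index change.
doc_id = index.add_document(doc)
count_after_add = len(index.documents)
```

Here `count_before_add` is 0 and `count_after_add` is 1, which is the whole difference: constructing a document does nothing to the index until it is explicitly added.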

shivajikobardan said:

What's the difference between these two? Can I get an example?

Can you get an example of the difference between creating a thing and adding something to that thing? Are you serious?

shivajikobardan said:

Here's what I think should happen:
...

This is all done behind the scenes. You don't have to worry about any of this to use Lucene; you just need to learn how to use the API. A good place to learn that is the API documentation itself: https://lucene.apache.org/core/9_2_0/core/index.html
