Is Google search a database management system?

In summary, the conversation revolves around whether Google search can be considered a database management system (DBMS). One person argues that Google simply queries a database and displays results, while the other argues that it manages and gives access to data. They also bring up the example of Facebook, which is seen as a DBMS due to its storage and management of user data. Ultimately, there is no clear answer and it depends on how one defines a database. However, it is agreed that metadata, which is data about data, is stored and searchable in databases. Some early database systems, such as ISAM and IDS, are also discussed.
  • #1
Darkstar3000
I wasn't sure where else to post this, so I posted it here:

I got into an argument with my friend about whether Google search is a DBMS. I disagreed because it doesn't fit the definition that we were given, or the one on Wikipedia and a few other websites. As far as I know, all that Google search does is query a database and give you the requested results. He, on the other hand, says that it manages data and gives the user what the user is looking for. I went on to say that, by his logic, the find function in a word processor is also a DBMS because it does almost the same thing as Google search, and he said yes, it is. He also said that Facebook is a DBMS.

To settle this argument we decided to post this question here.


Thanks in advance
 
  • #2
Nice question, if you ever find the answer please let me know.

I guess it depends on how you look at it. If you look at Google as a DBMS, then what is the database? I suppose the database is formed by all the pages written by us and spread over countless servers all over the world. On Google's servers they only keep indexes that make it easy and fast for us to find something again. Google even caches lots of pages. In my line of work as an Oracle DBA, I would say that Google has a materialized view of the many things it finds on the web, but Google does of course not use Oracle.

There is one important difference when comparing Google's DBMS with other database management systems: Oracle, SQL Server, MySQL and the rest control access and govern changes to the data, while you don't need Google's approval to change your website or access any other website, at least not yet :-) And when you make changes to your data/website, they are not directly reflected by Google's search engine either; they show up only when you are lucky, and then with a considerable delay.

Those are some huge differences from other database management systems, and they make me think that if Google is a DBMS, it is a very crappy one. It is an excellent indexing system though.

Kind regards,
Paul Karman
 
  • #3
Google search uses a database to store keywords and the web pages where they were used. Search stats are also recorded in the database to drive future searches with similar keywords. Since Google is a business, it maintains these stats in order to set the rates it charges companies that want to tie their website to keywords selected by the end user.
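To make the "keywords and the pages where they were used" idea concrete, here is a minimal sketch of an inverted index in Python. It is only an illustration of the general technique, not a description of how Google actually stores its data; the page URLs, the text, and the function names `build_index` and `search` are all made up for the example.

```python
from collections import defaultdict


def build_index(pages):
    """Map each keyword to the set of page URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index


def search(index, query):
    """Return the pages that contain every word in the query."""
    words = query.lower().split()
    if not words:
        return set()
    # Start from the pages for the first word, then intersect with the rest.
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


# Hypothetical pages; the URLs and text are invented for illustration.
pages = {
    "example.com/a": "database management systems store data",
    "example.com/b": "search engines index web pages",
    "example.com/c": "a database can index data",
}
index = build_index(pages)
print(search(index, "database data"))
```

Note that the index only stores where words occur. The pages themselves stay wherever their owners put them, which is the distinction the earlier reply draws between an indexing system and a DBMS.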

For Facebook, a database is used to store your profile information.

http://en.wikipedia.org/wiki/Google_search
 
  • #4
With respect to Google and other people's websites, Google runs a spider program that crawls through web pages to see if they've changed, recording the changes in its cache and updating its keyword index. I'm not sure how often it runs, but I'm pretty sure it scans the more heavily accessed sites more often and follows any URL references in those pages to scan further and further into the web structure.
 
  • #5
For Facebook I would agree that it has a database and a database management system. Every message, story and picture is stored on Facebook’s own servers and what you can see or enter in the database is governed by some logic that starts with checking your credentials.

For Google, however, it is a different matter, depending on how you look at it. If you see Google as an independent system into which information somehow gets entered and can then be queried, and which you can log into to find out even more, then yes, it could be considered a database.

But it seems that Google’s main existence revolves around collecting data about data, which is called metadata. And although metadata is also data, it is a bit harder to recognize it as a database; it would simply have no reason to exist if web pages did not exist. And when there is no database, there is no need for a database management system?!

Of course you can use the same arguments for Facebook where the information is within the person and Facebook just indexes information that is in the user. (Are brain bots the next step?) Anyway, this argument would draw Facebook towards an indexing service like Google where the data is stored in biological organisms. I think I draw the line between man and machine and agree that Facebook contains the data.

You might want to go the other way and decide that any information that is not actual real life is in fact metadata so any database is in principle a meta database that can use a database management system. So then I would agree that meta databases are databases too and hence so is Google.

Facebook could be an indexing service, Google could be a database, everybody is right.

Paul Karman
 
  • #6
Yes, metadata is data about data, and because it's data it needs to be stored and searchable itself, so it's placed in a database.

A database is simply an organized way of storing data, with an API to access it. Some of the earliest database systems were called ISAM databases, where data was organized into records and records were inserted into pages. Each record had a key for simple access. See http://en.wikipedia.org/wiki/Indexed_Sequential_Access_Method
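As a rough illustration of the records-in-pages-with-keys idea, here is a small Python sketch. It is an in-memory toy, not a real ISAM implementation: the page size, the sample records, and the names `build_pages` and `lookup` are assumptions made for the example.

```python
from bisect import bisect_right

PAGE_SIZE = 3  # records per page; an arbitrary value for the example


def build_pages(records):
    """Sort (key, value) records and split them into fixed-size pages."""
    ordered = sorted(records)
    pages = [ordered[i:i + PAGE_SIZE] for i in range(0, len(ordered), PAGE_SIZE)]
    index = [page[0][0] for page in pages]  # first key stored on each page
    return pages, index


def lookup(pages, index, key):
    """Use the index to pick the one page that could hold `key`, then scan it."""
    pos = bisect_right(index, key) - 1
    if pos < 0:
        return None  # key is smaller than every stored key
    for k, value in pages[pos]:
        if k == key:
            return value
    return None


# Hypothetical records with integer keys.
records = [(17, "q"), (3, "c"), (8, "h"), (1, "a"), (12, "l"), (5, "e"), (20, "t")]
pages, index = build_pages(records)
print(lookup(pages, index, 12))
```

The index holds one key per page, so a lookup touches the small index plus a single page instead of scanning every record.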

Another early design was the IDS (Integrated Data Store) database, originally conceived by Charles Bachman. These databases used linked lists of data with cross links to other rings. Programmers had to be familiar with how the data was organized in order to traverse it and retrieve information. It was very fast, but saving a database to tape storage and reloading it for load balancing required a specialized program to traverse your database.

Later, IBM developed relational databases and the SQL language. Here data was organized into tables with associated index tables of keys. This organization wasn't as fast as IDS, but it was easier to back up and reload as needed, using a program provided by the vendor. In a sense it was a cross between ISAM and IDS database technology, and much easier for non-programmers to use.

Google uses a programming model called MapReduce to process its data across many machines, which together act as a form of massive distributed database. Their 2008 stats were described in the article below, so you can imagine where they are now:

http://highscalability.com/google-architecture/
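For readers who haven't seen MapReduce, here is a minimal single-machine sketch of the idea in Python, using the classic word-count example. It only shows the map, shuffle, and reduce steps; the real system distributes them over many servers, and the document names and function names below are invented for illustration.

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Emit a (word, 1) pair for every word in one document."""
    for word in text.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all the values for one key into a single result."""
    return key, sum(values)


def map_reduce(documents):
    pairs = [
        pair
        for doc_id, text in documents.items()
        for pair in map_phase(doc_id, text)
    ]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


# Hypothetical documents.
documents = {"d1": "google indexes the web", "d2": "the web is large"}
counts = map_reduce(documents)
print(counts["web"])
```

Because each call to `map_phase` looks at one document and each call to `reduce_phase` looks at one key, both steps can be spread over as many machines as needed.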
 

1. Is Google search considered a database management system?

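It depends on the definition. Google search stores, indexes, and retrieves large amounts of data in an organized and efficient manner, which are functions a database management system (DBMS) also performs. However, users can only query it; they cannot store or modify data in it directly, which is why some do not consider it a DBMS.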

2. What are the main features of Google search as a DBMS?

Some of the main features of Google search as a DBMS include data storage, data retrieval, data manipulation, data security, and data backup and recovery.

3. How does Google search handle large amounts of data?

Google search uses a distributed database system to handle large amounts of data. This means that the data is stored and managed across multiple servers, allowing for faster retrieval and improved scalability.

4. Can users access and manipulate data in Google search?

No, users cannot directly access or manipulate data in Google search. The data is managed and controlled by Google's algorithms and processes, which determine the most relevant results for a given search query.

5. How does Google search ensure data security?

Google search has various security measures in place to ensure data security, including encryption, firewalls, access controls, and regular data backups. Additionally, Google has strict privacy policies and regularly undergoes security audits to protect user data.
