Is google search a Database management system ?

Click For Summary

Discussion Overview

The discussion centers around whether Google Search can be classified as a database management system (DBMS). Participants explore definitions, functionalities, and comparisons with other systems like Facebook, considering both theoretical and practical implications.

Discussion Character

  • Debate/contested
  • Conceptual clarification
  • Technical explanation

Main Points Raised

  • One participant argues that Google Search does not fit the definition of a DBMS as it merely queries a database rather than managing it.
  • Another participant suggests that Google can be seen as a DBMS if one considers the database to consist of all web pages indexed across servers, although they note significant differences in access control compared to traditional DBMS.
  • It is mentioned that Google maintains statistics related to search queries, which could imply some level of data management.
  • A participant describes Google's spider program that crawls and updates web pages, contributing to its indexing capabilities.
  • Some participants propose that while Facebook clearly operates as a DBMS, Google’s role is more complex and may not strictly qualify as such, especially when considering the nature of metadata.
  • There is a suggestion that both Google and Facebook could be viewed as indexing services, with the distinction that Facebook contains user-generated data while Google primarily indexes existing web content.
  • One participant discusses the evolution of database systems, mentioning Google's use of algorithms like MapReduce, which could support the argument for it being a form of distributed database.

Areas of Agreement / Disagreement

Participants do not reach a consensus on whether Google Search qualifies as a DBMS. Multiple competing views are presented, with some arguing for its classification and others against it, highlighting the complexity of the definitions involved.

Contextual Notes

Participants express uncertainty regarding the definitions of DBMS and the criteria that must be met for a system to be classified as such. There are also discussions about the implications of metadata and the evolving nature of data management systems.

Darkstar3000
Messages
28
Reaction score
0
Wasn't sure about where else to post this so I posted it here :

I got into an argument with my friend about google search being a DBMS, I disagreed because it doesn't fit the definition that we were given or the one on wikipedia and a few more websites. All that google search does it query a database (according to my knowledge) and gives you the requested results. He on the other hand, says that it manages data and gives the user what the user is looking for, I went forward to say that according to his logic the find function in word processing programs is also a DBMS because it's doing almost the same thing as google search, he said yes it. He also said that facebook is a DBMS.

To settle this argument we decided to post this question here.


Thanks in advance
 
Physics news on Phys.org
Nice question, if you ever find the answer please let me know.

I guess it depends on how you look at it. When you look at Google as a DBMS then what is the database? I suppose the database is formed by all pages written by us and spread over countless servers all over the world. On Google's servers they only keep indexes that make it easy and fast for us to find something back. Google even caches lots of pages, I suppose in my line of work as an Oracle DBA I would say that Google has a materialized view for the many things it finds on the web, but Google does of course not use Oracle. There is one important difference when comparing Google's DBMS with other database management systems; Oracle, SQL Server, mySQL and anything else control access and governor changes in the data while you don't need Google's approval to change your website or access any other website, at least not yet :-) And when you make changes in your data/website it is not directly reflected by Google's search engine either, only when you are lucky and then with a considerable delay. Those are some huge differences with other database management systems that make me think that if Google is a DBMS it is a very crappy one. It is an excellent indexing system though.

Kind regards,
Paul Karman
 
Google search uses a database to store keywords and the web page where they were used. Also search stats are recorded in the database to drive future searches with similar keywords. Since Google is business it maintains these stats in orde so set search rates where companies want to tie their website to keywords selected by the end user.

For Facebook, a database is used to store your profile information.

http://en.wikipedia.org/wiki/Google_search
 
With respect to google and other peoples websites, google runs a spider program that crawls thru webpages to see if they've been changed and recording the changes in its cache while updating its keyword search. I not sure how often it works but I am pretty sure it scans the more heavily accessed sites more often and uses any url references in those webpages to scan further and further into the web structure.
 
For Facebook I would agree that it has a database and a database management system. Every message, story and picture is stored on Facebook’s own servers and what you can see or enter in the database is governed by some logic that starts with checking your credentials.

For Google however it is a different matter depending on how you look at Google. If you see Google as an independent system in which somehow information gets entered that then can be queried and you can log into find out even more the yes, it could be considered a database.

But it seems that Google’s main existence revolves around collecting data about data, that is called metadata. And although metadata is also data I it is a bit harder to recognize it as a database, it simply has no reason for existence if web pages did not exist. And when there is no database there is no need for a database management system?!

Of course you can use the same arguments for Facebook where the information is within the person and Facebook just indexes information that is in the user. (Are brain bots the next step?) Anyway, this argument would draw Facebook towards an indexing service like Google where the data is stored in biological organisms. I think I draw the line between man and machine and agree that Facebook contains the data.

You might want to go the other way and decide that any information that is not actual real life is in fact metadata so any database is in principle a meta database that can use a database management system. So then I would agree that meta databases are databases too and hence so is Google.

Facebook could be an indexing service, Google could be a database, everybody is right.

Paul Karman
 
Yes, metadata is data about data and because its data it needs to be stored and searchable itself and so its placed in a database.

A database is simply an organized way of storing data using an api to access it. Some of the earliest of database systems were called ISAM databases where data was organized into records and records were inserted into pages. Each record had a key for simple access. See http://en.wikipedia.org/wiki/Indexed_Sequential_Access_Method.

Following ISAM were IDS databases originally concieved by Charles Bachmann. These databases used linked lists of data with cross links to other rings. Programmers had to be familiar with how the data was organized in order to traverse it to retrieve information. It was very fast but saving a database to tape storage and reloading it for load balancing required a specialized program to traverse your database.

Later IBM developed relational databases and the SQL language. Here data was organized into tables with associated index tables of keys. This organization wasn't as fast as IDS but it was easier to backup and reload as needed by a program provided by the vendor. In a sense it was a cross between ISAM and IDS database technology and much easier to use by non-programmers.

Google uses an algorithm called MapReduce which integrates its data together into a form of massive distributed database. Their 2008 stats were described in the article below so you can imagine where they are now:

http://highscalability.com/google-architecture/
 
Last edited:

Similar threads

  • · Replies 13 ·
Replies
13
Views
3K
Replies
4
Views
4K
Replies
8
Views
2K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 1 ·
Replies
1
Views
4K
Replies
1
Views
2K
Replies
18
Views
2K
  • · Replies 28 ·
Replies
28
Views
6K
  • · Replies 1 ·
Replies
1
Views
8K