Request for Large Databases (100,000+ rows) to Practice with

  • Thread starter Thread starter WWGD
  • Start date Start date
  • Tags Tags
    Request
Click For Summary

Discussion Overview

The discussion revolves around the search for large databases (specifically those with 100,000 or more rows) for practicing SQL Server skills. Participants share resources, personal experiences, and opinions on what constitutes a large database, as well as the implications of database size on performance and management.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • One participant seeks free, larger-scale databases to enhance their SQL Server skills, noting their current experience is limited to small databases.
  • Another participant suggests that 100 million rows is a more appropriate threshold for a large database, indicating that perceptions of size can vary based on context.
  • Some participants mention specific databases, such as a test database on the MySQL website that contains 4 million rows across 6 tables, which is described as 'large'.
  • There are differing opinions on the suitability of various database management systems (DBMS) for handling large datasets, with one participant suggesting Oracle as the best option for 100,000 rows, while expressing concerns about handling 100 million rows.
  • Participants discuss alternative methods for obtaining data, such as using CSV files or continuous data feeds, and the potential challenges of importing large datasets all at once.
  • One participant emphasizes the importance of understanding indexing and hashing when working with large databases, noting their experience with databases containing over a billion entries.
  • Several participants recommend free versions of popular DBMSs, such as Oracle Database 11g Express Edition and Microsoft SQL Server Express, as resources for practice.

Areas of Agreement / Disagreement

Participants express a range of views on what constitutes a large database, with no consensus on a specific definition. There are also differing opinions on the best DBMS for handling large datasets and the methods for acquiring data for practice.

Contextual Notes

Participants' definitions of small and large databases appear to depend on individual experiences and the capabilities of the hardware being used. There are also unresolved discussions regarding the performance implications of different database sizes and management techniques.

WWGD
Science Advisor
Homework Helper
Messages
7,798
Reaction score
13,096
Hi All,
I am trying to up my game in SQL Server in general. Specifically, my knowledge has been obtained so far by working with very small-scale databases ( fewer than 200 rows). Does anyone know of free larger-scale databases that are available?
Thanks.
 
Technology news on Phys.org
Hmm. Before I retired, this was considered a small database. 100 million and up was getting large, but not very.
 
I don't know about 100,000 rows but there's a test database on the MySQL website here that is described as 'large'. See under the 'Example databases' heading.
 
  • Like
Likes   Reactions: WWGD
PAllen said:
Hmm. Before I retired, this was considered a small database. 100 million and up was getting large, but not very.
I guess the terms small and large depend on the machine available. With 100,000 I thought of an Oracle database to handle best, but I wouldn't like to handle 100,000,000 other than on a DB2. However, I think that even 100,000 are a big deal for MySQL. How about some csv data taken from anywhere or a continuous feed (rss I think) from a weather page and upload them piece by piece? The entire amount at once might be a disappointing experience.
 
  • Like
Likes   Reactions: WWGD
andrewkirk said:
I don't know about 100,000 rows but there's a test database on the MySQL website here that is described as 'large'. See under the 'Example databases' heading.
If you look down into the doc for that dB described as large, it is, in fact, 4 million rows spread across 6 tables.
 
Last edited:
  • Like
Likes   Reactions: WWGD
fresh_42 said:
I guess the terms small and large depend on the machine available. With 100,000 I thought of an Oracle database to handle best, but I wouldn't like to handle 100,000,000 other than on a DB2. However, I think that even 100,000 are a big deal for MySQL. How about some csv data taken from anywhere or a continuous feed (rss I think) from a weather page and upload them piece by piece? The entire amount at once might be a disappointing experience.
I see, so I think then I may not even need a scraper, but even Excel's import/export Wizard would do, it seems.
 
Oracle is the database to know if you are set on making SQL your career path. Microsoft SQL is "Oracle-Lite" at best. IBM's DB/2 is also a decent RDBMS, just not as popular as Oracle or MS-SQL. All three RDBMS' have pretty much unlimited number of records (MS-SQL limits you to 16 terrabytes). Oracle allows up to 1,000 columns per record. MS-SQL limits you to 8,060 bytes per record. Oracle allows for recursive scripts, MS-SQL does not. Microsoft also has their own version of SQL, they no longer use the ANSI Standard for SQL (particularly with regard to their joins), but Oracle does.

You can get the Oracle Database 11g Express Edition for free for training and development purposes. Microsoft offers the same thing for MS-SQL Server Express. If you are really interested, you can also get a free copy of IBM's Db2 Express-C.

Also, check out Oracle's, Microsoft's, and IBM's developer websites. They will often include free tools for developers.
 
  • Like
Likes   Reactions: WWGD and fresh_42
You could also just install your own Linux VM, apt-get or yum MySQL-server and insert whatever you want. Delete the vm was hen you are done.

Btw, 100,000 rows is a medium size database at best. I have over a billion entries in some of mine. The most important thing to understand is indexing and hashing when dealing with things that big. Different types of indexes are optimized for different types of data and queries.
 
  • Like
Likes   Reactions: WWGD

Similar threads

Replies
5
Views
3K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 9 ·
Replies
9
Views
7K
  • · Replies 5 ·
Replies
5
Views
3K
Replies
3
Views
3K
Replies
12
Views
3K
  • · Replies 19 ·
Replies
19
Views
4K
  • · Replies 1 ·
Replies
1
Views
4K