Data Science and Systems Integration Blog Post

Click For Summary
SUMMARY

The discussion centers on the importance of a bottom-up approach in systems design for data science, as advocated by a participant with extensive experience in data handling. The participant emphasizes the challenges of maintaining a fresh perspective on data problems due to their expertise, which can lead to assumptions about others' knowledge. They also touch on the potential pitfalls of a top-down design approach, particularly in hypothesis testing, and the need for balanced documentation to avoid user frustration.

PREREQUISITES
  • Data cleaning techniques
  • Data modeling principles
  • Understanding of hypothesis testing
  • Documentation best practices in data science
NEXT STEPS
  • Explore data cleaning tools such as OpenRefine
  • Learn about data modeling frameworks like CRISP-DM
  • Investigate the implications of top-down vs. bottom-up design in data systems
  • Study effective documentation strategies for data science projects
USEFUL FOR

Data scientists, systems designers, and anyone involved in data-driven decision-making who seeks to improve their understanding of system design approaches and documentation practices.

Ackbach
Gold Member
MHB
Messages
4,148
Reaction score
94
Really excited about this blog post by yours truly. Wineman Technology is going places!
 
Technology news on Phys.org
Interesting article!

I work with large amounts of data every day and would consider myself an expert in the field when considering the whole continuum of "Can't use a computer" to "Coding at Google". I grab data, clean it, join it, model it, and present it.

That is important to your point, because what I lose in my years of training is the ability to see data and a problem with new eyes. Once you know something, you can try to empathize with the mind of someone who doesn't know it but it is quite difficult. Even worse, you can be very unaware of the facts that you internally assume everyone must know yet they don't.

So I fully agree with you that systems benefit from building bottom-up, not top-down. Some guidance is helpful but too much will leave the end user clueless and frustrated.
 
Jameson said:
Interesting article!

I work with large amounts of data every day and would consider myself an expert in the field when considering the whole continuum of "Can't use a computer" to "Coding at Google". I grab data, clean it, join it, model it, and present it.

That is important to your point, because what I lose in my years of training is the ability to see data and a problem with new eyes. Once you know something, you can try to empathize with the mind of someone who doesn't know it but it is quite difficult.

Yeah, yeah. Total nerd. I get it. ;)

Jameson said:
Even worse, you can be very unaware of the facts that you internally assume everyone must know yet they don't.

So I fully agree with you that systems benefit from building bottom-up, not top-down.

I'm a hair puzzled. I thought I was advocating for a top-down design approach, which is simply what a typical statistics textbook will tell you. If you have an hypothesis at hand (big question, in the parlance of my blog post), then you can design the system that generates the data that, once analyzed, answers the hypothesis or question. Isn't that a top-down design approach? It is considered "cheating", is it not, to use the same data to generate an hypothesis as well as to test it?

Jameson said:
Some guidance is helpful but too much will leave the end user clueless and frustrated.

I'm very curious what you mean here. Are you talking about achieving that difficult-to-achieve balance in documentation between conciseness and adequacy?

Cheers!

By the way, I LOVE THE QUOTE-BREAKER!
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
1K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 8 ·
Replies
8
Views
1K
  • · Replies 8 ·
Replies
8
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
10
Views
5K
Replies
5
Views
2K
Replies
4
Views
3K
  • · Replies 29 ·
Replies
29
Views
5K