Projects involving data science

In summary, the conversation discusses the speaker's struggle to find project ideas for data science and machine learning using Python. They mention using a simple dataset from WHO and being limited to data manipulation and exploratory data analysis. They also express the need for projects to add to their CV as they lack experience in the field. Suggestions are given for projects involving acoustics and image character recognition. The conversation then shifts to what employers are looking for in personal projects, with a focus on the ability to draw conclusions from data. The speaker is advised to develop a program or method for automatically identifying trends and reporting on their significance.
  • #1
EngWiPy
1,368
61
Hello,

I am trying to do some projects on data science/machine learning using Python, but I am not sure what to do. I downloaded a very simple dataset from WHO, and I am trying to do something with it, but most of (actually all) what I can do with it is data manipulation and exploratory data analysis (histograms, scatter plots, ... etc). I need these projects for my CV since I don't have previous experience in the field. Any suggestion will be highly appreciated.

Thanks
 
Technology news on Phys.org
  • #2
Do something with acoustics like voice recognition, or animal sounds recognition. There are a lot of datasets for acoustics that could be used.

Image character recognition is a common project. Here's a video on how neural-nets work that uses the character recognition.

 
  • Like
Likes EngWiPy
  • #3
jedishrfu said:
Do something with acoustics like voice recognition, or animal sounds recognition. There are a lot of datasets for acoustics that could be used.

Image character recognition is a common project. Here's a video on how neural-nets work that uses the character recognition.



Thanks. I am still novice in the field, and it seems a little complicated to do these things now. Neural networks/deep learning is a topic on its own. This leads me to the following question: What are employers looking for in the personal projects? What do I need to demonstrate as skills? How complex my project should be? Thanks in advance
 
  • #4
The ultimate goal of a field like data science is to draw conclusion from data. Ideally you will be able to make some conclusion or recommendation from the data that you have.

As an example. Let's say the data-set is related to insurance, age, current estimated risk, number of accidents, cost, type of vehicle, etc...

An employer would look that you can take a block of data and draw a conclusion from it. It is not enough to say here is the difference in number of accidents between age 85 and 75 drivers. You need to say "The price of insurance for 75 year old needs to go up due to the fact that the profit after paying out for accidents is not high enough." You need to develop a program/method that will essentially fill in that data for you. You need to determine a way to evaluate through coding if an age group or car group is worth it. A good way to start would be just looking at histograms and scatter plots, then start thinking of ways to automatically identify those trends.

TLDR: with your dataset, develop ways to identify trends, then think about what those trends mean, and report on it.
 
  • Like
Likes EngWiPy

1. What is data science?

Data science is a multidisciplinary field that involves using scientific methods, processes, algorithms, and systems to extract knowledge and insights from data in various forms, both structured and unstructured.

2. What are the key skills required for data science projects?

The key skills required for data science projects include programming, statistics, data visualization, machine learning, and domain expertise. Other important skills include critical thinking, problem-solving, and communication.

3. What are the steps involved in a data science project?

The steps involved in a data science project include defining the problem, collecting and cleaning the data, exploring and visualizing the data, building and evaluating models, and deploying the solution. It also involves constantly iterating and refining the process.

4. What are some common challenges faced in data science projects?

Some common challenges faced in data science projects include data quality issues, lack of domain expertise, overfitting of models, and ethical concerns related to data privacy and bias. Other challenges may include lack of resources, time constraints, and difficulty in interpreting and communicating the results.

5. How can data science projects benefit various industries?

Data science projects can benefit various industries by providing valuable insights and predictions, improving decision-making processes, automating tasks, identifying patterns and trends, and optimizing processes and operations. It can also help in developing new products and services, and improving customer experience and engagement.

Similar threads

  • Programming and Computer Science
Replies
1
Views
843
  • Programming and Computer Science
Replies
3
Views
1K
  • Programming and Computer Science
Replies
1
Views
1K
  • Programming and Computer Science
Replies
6
Views
3K
  • Programming and Computer Science
Replies
3
Views
1K
  • Programming and Computer Science
Replies
4
Views
2K
Replies
6
Views
658
  • Programming and Computer Science
Replies
11
Views
1K
  • Programming and Computer Science
Replies
1
Views
1K
  • Programming and Computer Science
Replies
5
Views
3K
Back
Top