How to write a program to retrieve Web data

  • Thread starter Thread starter Saladsamurai
  • Start date Start date
  • Tags Tags
    Data Program Web
AI Thread Summary
Developing a tool to extract job status data from a poorly designed website is feasible with basic programming skills, particularly using Python. Python offers libraries like Beautiful Soup and Minidom for parsing HTML content, allowing users to navigate the Document Object Model (DOM) of the webpage to extract necessary information. While the initial setup may be straightforward, users should be aware that changes to the webpage's structure could require updates to the code. Overall, Python is considered an accessible and effective choice for web scraping tasks.
Saladsamurai
Messages
3,009
Reaction score
7
Hello! :smile:

So I have some basic programming skills, but I have never done anything that interacts with the web. Here at work, we have a website that we have to go to in order to check the statuses of all of the jobs we have open. The website is awful in that you cannot run a report on all of the jobs at once. I want to develop a tool that goes to the website, loops through all of the jobs, and pulls the necessary data.

I just need a starting point for now. Is this something I can do with Python? Any thoughts are helpful.

Thanks!
 
Technology news on Phys.org
Basically every programming language has some tool to read HTML content of web pages. You can pick your favorite one.
It looks easy with python.
 
I've used Beautiful Soup, and Minidom (both Python libraries) to do this. What you are really doing is parsing the DOM of the webpage you want. You look for the information located in some div by traversing the XML structure, and extract it. If the webpage changes structure, you might have to recode your solution, but it's pretty easy programming.
 
I agree python is a simple way to do it. I wrote similar parser without using any additional libraries but those that installed automatically (and with some ancient 2.x python version). But I don't have access to the code ATM so I can't tell you details.
 
Ok thanks guys! Parsing and python... The two P's... I'll tell my boss I'll be PP'ing all next week!
 
Dear Peeps I have posted a few questions about programing on this sectio of the PF forum. I want to ask you veterans how you folks learn program in assembly and about computer architecture for the x86 family. In addition to finish learning C, I am also reading the book From bits to Gates to C and Beyond. In the book, it uses the mini LC3 assembly language. I also have books on assembly programming and computer architecture. The few famous ones i have are Computer Organization and...
I have a quick questions. I am going through a book on C programming on my own. Afterwards, I plan to go through something call data structures and algorithms on my own also in C. I also need to learn C++, Matlab and for personal interest Haskell. For the two topic of data structures and algorithms, I understand there are standard ones across all programming languages. After learning it through C, what would be the biggest issue when trying to implement the same data...

Similar threads

Back
Top