Tool to read contents of a web page

  • Thread starter Thread starter sganesh88
  • Start date Start date
  • Tags Tags
    Web
AI Thread Summary
A tool is needed to automate the retrieval of specific data from a website displaying server memory usage statistics in table format. Currently, this data is collected manually every 30 minutes. Suggested methods for automation include using programming languages like Perl, JavaScript, or Python, which can parse HTML to extract the required values. For simpler tasks, tools like grep, sed, or awk may suffice if the HTML layout is straightforward. On Windows, VisualBasic or VBA macros can be utilized, while Linux/Unix users can employ "wget" to download the HTML page. Additionally, PHP can be used to read the HTML contents line-by-line to find keywords. Overall, various programming approaches can effectively automate the data retrieval process.
sganesh88
Messages
286
Reaction score
0
I need a tool which can read some specific data from a website.
Based on the data retrived i will make some calculations.
Is there a tool that can read a HTML page and retrive some values displayed in it?
The data are shown in table format in that website.
This website is probe which displays some memory usage of a server.
We take some statistics from this page and make calculations like taking average etc.
This task is now being done manually for evry 30 minutes.
We need to automate it.
 
Computer science news on Phys.org
I'm not familiar offhand with any programs that are specifically made to do this. Usually Perl or Javascript get put into service for this kind of thing, especially since the desired data is well hidden in the actual page formatting tags. You might search for HTML parsers to see if there are any helper functions you could use.
 
You can parse the html page using something like python. If it's a reasonably straightforward layout then you can use even simpler tools like grep, sed or awk.

Just have a look at the html page in a text editor, see if the location of the table is easy to locate (near its title for instance), then write a small script to retrieve the required values.

If you're on windows then I suppose you could use VisualBasic or even a VBA macro in Office, but it's easier in linux/unix.
 
You can use "wget" to get the html page.
wget http://thepage.com
After that, follow the usualname's sugestions.
 
There is a PHP function which can read the contents of a pages' HTML and put each line into an array. You could read the HTML line-by-line searching for keywords. It could all be done in PHP and use a refresh script to perform it at each interval.

There are other options as well.
 
I came across a video regarding the use of AI/ML to work through complex datasets to determine complicated protein structures. It is a promising and beneficial use of AI/ML. AlphaFold - The Most Useful Thing AI Has Ever Done https://www.ebi.ac.uk/training/online/courses/alphafold/an-introductory-guide-to-its-strengths-and-limitations/what-is-alphafold/ https://en.wikipedia.org/wiki/AlphaFold https://deepmind.google/about/ Edit/update: The AlphaFold article in Nature John Jumper...
Thread 'Urgent: Physically repair - or bypass - power button on Asus laptop'
Asus Vivobook S14 flip. The power button is wrecked. Unable to turn it on AT ALL. We can get into how and why it got wrecked later, but suffice to say a kitchen knife was involved: These buttons do want to NOT come off, not like other lappies, where they can snap in and out. And they sure don't go back on. So, in the absence of a longer-term solution that might involve a replacement, is there any way I can activate the power button, like with a paperclip or wire or something? It looks...

Similar threads

Replies
22
Views
2K
Replies
9
Views
2K
Replies
15
Views
3K
Replies
5
Views
2K
Replies
10
Views
10K
Replies
15
Views
3K
  • Sticky
Replies
0
Views
4K
Back
Top