Is there any way to web-scrape a website that's down?

Eclair_de_XII · Aug 28, 2021

I tried Google-searching the site, and found several archive sites. Each archive site has archived the main site and page directory, yes. But every single archive site has seemed to fail to capture the pages on the tkinter objects. I confess that I had taken the site for granted. I'm aware of other tkinter documentation sites on the internet, and I am also aware that other GUI modules exist, like Flask; one user on here mentioned it to me once. All the same, I found effbot the most valuable for tkinter documentation.

jedishrfu · Aug 28, 2021

Short answer no. If you can’t see it how can you scrape it.

There is another way though. Try the internet archive wayback machine. They may have taken a snapshot of the site.

HTTPS://web.archive.org

Eclair_de_XII · Aug 29, 2021

https://web.archive.org/web/20200801000000*/effbot.org

I've found plenty of archives of the site, but the ones I have checked do not seem to have the instruction pages available. Frankly, it would be a bit hasslesome to check every single one; I'm considering using a web-scraping script to search for a working link. As mentioned earlier, the web archive seems to have the page directories but not the pages themselves. For example:

https://web.archive.org/web/20200703091947/http://effbot.org/tkinterbook

Tom.G · Sep 1, 2021

https://cs.gmu.edu/~dfleck/classes/cs112/spring09/slides/tkinter.pdf

https://www.oreilly.com/library/view/python-gui-programming/9781788835886/

And a whole bunch more found with:
https://www.google.com/search?&q=tkinterbook

pbuk · Sep 1, 2021

Or the Python manual at https://docs.python.org/3/library/tk.html.

Flask is not a gui library.

Eclair_de_XII · Sep 8, 2021

Tom.G said:

https://www.google.com/search?&q=tkinterbook

Huh, I actually found a working mirror of the site via this simple search that I wish I had done. Thanks.

Code:

https://www.reddit.com/r/Tkinter/comments/ozdyd8/effbotorgtkinterbook_mirror_recovered_from/

https://dafarry.github.io/tkinterbook/

jedishrfu · Sep 8, 2021

Is that a mirror or a Google cache of the site (aka snapshot)?

Eclair_de_XII · Sep 8, 2021

I don't really know; you'd have to ask the person who recovered the site.

https://github.com/dafarry

pbuk · Sep 8, 2021

jedishrfu said:

Is that a mirror or a Google cache of the site (aka snapshot)?

According to the message on the site it is a scrape from Wayback Machine.

Is there any way to web-scrape a website that's down?

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

How to increase phone signal strength by lying about it

Who is responsible for the software when AI takes over programming?

Use of AI (ML/DL) in Science

Could the reason why I can't select any kernels in VS Code be this error?

How useful is this if I want to begin programming?

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight