Searching inside .zip files on internet


by Vespero
Tags: files, inside, internet, searching
Vespero
Vespero is offline
#1
Jan23-14, 01:06 AM
P: 28
Is anyone aware of a way to search through .zip files on the internet (such as in an archive site) without having to download and extract the files? For example, if I have a search phrase that may be in a .doc file inside a .zip file which is potentially stored with many other .zip files, I don't want to have to download them all and have to manually search through them, but would like to be able to at least find the correct .zip to download first.

Many thanks.
Phys.Org News Partner Science news on Phys.org
NASA's space station Robonaut finally getting legs
Free the seed: OSSI nurtures growing plants without patent barriers
Going nuts? Turkey looks to pistachios to heat new eco-city
SixNein
SixNein is offline
#2
Jan23-14, 05:00 PM
PF Gold
SixNein's Avatar
P: 183
Quote Quote by Vespero View Post
Is anyone aware of a way to search through .zip files on the internet (such as in an archive site) without having to download and extract the files? For example, if I have a search phrase that may be in a .doc file inside a .zip file which is potentially stored with many other .zip files, I don't want to have to download them all and have to manually search through them, but would like to be able to at least find the correct .zip to download first.

Many thanks.
If your using HTTP version 1.1 then yes because you can use ranges.

http://en.wikipedia.org/wiki/Byte_serving
.Scott
.Scott is offline
#3
Jan23-14, 05:33 PM
P: 420
Quote Quote by SixNein View Post
If your using HTTP version 1.1 then yes because you can use ranges.

http://en.wikipedia.org/wiki/Byte_serving
I think he is hoping to use a search engine - and I don't think any of them do what he wants.

If he's writing the search code himself, your byte range thing would be useful if he could eliminate many of the files based on their filename. But whenever a zip file contained only files like *.txt, *.docx, he would still need to read the whole zip file.

harborsparrow
harborsparrow is offline
#4
Feb2-14, 10:51 AM
harborsparrow's Avatar
P: 322

Searching inside .zip files on internet


I can't think of any way for a search bot to look inside any file out on a web server without first retrieving the file to local disk. It seems irrelevant whether the file is compressed (zipped) or not. If you are writing code, you can certainly get a library to open the zip so it can be searched as plain text or whatever format you expect.

Unless perhaps you were able to inject pernicious code onto the web server itself so that the code runs THERE, but that would be only if you are permitted to add code to a site. It would not apply to most sites. If your code is looking at files on the web, they must first be fetched to your local disk by the HTTP client. Period.


Register to reply

Related Discussions
Inside tech. sales rep. searching for a focused career Career Guidance 2
ieuser.exe missing in C:Program Files\Internet Explorer - is my IE compromised? Computers 6
getting variable names from .mat files when they are loaded inside a function Math & Science Software 0
download very large files from the Internet Computers 2
can u share files and printers on a router as well as a internet connection? Computing & Technology 9