I’ve found a way to make myself useful. I’m adding pages from the EPA’s website to the Internet Archive. As such, I’ve found perhaps hundreds of pages which are not archived yet. This bookmarklet is easier to use than the official one. Why? It does it within the page w/o opening a new one.
What to Do
* Install the bookmarklet above to your browser.
* Go to a page of links (like the first one in the following section) and then CTRL + left click on all the links.
* Then go through each page clicking the bookmarklet.
* When you’re done, work your way back across each open tab by clicking the back button. Then scroll down and look for additional links + PDFs.
* Close the tabs as you work your way back across them.
* Sometimes it’ll time out so you need to hit the back button, then try the bookmarklet again.
Things to Know
* If it’s a PDF, it usually downloads to your computer. Annoying. Right now I don’t know how to get those in the Internet Archives, but hold onto them.
* If the PDF is hosted online, you can click the bookmarklet to add it to the Internet Archive.
* If the website/page doesn’t allow robot.txt, you can’t add it to the Archive.
* If you notice that you’re working through pages which have been recently archived, go find another set to go through. It’s a better use of your time to find pages which have never been archived before. These random reports haven’t gotten any IA love before.
If you’re interested in strategically going through the EPA site with me, let me know. We can make a plan of action to go through and get the pages in.