How to download an archive of your website using Wayback Machine and Python
Today I just discovered that Wayback Machine has created an API that allows you to download an archive of your website to your local computer. I gave it a try this morning and it works well. This tutorial is for Mac OS X developers but anyone with Python installed on their computer can follow along.
Step-by-step instructions
Confirm that you have python on your computer by typing this into terminal:
python --version
Then you need to install python’s package manager called pip:
sudo easy_install pip
Then you need to install waybackpacker from BuzzFeed's data scientist Jeremy Singer-Vine
pip install waybackpack
Then you need to create a folder where you want to download the files. Since I'm on a Mac, I'm choosing to first change directory to my /home/Desktop
.
cd ~/Desktop
Then create a new folder on your desktop
mkdir my_specific_folder
Then you need to fire off the waybackpack tool. In my case, I'm stopping the downloading in the year 2011.
waybackpack kusc.org -d my_specific_folder -end 2011