It's a non-profit that crawls the web roughly every month or two and saves the data. They have petabytes of information, all stored in .arc files.