Organization: Internet Archive
The Internet Archive discovers and captures web pages through many different web crawls. At any given time several distinct crawls are running, some lasting for months and others repeating on a daily or longer cycle. View the web archive through the Wayback Machine.
Collection: Wide Crawl Number 17: Started August 3rd, 2018 – Still running
Wide17 was seeded with the "Total Domains" list of 256,796,456 URLs provided by Domains Index on June 26th, and crawled with max-hops set to "3" and de-duplication set to "on".
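
To make the two crawl settings concrete, here is a minimal sketch of what a hop limit and de-duplication could look like in a breadth-first crawler. It is an illustration only, not the Internet Archive's crawler or its configuration: the function `crawl`, the `fetch` callable, and the toy in-memory "web" in the usage note are all hypothetical, and it assumes that max-hops counts link hops from a seed and that de-duplication means skipping storage of a payload whose digest has already been seen.

```python
import hashlib
from collections import deque


def crawl(seeds, fetch, max_hops=3, dedup=True):
    """Breadth-first crawl with a hop limit and content de-duplication.

    `fetch` is a hypothetical callable: fetch(url) -> (body_bytes, outlinks).
    Returns (stored, skipped): URLs whose payload was kept, and URLs whose
    payload was dropped as a duplicate of one already stored.
    """
    seen_urls = set(seeds)      # never queue the same URL twice
    seen_digests = set()        # digests of payloads already stored
    stored, skipped = [], []
    queue = deque((url, 0) for url in seeds)  # (url, hops from a seed)

    while queue:
        url, hops = queue.popleft()
        body, outlinks = fetch(url)

        # Assumed meaning of de-duplication: identical payloads are stored once.
        digest = hashlib.sha1(body).hexdigest()
        if dedup and digest in seen_digests:
            skipped.append(url)
        else:
            seen_digests.add(digest)
            stored.append(url)

        # Assumed meaning of max-hops: links are followed only while the
        # current page is fewer than `max_hops` hops away from a seed.
        if hops < max_hops:
            for link in outlinks:
                if link not in seen_urls:
                    seen_urls.add(link)
                    queue.append((link, hops + 1))

    return stored, skipped
```

With a toy link graph such as `a -> b -> c -> d -> e`, where `a` is the only seed, a crawl with `max_hops=3` reaches `d` (three hops from the seed) but never fetches `e`. A second page that returns the same bytes as an earlier one is fetched but lands in `skipped` rather than `stored`.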