WARC, short for Web ARChive, is a data storage format for storing archived web pages. It is an extension of the ARC format, traditionally used by web crawlers to store data from web pages. This project is an effort to create a standalone library for writing WARC files from captured web resources and to integrate it with the current Wget2 codebase.

Organization

Student

Suhas K S

Mentors

  • darnir
close

2020