How are web archives created? Technical aspects of web content capture
Web archives are collections produced by libraries and other heritage institutions to permanently preserve online heritage. They often contain large amounts of material stored from the web through the use of web crawlers. From a usage perspective, they are often unpredictable, non-transparent and inconsistent data sources that contain numerous content gaps.