everytab/pipeline/02_warc_parse
2026-05-20 10:18:15 -04:00
db.go update warc parsing with new 3 stage producer, worker, consumer model, increasing speed and saturating cores 2026-05-20 10:18:15 -04:00
log.go improve stats generation 2026-05-20 00:31:38 -04:00
main.go update warc parsing with new 3 stage producer, worker, consumer model, increasing speed and saturating cores 2026-05-20 10:18:15 -04:00
parser.go cap number of favicons to 50 per host 2026-05-20 00:53:24 -04:00
process.go added warc parser 2026-05-17 20:25:59 -04:00
warc.go bump up s3 warc retries to 6 to avoid 503 errors 2026-05-20 01:30:46 -04:00
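
The commit message on db.go and main.go mentions a 3 stage producer, worker, consumer model. The listing does not show the implementation, so the following is only a minimal sketch of that general pattern in Go. Every name in it (record, result, runPipeline) is hypothetical, and the word-count "parse" step is a stand-in for real WARC parsing.

```go
package main

import (
	"fmt"
	"runtime"
	"strings"
	"sync"
)

// record is a hypothetical stand-in for one raw WARC record.
type record struct {
	Host string
	Body string
}

// result is a hypothetical stand-in for one parsed record.
type result struct {
	Host  string
	Words int
}

// runPipeline pushes inputs through three stages connected by channels:
// one producer, a pool of workers, and a single consumer that aggregates.
// It returns the total word count per host.
func runPipeline(inputs []record, workers int) map[string]int {
	if workers < 1 {
		workers = 1
	}
	jobs := make(chan record, workers*2)
	results := make(chan result, workers*2)

	// Stage 1: the producer feeds records into the jobs channel and
	// closes it when the input is exhausted.
	go func() {
		defer close(jobs)
		for _, r := range inputs {
			jobs <- r
		}
	}()

	// Stage 2: workers do the CPU-bound step in parallel.
	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for r := range jobs {
				results <- result{Host: r.Host, Words: len(strings.Fields(r.Body))}
			}
		}()
	}

	// Close results once every worker has finished, so the consumer loop ends.
	go func() {
		wg.Wait()
		close(results)
	}()

	// Stage 3: a single consumer owns the aggregate, so the map needs no lock.
	totals := make(map[string]int)
	for res := range results {
		totals[res.Host] += res.Words
	}
	return totals
}

func main() {
	inputs := []record{
		{Host: "a.example", Body: "one two three"},
		{Host: "b.example", Body: "four five"},
		{Host: "a.example", Body: "six"},
	}
	totals := runPipeline(inputs, runtime.NumCPU())
	fmt.Println(totals["a.example"], totals["b.example"]) // prints "4 2"
}
```

Sizing the worker pool to runtime.NumCPU() is one common way to keep all cores busy on CPU-bound work, and keeping all writes in a single consumer goroutine is one common way to avoid lock contention on a shared sink such as a database. Whether the actual code in this directory makes the same choices is not visible from the listing.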