everytab/pipeline
2026-05-17 22:34:27 -04:00
01_cc_index added query.sh to read the cc-index from s3 parquet files and dump it into our psql db 2026-05-17 19:12:25 -04:00
02_warc_parse added warc parser 2026-05-17 20:25:59 -04:00
03_icon_download added icon downloader 2026-05-17 22:09:03 -04:00
04_best_icon don't use svg icons, they aren't supported in conversion 2026-05-17 22:34:27 -04:00