| db81015e0b | added query.sh to read the cc-index from s3 parquet files and dump it into our psql db | 2026-05-17 19:12:25 -04:00 |
| 65d2757527 | allow ec2 to access common crawl s3 | 2026-05-17 18:22:41 -04:00 |
| fcf203e1d8 | added infra setup with terraform | 2026-05-17 16:07:50 -04:00 |
| 64ae58494b | added timestamps, warc parser library, log files, progress bars, and testing the frontend with real data to the PLAN.md | 2026-05-17 14:16:56 -04:00 |
| c50be97fd7 | added PLAN.md with initial dev plan | 2026-05-17 14:00:14 -04:00 |
| a327fb3db3 | fixed diagram and last tweaks before we plan and code | 2026-05-17 13:10:37 -04:00 |
| 01b5de040c | updated architecture, downloading all icons, millisecond seeds, mermaid diagram, and partial index | 2026-05-17 13:03:37 -04:00 |
| cf6d819f1f | initial ARCHITECTURE.md document | 2026-05-17 12:19:06 -04:00 |
| 8ef465b2a4 | don't use zdns, just use a local unbound to make things easier | 2026-05-17 12:18:47 -04:00 |
| f6ec08535f | initial commit | 2026-05-15 17:32:55 -04:00 |