Lama9901 1 day ago [-]
Author update: spent the day doing a final pass before asking HN to re-up the post.
What changed since the original submission:
- 8 active claims (added DT-FEM-01 — FEM/digital twin verification)
- 107 tests passing, steward_audit PASS
- Every link on the site now points to the actual file in the repo
- system_manifest.json synced, all docs consistent
Still solo, still transparent about limitations (reports/known_faults.yaml).
Happy to answer any questions about the protocol design.
itsthecourier 2 hours ago [-]
"A hash-only check still passes. MetaGenesis Core adds a second layer: - integrity layer → PASS - semantic layer → FAIL (job_snapshot missing)"
Could you please elaborate on this?
Lama9901 2 hours ago [-]
Sure. The semantic layer is a second verification pass that runs independently of file integrity.
Here's why SHA-256 alone isn't enough. An adversary can:
1. Remove job_snapshot from the artifact (stripping the core evidence of what actually ran)
2. Recompute all SHA-256 hashes to match the modified files
3. Rebuild the manifest
A hash-only verifier sees everything consistent and returns PASS. The attack succeeds silently.
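A minimal sketch of why re-hashing defeats a hash-only verifier. The file names and manifest shape here are illustrative, not the project's actual bundle format:

```python
import hashlib

def sha256_hex(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()

def build_manifest(files: dict[str, bytes]) -> dict[str, str]:
    # The manifest maps each file name to the SHA-256 of its contents.
    return {name: sha256_hex(data) for name, data in files.items()}

def hash_only_verify(files: dict[str, bytes], manifest: dict[str, str]) -> bool:
    # A hash-only verifier just checks that every recorded hash matches.
    return manifest == build_manifest(files)

# Original bundle with the evidence file present.
bundle = {"result.json": b'{"E_GPa": 70.2}', "job_snapshot": b"...run log..."}
manifest = build_manifest(bundle)
assert hash_only_verify(bundle, manifest)  # PASS, as expected

# The attack: strip the evidence, then simply recompute the manifest.
tampered = {k: v for k, v in bundle.items() if k != "job_snapshot"}
tampered_manifest = build_manifest(tampered)
assert hash_only_verify(tampered, tampered_manifest)  # still PASS: silent
```

The tampered bundle is internally consistent, so nothing that only checks hashes can flag it.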
The semantic layer catches this. After the integrity check passes, it independently verifies:
- job_snapshot is present (evidence of the actual computation, not just file hashes)
- payload.kind matches the registered claim type (can't swap one claim for another)
- canary_mode flag is consistent (dual-mode execution provenance intact)
If job_snapshot was stripped, the semantic check returns FAIL: job_snapshot missing — even if every SHA-256 is valid.
This specific attack is an adversarial test in the public repo: tests/steward/test_cert02_pack_includes_evidence_and_semantic_verify.py
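Stripped of the project's actual machinery, the semantic pass could be sketched like this; the function name, file names, and keys are assumptions for illustration, not the real steward_audit API:

```python
import json

def semantic_verify(files: dict[str, bytes], registered_kind: str) -> tuple[bool, str]:
    # Second-pass check, run only after the hash/integrity check has passed.
    if "job_snapshot" not in files:
        return False, "FAIL: job_snapshot missing"
    payload = json.loads(files["payload.json"])
    if payload.get("kind") != registered_kind:
        return False, "FAIL: payload.kind mismatch"
    if "canary_mode" not in payload:
        return False, "FAIL: canary_mode flag inconsistent"
    return True, "PASS"

intact = {
    "payload.json": b'{"kind": "MTR-1", "canary_mode": false}',
    "job_snapshot": b"...evidence of the actual run...",
}
stripped = {k: v for k, v in intact.items() if k != "job_snapshot"}

print(semantic_verify(intact, "MTR-1"))    # (True, 'PASS')
print(semantic_verify(stripped, "MTR-1"))  # (False, 'FAIL: job_snapshot missing')
```

The key property: the check asks whether required evidence exists and is the registered kind, which recomputing hashes cannot restore.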
The deeper point — which I didn't explain in the original post:
In physics and engineering domains, the semantic layer connects to something stronger than an internal threshold. Young's modulus for aluminium is ~70 GPa. That's not a value I chose — it's been measured independently in thousands of labs worldwide.
When MTR-1 runs, it verifies the computation against that physical constant (rel_err ≤ 1%). The chain extends to FEM verification (DT-FEM-01, rel_err ≤ 2%) and drift monitoring (DRIFT-01).
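As a sketch, the physical anchor is just a relative-error gate against a constant the author doesn't control. The thresholds come from the claims above; the function name and sample values are mine:

```python
# Reference value: Young's modulus for aluminium, an external physical
# constant (~70 GPa), not a number stored inside the bundle.
E_REFERENCE_GPA = 70.0

def anchored_check(computed_gpa: float, tolerance: float) -> bool:
    # Relative error of the computed result against the external constant.
    rel_err = abs(computed_gpa - E_REFERENCE_GPA) / E_REFERENCE_GPA
    return rel_err <= tolerance

print(anchored_check(70.4, 0.01))  # MTR-1-style gate, rel_err <= 1%: True
print(anchored_check(73.0, 0.01))  # ~4.3% off the constant: False
print(anchored_check(71.2, 0.02))  # DT-FEM-01-style gate, rel_err <= 2%: True
```

Because E_REFERENCE_GPA is externally measurable, an attacker can't make a wrong result pass by editing anything inside the bundle.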
The difference: tamper-evident provenance answers "was the bundle modified?" — the physical anchor answers "does the number agree with physical reality?" These are different questions. Both matter, but the second is harder to fake because the ground truth is external to the system.
This doesn't apply to ML accuracy or data pipelines — there the value is purely tamper-evident provenance, not physical grounding. The protocol is honest about that distinction in reports/known_faults.yaml.
measurablefunc 1 hour ago [-]
This is another "art" project. Nice work OP.
Lama9901 55 minutes ago [-]
What would change your mind? Genuine question.
The adversarial test is public and runnable in 5 minutes:
- If output isn't PASS/PASS on your machine, I want to know.
- If the protocol design is flawed, I want to know where specifically.
- Known limitations are machine-readable: reports/known_faults.yaml
Charon77 9 minutes ago [-]
First of all, I don't want to run anyone's code without proper explanation, so help me understand this.
Let's start with the verifier. The third-party verifier receives a bundle without knowing what the content is and without access to the tool used to measure, and just runs a single command on a bundle that presumably contains both expected results and actual measurements, either of which can easily be tampered with. What does that actually prove?
Lama9901 3 minutes ago [-]
Good question — this is exactly the right thing to challenge.
You're right that a bundle alone proves nothing if the creator controls both sides. The tamper-evidence comes from two things you can verify independently:
1. The governance layer. The bundle isn't hand-crafted — it's produced by a CI-enforced pipeline (steward_audit.py, sealed in CI, locked via a policy gate). The pipeline is public, so you can read exactly what it does before running anything: scripts/steward_audit.py, scripts/mg.py.
2. The physical anchor. For materials and FEM claims, the "expected result" isn't inside the bundle at all. Young's modulus for aluminium (~70 GPa) is a physical constant measured independently in thousands of labs; it's not a value I chose. The verifier checks the result against external physical reality, not against a number I provided.
So the question isn't "did he put the right number in the bundle" — it's "does this computation agree with what physics says aluminium does?"
That's what the physical anchor solves. For ML and data pipelines you're right: it's provenance only, not physical grounding. That's documented in reports/known_faults.yaml :: SCOPE_001.