Sealed Identity Architecture — Decision Brief

Status: Approved direction. Implementation in two stages (§4); legal documentation runs in parallel (§6), not as a gate. Owner: TBD (privacy lead). Companion plan: docs/superpowers/plans/2026-05-10-sealed-identity-spike.md.

Posture: Architectural privacy protections are not subject to legal-review veto. The premise of this work is that if we cannot produce a piece of data due to how the system is built, we cannot be compelled to rebuild the system to produce it. That position is well-supported in US law (All Writs Act limits, Apple/FBI 2016) and is consistent with Lantern's Immutable Right #6 (cofounder agreement). Counsel's role here is to make sure our privacy policy, ToS, and subpoena-response playbook accurately describe the architecture — not to grant or withhold permission to ship it.

1. Summary

Lantern's privacy commitments (Immutable Right #6, Business Plan §3.2) are met today by client-side profile encryption and k-anonymity gating, but the phone-number → account-record link is server-resolvable in plaintext. A subpoena of the form "what account has phone +1-555-…?" returns a userId directly. This document proposes a two-stage hardening of that link, evaluates legal/operational/engineering cost, and lists trigger criteria for committing to it.

The brief is intentionally narrow: it covers the auth-table identity link only. Profile data sealing (PBKDF2 + AES-GCM, apps/web/src/lib/encryption.js) is already in place and out of scope.

2. v1 reality (what the code actually does today)

Earlier drafts of this brief described a v1 with HMAC-hashed phones, a login_events table, and a banned_phone_hashes index. None of those exist. This section is the authoritative description.

2.1 Storage

Phone-PIN users live in the Firestore users/{userId} collection. Relevant fields:

Field	Contents	Sensitivity
`phone`	E.164-normalized phone number, plaintext	High — direct PII
`phoneSalt`	Per-user salt for client-side wrapping-key derivation	Public-by-design
`encryptedSeed`	BIP39 entropy AES-GCM-wrapped with a phone+PIN-derived key	Sealed (we cannot decrypt)
`authProofHash`	`HMAC-SHA256(entropy, "lantern-auth-proof-v1")`	Hash; verifier only
`pinFailedAttempts`, `pinLockoutUntil`	Server-enforced lockout state	Operational
`encryptedBirthDate`, `encryptionCanary`, `salt`	Profile encryption	Sealed
`lanternName`, `authMethod`, `lastLoginAt`	Display + audit	Low–medium

There is no auth_table distinct from users/{userId}. The phone→userId link is just users.where('phone', '==', normalized).limit(5) (services/api/auth/src/routes/phone.js:42).

Client POST /auth/phone/lookup with { phone }. Server queries users by plaintext phone, returns { userId, phoneSalt, encryptedSeed, lanternName, authMethod }.
Client derives wrapping key from phone+PIN, decrypts encryptedSeed → entropy (16 bytes).
Client computes proofHmac = HMAC-SHA256(entropy, "lantern-auth-proof-v1") and POST /auth/phone/token with { userId, proofHmac }.
Server compares against stored authProofHash with timingSafeEqual (customToken.service.js:46–55). On match, issues a Firebase custom token; on mismatch, increments pinFailedAttempts; locks out at 5 failures for 15 minutes.

What this already gets us: server cannot recover the PIN; encryptedSeed is not decryptable server-side; profile fields are sealed.

What this does not get us: step 1 returns userId indexed by plaintext phone. A compelled-disclosure request providing a phone number gets back the userId and the entire users/{userId} document (minus the sealed fields).

There is no central login_events collection. PIN attempt counters live on users/{userId} itself. Admin and merchant logins write to adminActions (no IP) (adminAuth.js:81–84). User phone+PIN logins update lastLoginAt only. Cloud Run access logs (timestamps, IPs, request paths) exist outside Firestore at the platform layer.

2.4 Ban enforcement

User-level bans are implemented at the userId level only (moderation.js:22–66) — sets users/{userId}.banned, disables Firebase Auth account. No phone-hash ban table exists. The banned_accounts collection sketched in docs/features/safety/SAFETY_MECHANICS.md (bcrypt-hashed phone + email) is designed but unbuilt.

3. The actual gap, in subpoena terms

Question	v1 response
"Does an account exist for this phone?"	yes/no
"What's the userId for this phone?"	returns `userId` directly
"What did this user do?"	full `users/{userId}` doc + any `userId`-indexed records (frens, waves, lit-lantern events)
"Decrypt their profile"	cannot — sealed
"Decrypt their seed / PIN"	cannot — sealed

The first three rows are the meaningful disclosure surface. Sealing the third (everything keyed by userId) is hard — it's the operational data of the app. Sealing the first two is what this proposal addresses.

4. Proposal — two stages, not one

The original brief jumped straight to encrypting the link. That skips the bigger and cheaper win.

Stage A — Hash the phone (no behavior change for users)

Replace plaintext users/{userId}.phone with phoneHash = HMAC-SHA256(KMS_pepper, e164(phone)). Lookup becomes users.where('phoneHash', '==', clientOrServerComputedHash).limit(5). KMS pepper rotation strategy TBD in spike.

This alone:

Removes plaintext PII from the dominant subpoena entry path
Forces a compelled-disclosure request to either provide the pepper-hashed value (which they cannot, without our KMS access) or compel us to compute it (legally distinguishable from "produce the row")
Unblocks the banned_accounts design in SAFETY_MECHANICS.md (same hash form)
Doesn't touch the proof-of-entropy chain

This is roughly the v1 the original brief thought we already had.

Stage B — Seal the userId resolution (the original proposal, restated)

After Stage A, users/{phoneHash → userId} is still server-resolvable: phone-with-pepper → row → userId. Stage B encrypts the userId itself with a passphrase-derived key:

auth_lookup:  (phoneHash, encryptedUserIdBlob, phoneSalt, encryptedSeed, authProofHash)
              encryptedUserIdBlob = AES-256-GCM(userId, key = HKDF(entropy))

The login flow gains one step:

POST /auth/phone/lookup returns { phoneSalt, encryptedSeed, encryptedUserIdBlob, authProofHash } — no userId.
Client decrypts encryptedSeed with PIN → derives entropy → derives blob key via HKDF → decrypts encryptedUserIdBlob → has userId.
Client sends { userId, proofHmac } to /auth/phone/token as today.

Server never observes phoneHash → userId resolution as a single readable step. Compelled disclosure of the auth_lookup row yields a hashed phone and an opaque blob.

Trade-offs vs. v1:

Capability	Stage A	Stage B
Subpoena "userId for phone X"	hash + row, still resolvable	ciphertext only
CS lookup by phone	unchanged (compute hash, query row)	requires user-initiated session sharing
Anti-fraud on phone-reuse	unchanged	reduced; phone-hash visible but cannot link to userId activity
`banned_accounts` enforcement	enables it	unaffected (independent index)
Login UX	unchanged	one extra round-trip's worth of decryption (sub-50ms client-side)
Engineering scope	medium (~1 sprint)	high (multi-sprint, plus migration)

5. Trigger criteria

Stage A: should land before Phase 5 (soft launch) per INITIAL_LAUNCH.md. Plaintext phones in production are difficult to walk back once we have real users.

Stage B: should also land before Phase 5 (soft launch) if engineering capacity allows. The case for sealing pre-launch:

Sealing the architecture before the first subpoena lands looks like a privacy commitment; sealing it after looks like obstruction.
Migrating existing users is harder than building it for new users — every month of pre-launch growth raises the migration cost.
Public privacy claims (privacy policy, marketing) made under a v1 architecture become legal liabilities if we then change the architecture and the claims drift.

If engineering capacity forces a deferral, Stage B can ship in a Phase 4–5 patch window or, at the latest, Phase 6.

Market-entry triggers that would force Stage B regardless of timing:

Move into a jurisdiction with compelled-redesign powers (UK IPA, Australia TOLA). Those jurisdictions can order us to log decrypted data going forward — the only architectural defense is to ship Stage B before market entry, so the absence of a logging mechanism is the pre-existing state of the system, not a post-hoc retreat.
A merchant, governmental partner, or pilot partner requiring sealed identity as a contracting condition.
A material privacy incident at a comparable platform implicating phone→userId resolution.

6. Open questions for review

Counsel (parallel to implementation, not a gate)

These are documentation/policy tasks. They produce artifacts that describe the shipped architecture; they do not grant or withhold permission to ship it.

Privacy policy + ToS language. Audit the description of phone-number handling and identity resolution against what the code actually does post-Stage-A and post-Stage-B. Avoid claims that overstate sealing (e.g. "we never see your phone number" — false; we see it transiently to compute the hash) or understate it (e.g. silence on the userId-blob means we lose the marketing benefit).
Subpoena-response playbook. A prepared template for the most common request shapes:
- "What account has phone X" → post-Stage-A: we can compute the hash and return whether a row exists, but the row contains no plaintext PII; post-Stage-B: we can return the blob but cannot decrypt it.
- "Decrypt this user's profile" → cannot, by design.
- "Log future activity for phone X" → covered separately; see question 5.
User notification policy. When a subpoena identifier (phone) cannot be linked to a userId server-side, what does notification mean? Possible answer: notify all users via a transparency report, since we cannot identify the specific user.
GDPR DPIA if/when EU market entry is on the roadmap. Sealed identity helps the DPIA, not hurts it — but the document needs to exist.
Compelled-redesign jurisdictions. Document which markets we will and won't enter without a Stage B equivalent already shipped (UK, Australia are the obvious risks). This is a market-entry checklist, not an architecture question.
Pseudonymization classification under GDPR Art. 4(5): does HMAC-SHA-256 with a KMS-held pepper qualify? (Likely yes; document the answer.)

Engineering

Session token lifecycle and rotation under passphrase-derived keys (how do background refreshes work without re-prompting for PIN?)
Recovery flow when user clears app data but retains passphrase / recovery phrase
Migration path for existing users (re-encrypt at next login? batch backfill via a one-time client task?)
Anti-fraud detection redesign for userId-only signals (ban-evasion via phone reuse — does the phone-hash ban list cover the gap?)
Performance impact: extra round-trip for blob fetch + client-side decryption at every login (likely negligible but should be measured)
KMS pepper rotation: how do we re-hash without a phone-number table to iterate over? (Likely answer: lazy re-hash on next successful login, with both old and new pepper accepted during a rotation window.)
App Check / IP rate limiting: does sealing userId force changes to abuse heuristics that currently key on userId?

Operational

Customer support workflow redesign — read-only support views? user-initiated session sharing? out-of-band confirmation?
T&S investigation tools that don't rely on phone→userId linkage
Internal-access audit policy for encryptedUserIdBlob (who can read what under what authorization)
Incident response runbook update — what does "a user's phone was leaked" look like when we can't connect it to their userId?

7. Non-negotiable constraints

Any proposal must respect:

Immutable Right #6 (no data sales) — cannot be voted away, not even unanimously
§3.2 k-anonymity (≥ 3 unique users per reported cell) on any merchant-surfaced metric
No per-user behavioral profiles for ad targeting (Right #6 + §3.2)
No cross-device tracking, no device fingerprinting
Cannot weaken existing privacy commitments to gain operational capability
Phase 1 capital posture (Cofounder Agreement §11): work must be deliverable on founder time + minimal infrastructure spend until Phase 2 trigger

8. Recommended next step

Stage A (hash the phone) ships first. Removes plaintext PII, unblocks banned_accounts, ~1 sprint. See the spike plan.
Stage B (seal the userId) ships immediately after Stage A — preferably pre-Phase-5. The architectural commitment is strongest when made before the first subpoena and before public-facing growth.
Counsel-track work runs in parallel (privacy policy + ToS audit, subpoena playbook, DPIA prep). It produces artifacts that describe what shipped; it does not block what ships.
This brief is the canonical reference. Older drafts pointing to docs/economics/AD_PLACEMENT_ECONOMICS.md are dead links — that file does not exist.

9. Subpoena flow under the end-state architecture

This is what happens when law enforcement hands us a phone number and asks "who is this user?" once Stage A + Stage B are both shipped. The diagram is the answer to that question.

mermaid

flowchart TD
    Q1([Subpoena: identify the user with phone +1-555-XXXX])
    Q1 --> N[Server normalizes input to E.164]
    N --> P[Server fetches PHONE_HASH_PEPPER<br/>from KMS / Secret Manager]
    P --> H["Compute phoneHash =<br/>HMAC-SHA-256(pepperBytes, e164)"]
    H --> LK["Query auth_lookup / phoneHash"]
    LK --> EX{Row exists?}
    EX -->|No| RESP1([Truthful response:<br/>no such account])
    EX -->|Yes| ROW["Row contains:<br/>encryptedUserIdBlob<br/>phoneSalt, encryptedSeed"]
    ROW --> DEC{Can the server<br/>decrypt the blob?}
    DEC -->|"No — blob key = HKDF(entropy);<br/>entropy is only recoverable by<br/>decrypting encryptedSeed with the<br/>user's PIN, which the server never sees<br/>and cannot derive"| RESP2([Truthful response:<br/>opaque ciphertext returned;<br/>no path from phone to userId])
    UID[/userId — never recovered server-side/] -.never reached.- RESP2

    classDef sealed fill:#1f4f3a,color:#fff,stroke:#2d8659,stroke-width:2px
    classDef ghost fill:transparent,color:#888,stroke:#666,stroke-dasharray:5 5
    class RESP1,RESP2 sealed
    class UID ghost

Why each step is irreversible

Step	What we have	What we don't have
Hash the phone	The pepper (in KMS/Secret Manager) and the input phone	A reverse — HMAC isn't reversible; without the pepper, even a leaked DB dump can't be rainbow-tabled against a phone-number dictionary at scale
Look up the row	The `phoneHash` to query against	The `userId` of the row owner — Stage B replaces the userId field with a ciphertext blob
Decrypt the blob	The ciphertext, returned by Firestore	The decryption key — it's `HKDF(entropy)`, and entropy comes only from decrypting `encryptedSeed` with the user's PIN. The PIN never leaves the user's device, and we don't store it in any form (only `authProofHash`, an HMAC of the entropy under a fixed context string, which is one-way)

What we hand over vs. what we don't

Subpoena asks	What we can produce	What we cannot produce
"Does an account exist for phone X?"	yes / no	—
"What's the userId for phone X?"	the row's ciphertext blob	the userId itself
"What did userId Y do?" (if they hand us the userId)	everything indexed by userId	profile fields (still PBKDF2 + AES-GCM client-encrypted)
"Decrypt this user's profile"	nothing	profile bytes (we never had the key)
"Decrypt this user's seed"	nothing	seed bytes (PIN-wrapped, we never had the PIN)

What each stage contributes

Stage	Status	What it adds	What still leaks without it
Stage A — hash the phone with KMS pepper	Phase 1+2 shipped on dev (PR #479); phase 3-5 pending	Removes plaintext phone numbers from the database. A leaked Firestore export becomes useless without the pepper. Enables `banned_accounts` to share the same hash form.	Plaintext `phone` field still present during phases 1-2 (dual-write); the `phoneHash → userId` resolution is still server-side once a phone is provided to the server.
Stage B — encrypt the userId resolution	Not started; gated on engineering capacity, not legal review	Replaces the `userId` in the auth lookup row with a passphrase-keyed ciphertext blob. The server can find the row via `phoneHash`, but the row no longer reveals `userId` server-side. Only the user, by entering their PIN, can decrypt it.	Without Stage B, a subpoena providing a phone produces the corresponding `userId` and any data indexed by it.

Important caveat about the current state

We have shipped Stage A phases 1-2 only (dual-write of phoneHash alongside plaintext phone). Today, a subpoena providing a phone number can still be answered with a userId: the server can either query by plaintext phone (fallback path) or compute the hash and query by phoneHash. Either path returns the row, and the row contains the userId. The diagram above describes the end-state after Stage A phase 4 (drop plaintext) and Stage B (seal the userId blob) both ship.

The interim state still meaningfully reduces certain attack surfaces — a leaked DB dump (without the pepper) is much harder to rainbow-table than the previous plaintext-phone state — but it does not yet make the phone→userId link unrecoverable. The diagram is what we are building toward, not what is live today.

10. References

v1 auth code: services/api/auth/src/routes/phone.js, services/api/auth/src/services/customToken.service.js
Profile encryption: apps/web/src/lib/encryption.js
Existing privacy docs: HOW_ENCRYPTION_WORKS.md, PRIVACY_PRESERVING_DATA_COLLECTION.md
Safety/ban design: docs/features/safety/SAFETY_MECHANICS.md
Roadmap: docs/business/timelines/INITIAL_LAUNCH.md
Cofounder Agreement: Immutable Right #6 (no data sales), §9.4 (Mission Arbiter), §3 (Marketing & Offers Platform constraints)
Business Plan: §3.1 (encryption), §3.2 (anonymity + k-anonymity), §12 (plaintext-metadata limits)

Sealed Identity Architecture — Decision Brief ​

1. Summary ​

2. v1 reality (what the code actually does today) ​

2.1 Storage ​

2.2 Login flow (zero-knowledge proof of PIN, server-resolvable identity) ​

2.3 Login event logging ​

2.4 Ban enforcement ​

3. The actual gap, in subpoena terms ​

4. Proposal — two stages, not one ​

Stage A — Hash the phone (no behavior change for users) ​

Stage B — Seal the userId resolution (the original proposal, restated) ​

5. Trigger criteria ​

6. Open questions for review ​

Counsel (parallel to implementation, not a gate) ​

Engineering ​

Operational ​

7. Non-negotiable constraints ​

8. Recommended next step ​

9. Subpoena flow under the end-state architecture ​

Why each step is irreversible ​

What we hand over vs. what we don't ​

What each stage contributes ​

Important caveat about the current state ​

10. References ​