Fiat-Shamir Transcripts

1. Why this chapter exists

Every challenge in halo2 ( $\theta$ , $\beta$ , $\gamma$ , $y$ , $x$ , the IPA round challenges, the multiopen $x_1 \dots x_4$ ) comes from a single Fiat-Shamir transcript. The transcript is what makes the protocol non-interactive; it is also the part of the codebase where a single off-by-one operation silently breaks soundness. This chapter covers the trait layer (so you can write a custom transcript), the Blake2b implementation (so you know exactly what halo2 hashes), and the common_* vs write_* distinction (which is the most common source of transcript bugs).

2. Definitions

Definition (Transcript). A stateful object that mirrors the view of one party of the protocol. It absorbs the messages sent by the prover and emits challenges when asked, deterministically as a function of everything absorbed so far.

Definition (Common input). A value that is part of the shared protocol input (the verifying key, the instance) and is not sent over the wire as part of the proof. Both prover and verifier hash it into their transcript independently.

Definition (Prover-to-verifier message). A value the prover sends and the verifier reads. Both sides hash it into the transcript; the prover additionally writes it to the proof byte stream, and the verifier additionally reads it from there.

Definition (Challenge). A value squeezed out of the transcript. In halo2 a raw challenge is a 32-byte uniformly random string; a ChallengeScalar<C, T> is a typed scalar derived from it.

3. The code

3.1 The trait layer

halo2_proofs/src/transcript.rs
/// Generic transcript view (from either the prover or verifier's perspective)
pub trait Transcript<C: CurveAffine, E: EncodedChallenge<C>> {
    /// Squeeze an encoded verifier challenge from the transcript.
    fn squeeze_challenge(&mut self) -> E;

    /// Squeeze a typed challenge (in the scalar field) from the transcript.
    fn squeeze_challenge_scalar<T>(&mut self) -> ChallengeScalar<C, T> {
        ChallengeScalar {
            inner: self.squeeze_challenge().get_scalar(),
            _marker: PhantomData,
        }
    }

    /// Writing the point to the transcript without writing it to the proof,
    /// treating it as a common input.
    fn common_point(&mut self, point: C) -> io::Result<()>;

    /// Writing the scalar to the transcript without writing it to the proof,
    /// treating it as a common input.
    fn common_scalar(&mut self, scalar: C::Scalar) -> io::Result<()>;
}

/// Transcript view from the perspective of a verifier that has access to an
/// input stream of data from the prover to the verifier.
pub trait TranscriptRead<C: CurveAffine, E: EncodedChallenge<C>>: Transcript<C, E> {
    /// Read a curve point from the prover.
    fn read_point(&mut self) -> io::Result<C>;

    /// Read a curve scalar from the prover.
    fn read_scalar(&mut self) -> io::Result<C::Scalar>;
}

/// Transcript view from the perspective of a prover that has access to an
/// output stream of messages from the prover to the verifier.
pub trait TranscriptWrite<C: CurveAffine, E: EncodedChallenge<C>>: Transcript<C, E> {
    /// Write a curve point to the proof and the transcript.
    fn write_point(&mut self, point: C) -> io::Result<()>;

    /// Write a scalar to the proof and the transcript.
    fn write_scalar(&mut self, scalar: C::Scalar) -> io::Result<()>;
}

/// We will replace BLAKE2b with an algebraic hash function in a later version.
#[derive(Debug, Clone)]

Three traits cleanly separate concerns:

Transcript: the shared base. Includes squeeze_challenge, squeeze_challenge_scalar, common_point, common_scalar.
TranscriptRead (verifier side): adds read_point, read_scalar. Each of these hashes the value into the transcript and returns it.
TranscriptWrite (prover side): adds write_point, write_scalar. Each of these hashes the value into the transcript and writes it to the byte stream.

The common_* vs write_* distinction:

common_* is used for values both sides already know (the VK itself, the public instance). Both sides call common_*; nothing crosses the wire.
write_* is used for values the prover is sending. Only the prover calls write_*; the verifier calls read_* to consume the bytes and absorb the same value.

If the prover calls write_* where it should call common_*, the verifier will see the value twice in its transcript hash and the challenges will diverge. The reverse is also a soundness break.

3.2 The Blake2b implementation

halo2_proofs/src/transcript.rs
/// We will replace BLAKE2b with an algebraic hash function in a later version.
#[derive(Debug, Clone)]
pub struct Blake2bRead<R: Read, C: CurveAffine, E: EncodedChallenge<C>> {
    state: Blake2bState,
    reader: R,
    _marker: PhantomData<(C, E)>,
}

impl<R: Read, C: CurveAffine, E: EncodedChallenge<C>> Blake2bRead<R, C, E> {
    /// Initialize a transcript given an input buffer.
    pub fn init(reader: R) -> Self {
        Blake2bRead {
            state: Blake2bParams::new()
                .hash_length(64)
                .personal(b"Halo2-Transcript")
                .to_state(),
            reader,
            _marker: PhantomData,
        }
    }
}

impl<R: Read, C: CurveAffine> TranscriptRead<C, Challenge255<C>>
    for Blake2bRead<R, C, Challenge255<C>>
where
    C::Scalar: FromUniformBytes<64>,
{
    fn read_point(&mut self) -> io::Result<C> {
        let mut compressed = C::Repr::default();
        self.reader.read_exact(compressed.as_mut())?;
        let point: C = Option::from(C::from_bytes(&compressed)).ok_or_else(|| {
            io::Error::new(io::ErrorKind::Other, "invalid point encoding in proof")
        })?;
        self.common_point(point)?;

        Ok(point)
    }

    fn read_scalar(&mut self) -> io::Result<C::Scalar> {
        let mut data = <C::Scalar as PrimeField>::Repr::default();
        self.reader.read_exact(data.as_mut())?;
        let scalar: C::Scalar = Option::from(C::Scalar::from_repr(data)).ok_or_else(|| {
            io::Error::new(
                io::ErrorKind::Other,
                "invalid field element encoding in proof",
            )
        })?;
        self.common_scalar(scalar)?;

        Ok(scalar)
    }
}

impl<R: Read, C: CurveAffine> Transcript<C, Challenge255<C>> for Blake2bRead<R, C, Challenge255<C>>
where
    C::Scalar: FromUniformBytes<64>,
{
    fn squeeze_challenge(&mut self) -> Challenge255<C> {
        self.state.update(&[BLAKE2B_PREFIX_CHALLENGE]);
        let hasher = self.state.clone();
        let result: [u8; 64] = hasher.finalize().as_bytes().try_into().unwrap();
        Challenge255::<C>::new(&result)
    }

    fn common_point(&mut self, point: C) -> io::Result<()> {
        self.state.update(&[BLAKE2B_PREFIX_POINT]);
        let coords: Coordinates<C> = Option::from(point.coordinates()).ok_or_else(|| {
            io::Error::new(
                io::ErrorKind::Other,
                "cannot write points at infinity to the transcript",
            )
        })?;
        self.state.update(coords.x().to_repr().as_ref());
        self.state.update(coords.y().to_repr().as_ref());

        Ok(())
    }

    fn common_scalar(&mut self, scalar: C::Scalar) -> io::Result<()> {
        self.state.update(&[BLAKE2B_PREFIX_SCALAR]);
        self.state.update(scalar.to_repr().as_ref());

        Ok(())
    }
}

/// We will replace BLAKE2b with an algebraic hash function in a later version.
#[derive(Debug, Clone)]
pub struct Blake2bWrite<W: Write, C: CurveAffine, E: EncodedChallenge<C>> {
    state: Blake2bState,
    writer: W,
    _marker: PhantomData<(C, E)>,
}

impl<W: Write, C: CurveAffine, E: EncodedChallenge<C>> Blake2bWrite<W, C, E> {
    /// Initialize a transcript given an output buffer.
    pub fn init(writer: W) -> Self {

The Blake2bRead and Blake2bWrite types are the production default. Three implementation choices to remember:

The hash is Blake2b-512 (64-byte output), personalized with "Halo2-Transcript".
A 1-byte prefix is hashed before every absorbed value: BLAKE2B_PREFIX_CHALLENGE = 0, BLAKE2B_PREFIX_POINT = 1, BLAKE2B_PREFIX_SCALAR = 2. These prevent domain confusion attacks (you cannot make a point hash collide with a scalar hash).
The ChallengeScalar<C, T> is squeezed by reducing the 64-byte Blake2b output modulo the scalar field order, via FromUniformBytes. This gives statistical uniformity in the scalar field.

3.3 Typed challenges

halo2_proofs/src/transcript.rs
///
/// The `Type` type can be used to scope the challenge to a specific context, or
/// set to `()` if no context is required.
#[derive(Copy, Clone, Debug)]
pub struct ChallengeScalar<C: CurveAffine, T> {
    inner: C::Scalar,
    _marker: PhantomData<T>,
}

impl<C: CurveAffine, T> std::ops::Deref for ChallengeScalar<C, T> {
    type Target = C::Scalar;

    fn deref(&self) -> &Self::Target {
        &self.inner
    }
}

/// `EncodedChallenge<C>` defines a challenge encoding with a [`Self::Input`]
/// that is used to derive the challenge encoding and `get_challenge` obtains
/// the _real_ `C::Scalar` that the challenge encoding represents.
pub trait EncodedChallenge<C: CurveAffine> {
    /// The Input type used to derive the challenge encoding. For example,
    /// an input from the Poseidon hash would be a base field element;
    /// an input from the Blake2b hash would be a [u8; 64].
    type Input;

    /// Get an encoded challenge from a given input challenge.
    fn new(challenge_input: &Self::Input) -> Self;

    /// Get a scalar field element from an encoded challenge.
    fn get_scalar(&self) -> C::Scalar;

ChallengeScalar<C, T> carries a phantom type T so the compiler can distinguish "the lookup challenge $\theta$ " from "the permutation challenge $\beta$ " even though both are scalars. Each user of the transcript defines a marker type (the multiopen types X1, X2, X3, X4 are an example):

#[derive(Clone, Copy, Debug)]
struct X1 {}
type ChallengeX1<F> = ChallengeScalar<F, X1>;

This catches "I asked for $x_2$ but used it as if it were $x_3$ " at compile time.

3.4 The `Challenge255` encoding

halo2_proofs/src/transcript.rs

/// A 255-bit challenge.
#[derive(Copy, Clone, Debug)]
pub struct Challenge255<C: CurveAffine>([u8; 32], PhantomData<C>);

impl<C: CurveAffine> std::ops::Deref for Challenge255<C> {
    type Target = [u8; 32];

    fn deref(&self) -> &Self::Target {
        &self.0
    }
}

impl<C: CurveAffine> EncodedChallenge<C> for Challenge255<C>
where
    C::Scalar: FromUniformBytes<64>,
{
    type Input = [u8; 64];

    fn new(challenge_input: &[u8; 64]) -> Self {
        Challenge255(
            C::Scalar::from_uniform_bytes(challenge_input)
                .to_repr()
                .as_ref()
                .try_into()
                .expect("Scalar fits into 256 bits"),
            PhantomData,
        )
    }
    fn get_scalar(&self) -> C::Scalar {
        let mut repr = <C::Scalar as PrimeField>::Repr::default();
        repr.as_mut().copy_from_slice(&self.0);
        C::Scalar::from_repr(repr).unwrap()
    }
}

Challenge255 is the in-protocol challenge representation: a 255-bit random string (32 bytes with the top bit masked off). Most halo2 code never sees it directly; it appears inside the EncodedChallenge trait that abstracts over future challenge encodings.

4. Failure modes

common_* / write_* confusion. Swapping one for the other silently breaks soundness. There are no compile-time checks; review every transcript call when changing the prover.
Re-squeezing without absorbing. Two consecutive squeeze_challenge calls return different values (Blake2b is stateful), but only because the state is updated by the squeeze. Calling squeeze_challenge twice without absorbing anything in between is rarely what you want; the resulting challenges are still random but they correspond to different protocol steps. If both prover and verifier do the same thing this is safe, but it almost certainly indicates a bug.
Hashing the VK twice. The VK should be hashed exactly once at the start of the proof, via vk.hash_into(transcript). Doing it twice (or not at all) silently changes every subsequent challenge.
Custom transcripts forgetting the domain prefixes. A custom Transcript implementation must keep separate domains for points, scalars, and challenges, or it admits a point-as-scalar confusion attack.

5. Spec pointers

The Fiat-Shamir heuristic is treated formally in Bellare and Rogaway, "Random oracles are practical", CCS 1993. Modern security proofs in the algebraic group model live in Fuchsbauer et al., "ROM-AGM".
BLAKE2: simpler, smaller, fast as MD5 for the underlying hash construction.

6. Exercises

Open halo2_proofs/src/transcript.rs and identify the three BLAKE2B_PREFIX_* constants. What would happen if you set two of them to the same value?
Implement a NoopTranscript that records the operations performed against it (without computing anything) and write a small test that compares the prover's call sequence to the verifier's. Confirm they agree token-by-token.
Find the hash_into method on VerifyingKey in halo2_proofs/src/plonk.rs and trace exactly which fields of the VK are absorbed.
Why does ChallengeScalar<C, T> carry a phantom T? Answer in one sentence.

Answers in the code

Exercise 1: the constants are 0, 1, 2. Aliasing them would remove the domain separation between challenges, points, and scalars, opening the transcript to a confusion attack.
Exercise 3: VerifyingKey::hash_into calls transcript.common_scalar(self.transcript_repr), where transcript_repr is itself the Blake2b-512 hash of the pinned verifying key (see the from_parts method).
Exercise 4: the phantom binds each challenge to a particular protocol step at compile time, preventing accidental reuse.

7. Further reading

The Halo 2 Book "Implementation" chapter describes the transcript layer in the context of the full proving system.

1. Why this chapter exists​

2. Definitions​

3. The code​

3.1 The trait layer​

3.2 The Blake2b implementation​

3.3 Typed challenges​

3.4 The Challenge255 encoding​

4. Failure modes​

5. Spec pointers​

6. Exercises​

Answers in the code​

7. Further reading​