Abstract
Proof-carrying data (PCD) is a powerful cryptographic primitive that enables mutually distrustful parties to perform distributed computations that run indefinitely. Known approaches to construct PCD are based on succinct non-interactive arguments of knowledge (SNARKs) that have a succinct verifier or a succinct accumulation scheme.
In this paper we show how to obtain PCD without relying on SNARKs. We construct a PCD scheme given any non-interactive argument of knowledge (e.g., with linear-size arguments) that has a split accumulation scheme, which is a weak form of accumulation that we introduce.
Moreover, we construct a transparent non-interactive argument of knowledge for R1CS whose split accumulation is verifiable via a (small) constant number of group and field operations. Our construction is proved secure in the random oracle model based on the hardness of discrete logarithms, and it leads, via the random oracle heuristic and our result above, to concrete efficiency improvements for PCD.
Along the way, we construct a split accumulation scheme for Hadamard products under Pedersen commitments and for a simple polynomial commitment scheme based on Pedersen commitments.
Our results are supported by a modular and efficient implementation.
The full version of this paper is available online [BCL+20].
1 Introduction
Proof-carrying data (PCD) [CT10] is a powerful cryptographic primitive that enables mutually distrustful parties to perform distributed computations that run indefinitely, while ensuring that the correctness of every intermediate state of the computation can be verified efficiently. A special case of PCD is incrementally-verifiable computation (IVC) [Val08]. PCD has found applications in enforcing language semantics [CTV13], verifiable MapReduce computations [CTV15], image authentication [NT16], blockchains [Mina, KB20, BMRS20, CCDW20], and others. Given the theoretical and practical relevance of PCD, it is an important research question to build efficient PCD schemes from minimal cryptographic assumptions.
PCD from Succinct Verification. The canonical construction of PCD is via recursive composition of succinct non-interactive arguments (SNARGs) [BCCT13, BCTV14, COS20]. Informally, a proof that the computation was executed correctly for t steps consists of a proof of the claim “the t-th step of the computation was executed correctly, and there exists a proof that the computation was executed correctly for \(t-1\) steps”. The latter part of the claim is expressed using the SNARG verifier itself. This construction yields secure PCD (with IVC as a special case) provided the SNARG satisfies an adaptive knowledge soundness property (i.e., is a SNARK). Efficiency requires the SNARK to have sublinear-time verification, achievable via SNARKs for machine computations [BCCT13] or preprocessing SNARKs for circuit computations [BCTV14, COS20].
Requiring sublinear-time verification, however, significantly restricts the choice of SNARK, which limits what is achievable for PCD. These restrictions have practical implications: the concrete efficiency of recursion is limited by the use of expensive curves for pairing-based SNARKs [BCTV14] or heavy use of cryptographic hash functions for hash-based SNARKs [COS20].
PCD from Accumulation. Recently, [BCMS20] gave an alternative construction of PCD using SNARKs that have succinct accumulation schemes; this developed and formalized a novel approach for recursion sketched in [BGH19]. Informally, rather than being required to have sublinear-time verification, the SNARK is required to be accompanied by a cryptographic primitive that enables “postponing” the verification of SNARK proofs by way of an accumulator that is updated at each recursion step. The main efficiency requirement on the accumulation scheme is that the accumulation procedure must be succinctly verifiable, and in particular the accumulator itself must be succinct.
Requiring a SNARK to have a succinct accumulation scheme is a weaker condition than requiring it to have sublinear-time verification. This has enabled constructing PCD from SNARKs that do not have sublinear-time verification [BCMS20], which in turn led to PCD constructions from assumptions and with efficiency properties that were not previously achieved. Practitioners have exploited this freedom to design implementations of recursive composition with improved practical efficiency [Halo20, Pickles20].
Our Motivation. The motivation of this paper is twofold: first, can PCD be built from a weaker primitive than SNARKs with succinct accumulation schemes? Second, if so, can we leverage this to obtain PCD constructions with improved concrete efficiency?
1.1 Contributions
We make theory and systems contributions that advance the state of the art for PCD: (1) We introduce split accumulation schemes for relations, a cryptographic primitive that relaxes prior notions of accumulation. (2) We obtain PCD from any non-interactive argument of knowledge that satisfies this weaker notion of accumulation; surprisingly, this allows for arguments with no succinctness whatsoever. (3) We construct a non-interactive argument of knowledge based on discrete logarithms (and random oracles) whose accumulation verifier has constant size (improving over the logarithmic-size verifier of prior accumulation schemes in this setting). (4) We implement and evaluate constructions from this paper and from [BCMS20].
We elaborate on each of these contributions next.
(1) Split accumulation for relations. Recall from [BCMS20] that an accumulation scheme for a predicate \(\varPhi :X\rightarrow \{0,1\}\) enables proving/verifying that each input in an infinite stream \(\mathsf {q}_{1},\mathsf {q}_2,\ldots \) satisfies the predicate \(\varPhi \), by augmenting the stream with accumulators. Informally, for each i, the prover produces a new accumulator \(\mathsf {acc}_{i+1}\) from the input \(\mathsf {q}_i\) and the old accumulator \(\mathsf {acc}_{i}\); the verifier can check that the triple \((\mathsf {q}_i,\mathsf {acc}_{i},\mathsf {acc}_{i+1})\) is a valid accumulation step, much more efficiently than running \(\varPhi \) on \(\mathsf {q}_i\). At any time, the decider can validate \(\mathsf {acc}_{i+1}\), which establishes that for all \(j \le i\) it was the case that \(\varPhi (\mathsf {q}_j) = 1\). The accumulator size (and hence the running time of the three algorithms) cannot grow in the number of accumulation steps.
We extend this notion in two orthogonal ways. First we consider relations \(\varPhi :X\times W\rightarrow \{0,1\}\) and now for a stream of instances \(\mathsf {qx}_{1},\mathsf {qx}_{2},\ldots \) the goal is to establish that there exist witnesses \(\mathsf {qw}_{1},\mathsf {qw}_{2},\ldots \) such that \(\varPhi (\mathsf {qx}_{i},\mathsf {qw}_{i})=1\) for each i. Second, we consider accumulators \(\mathsf {acc}_{i}\) that are split into an instance part \(\mathsf {acc}_{i}.\mathbbm {x}\) and a witness part \(\mathsf {acc}_{i}.\mathbbm {w}\) with the restriction that the accumulation verifier only gets to see the instance part (and possibly an auxiliary accumulation proof \(\mathsf {pf}\)). We refer to this notion as split accumulation for relations, and refer to (for contrast) the notion from [BCMS20] as atomic accumulation for languages.
The purpose of these extensions is to enable us to consider accumulation schemes in which predicate witnesses and accumulator witnesses are large while still requiring the accumulation verifier to be succinct (it receives short predicate instances and accumulator instances but not large witnesses). We will see that such accumulation schemes are both simpler and cheaper, while still being useful for primitives such as PCD.
See Sect. 2.1 for more on atomic vs. split accumulation, and the full version for formal definitions.
(2) PCD via split accumulation. A non-interactive argument has a split accumulation scheme if the relation corresponding to its verifier has a split accumulation scheme (we make this precise later). We show that any non-interactive argument of knowledge (NARK) having a split accumulation scheme where the accumulation verifier is sublinear can be used to build a proof-carrying data (PCD) scheme, even if the NARK does not have sublinear argument size. This significantly broadens the class of non-interactive arguments from which PCD can be built, and is the first result to obtain PCD from non-interactive arguments that need not be succinct. Similarly to [BCMS20], if the NARK and accumulation scheme are post-quantum secure, so is the PCD scheme. (It remains an open question whether there are non-trivial post-quantum instantiations of these.)
Theorem 1
(informal). There is an efficient transformation that compiles any NARK with a split accumulation scheme into a PCD scheme. If the NARK and its split accumulation scheme are zero knowledge, then the PCD scheme is also zero knowledge. Additionally, if the NARK and its accumulation scheme are post-quantum secure then the PCD scheme is also post-quantum secure.
Similarly to all PCD results known to date, the above theorem holds in a model where all parties have access to a common reference string, but no oracles. (The construction makes non-black-box use of the accumulation scheme verifier, and the theorem does not carry over to the random oracle model.)
A corollary of Theorem 1 is that any NARK with a split accumulation scheme can be “bootstrapped” into a SNARK for machine computations. (PCD implies IVC and, further assuming collision-resistant hashing, also efficient SNARKs for machine computations [BCCT13].) This is surprising: an argument with decidedly weak efficiency properties implies an argument with succinct proofs and succinct verification!
See Sect. 2.2 for a summary of the ideas behind Theorem 1, and the full version for technical details.
(3) NARK with split accumulation based on DL. Theorem 1 motivates the question of whether we can leverage the weaker condition on the argument system to improve the efficiency of PCD. Our focus is on minimizing the cost of the accumulation verifier for the argument system, because it is the only component that is not used as a black box, and thus typically determines concrete efficiency. Towards this end, we present a (zero knowledge) NARK with (zero knowledge) split accumulation based on discrete logarithms, with a constant-size accumulation verifier; the NARK has a transparent (public-coin) setup.
Theorem 2
(informal). In the random oracle model and assuming the hardness of the discrete logarithm problem, there exists a transparent (zero knowledge) NARK for R1CS and a corresponding (zero knowledge) split accumulation scheme with the following efficiency:

Above, \(\mathsf {M}\) denotes the number of constraints in the R1CS instance, \(\mathbb {G}\) denotes group scalar multiplications or group elements, and \(\mathbb {F}\) denotes field operations or field elements.
The NARK construction from Theorem 2 is particularly simple: it is obtained by applying the Fiat–Shamir transformation to a sigma protocol for R1CS based on Pedersen commitments (and linear argument size). The only “special” feature about the construction is that, as we prove, it has a very efficient split accumulation scheme for the relation corresponding to its verifier. By heuristically instantiating the random oracle, we can apply Theorem 1 (and [BCCT13]) to obtain a SNARK for machines from this modest starting point.
We find it informative to compare Theorem 2 and SNARKs with atomic accumulation based on discrete logarithms [BCMS20]:
-
the SNARK’s argument size is \(O(\log \mathsf {M})\) group elements, much less than the NARK’s \(O(\mathsf {M})\) field elements;
-
the SNARK’s accumulation verifier uses \(O(\log \mathsf {M})\) group scalar multiplications and field operations, much more than the NARK’s O(1) group scalar multiplications and field operations.
Therefore Theorem 2 offers a tradeoff that minimizes the cost of the accumulation verifier at the expense of argument size. (As we shall see later, this tradeoff has concrete efficiency advantages.)
Our focus on argument systems based on discrete logarithms is motivated by the fact that they can be instantiated based on efficient curves suitable for recursion: the Tweedle [BGH19] or Pasta [Hop20] curve cycles, which follow the curve cycle technique for efficient recursion [BCTV14]. (In fact, as our construction does not rely on any number-theoretic properties of \(|\mathbb {G}|\), we could even use the \((\texttt {secp256k1},\texttt {secq256k1})\) cycle, where secp256k1 is the curve used in Bitcoin.) This focus on discrete logarithms is a choice made for this paper, and we believe that our ideas can lead to efficiency improvements to recursion in other settings (e.g., pairing-based and hash-based arguments) and leave these to future work.
See Sect. 2.3 for a summary of the ideas behind Theorem 2, and the full version for technical details.
(4) Split accumulation for common predicates. We obtain split accumulation schemes with constant-size accumulation verifiers for common predicates: (i) Hadamard products (and more generally any bilinear function) under Pedersen commitments (see Sect. 2.5 for a summary and the full version for details); (ii) polynomial evaluations under Pedersen commitments (see Sect. 2.6 for a summary and the full version for technical details). Split accumulation for Hadamard products is a building block that we use to prove Theorem 2.
(5) Implementation and evaluation. We contribute a set of Rust libraries that realize PCD via accumulation through modular combinations of interchangeable components: (a) generic interfaces for atomic and split accumulation; (b) generic construction of PCD from arguments with atomic and split accumulation; (c) split accumulation for our zkNARK for R1CS; (d) split accumulation for Hadamard products under Pedersen commitments; (e) split accumulation for polynomial evaluations under Pedersen commitments; (f) atomic accumulation for polynomial commitments based on inner product arguments and pairings from [BCMS20]; (g) constraints for all the foregoing accumulation verifiers. Practitioners interested in PCD will find these libraries useful for prototyping and comparing different types of recursion (and, e.g., they may help decide whether current systems based on atomic recursion [Halo20, Pickles20] would be better off with split recursion).
We additionally conduct experiments to evaluate our implementation. Our experiments focus on determining the recursion threshold, which informally is the number of constraints that need to be proved at each step of the recursion. Our evaluation demonstrates that, over curves from the popular “Pasta” cycle [Hop20], the recursion threshold for split accumulation of our NARK for R1CS is as low as 52,000 constraints, which is at least \(8.5 \times \) cheaper than the cost of IVC constructed from atomic accumulation for discrete-logarithm-based protocols [BCMS20]. In fact, the recursion threshold is even lower than that for IVC constructed from prior state-of-the-art pairing-friendly SNARKs [Gro16]. While this comes at the expense of much larger proof sizes, this overhead is attractive for notable applications (e.g., incrementally-verifiable ledgers).
See the full version for more details on our implementation and evaluation.
Remark 1 (concurrent work)
A concurrent work [BDFG20] studies similar questions as this paper. Below we summarize the similarities and the differences between the two papers.
Similarities. Both papers are motivated by the goal of reducing the cost of recursive arguments. The main object of study in [BDFG20] is additive polynomial commitment schemes (PC schemes), for which [BDFG20] considers different types of aggregation schemes: (1) public aggregation in [BDFG20] is closely related to atomic accumulation specialized to PC schemes from prior work [BCMS20]; and (2) private aggregation in [BDFG20] is closely related to split accumulation specialized to PC schemes from this paper. Moreover, the private aggregation scheme for additive PC schemes in [BDFG20] is similar to our split accumulation scheme for Pedersen PC schemes (overviewed in Sect. 2.6 and detailed in the full version). The protocols differ in how efficiency depends on the n claims to aggregate/accumulate: the verifier in [BDFG20] uses \(n+1\) group scalar multiplications while ours uses 2n. (Informally, [BDFG20] first randomly combines claims and then evaluates at a random point, while we first evaluate at a random point and then randomly combine claims.)
Differences. The two papers develop distinct, and complementary, directions.
The focus of [BDFG20] is to design protocols for any additive PC scheme (and, even more generally, any PC scheme with a linear combination scheme), including the aforementioned private aggregation protocol and a compiler that endows a given PC scheme with zero knowledge.
In contrast, our focus is to formulate a definition of split accumulation for general relation predicates that (a) suffices to construct PCD, as we demonstrate, and (b) can be realized, in the random oracle model, via a split accumulation scheme based on Pedersen commitments. We emphasize that our definitions are materially different from the case of atomic accumulation in [BCMS20], and necessitate careful consideration of technicalities such as the flavor of adaptive knowledge soundness, which algorithms can be allowed to query oracles, and so on. Hence, we cannot simply rely on the existing foundations for atomic accumulation of [BCMS20] in order to infer the correct definitions and security reductions for split accumulation. Overall, our theoretical work enables us to achieve the first construction of PCD without succinct arguments, and also to obtain a novel NARK for R1CS with a constant-size accumulation verifier.
We stress that the treatment of accumulation at a higher level of abstraction than for PC schemes is essential to prove theorems about PCD. In particular, contrary to what is claimed as a theorem in [BDFG20], it is not known how to build PCD from a PC scheme with an aggregation/accumulation scheme in any model without making additional heuristic assumptions. This is because obtaining a NARK from a PC scheme using known techniques requires the use of a random oracle, which we do not know how to accumulate. In contrast, we construct PCD in the standard model starting directly from an aggregation/accumulation scheme for a NARK, and no additional assumptions. Separately, the security of our accumulation scheme for a NARK in the standard model is an assumption, which is conjectured based on a security proof in the ROM.
Another major difference is that we additionally contribute a comprehensive and modular implementation of protocols from [BCMS20] and this paper, and conduct an evaluation for the discrete logarithm setting. This complements the asymptotic improvements with measured improvements in concrete efficiency.
2 Techniques
We summarize the main ideas behind our results. In Sect. 2.1 we discuss our new notion of split accumulation for relation predicates, and compare it with the notion of atomic accumulation for language predicates from [BCMS20]. In Sect. 2.2 we discuss the proof of Theorem 1. In Sect. 2.3 we discuss the proof of Theorem 2; for this we rely on a new result about split accumulation for Hadamard products, which we discuss in Sect. 2.5. Then, in Sect. 2.6, we discuss our split accumulation for a Pedersen-based polynomial commitment, which can act as a drop-in replacement for polynomial commitments used in prior SNARKs, such as those of [BGH19]. Finally, in Sect. 2.7 we elaborate on our implementation and evaluation. Figure 1 illustrates the relation between our results. The rest of the paper contains technical details, and we provide pointers to relevant sections along the way.
2.1 Accumulation: Atomic vs Split
We review the notion of accumulation from [BCMS20], which we refer to as atomic accumulation, and then describe the weaker notion that we introduce, which we call split accumulation.
Atomic Accumulation for Languages. An accumulation scheme for a language predicate \(\varPhi :X\rightarrow \{0,1\}\) is a tuple of algorithms \((\mathrm {P},\mathrm {V},\mathrm {D})\), known as the prover, verifier, and decider, that enable proving/verifying statements of the form \(\varPhi (\mathsf {q}_{1}) \wedge \varPhi (\mathsf {q}_2) \wedge \cdots \) more efficiently than running the predicate \(\varPhi \) on each input.
This is done as follows. Starting from an initial (“empty”) accumulator \(\mathsf {acc}_{1}\), the prover is used to accumulate the first input \(\mathsf {q}_{1}\) to produce a new accumulator \(\mathsf {acc}_{2} \leftarrow \mathrm {P}(\mathsf {q}_{1},\mathsf {acc}_{1})\); then the prover is used again to accumulate the second input \(\mathsf {q}_2\) to produce a new accumulator \(\mathsf {acc}_{3} \leftarrow \mathrm {P}(\mathsf {q}_2,\mathsf {acc}_{2})\); and so on.
Each accumulator produced so far enables efficient verification of the predicate on all inputs that went into the accumulator. For example, to establish that \(\varPhi (\mathsf {q}_{1}) \wedge \cdots \wedge \varPhi (\mathsf {q}_{T}) =1\) it suffices to check that:
-
the verifier accepts each accumulation step: \(\mathrm {V}(\mathsf {q}_{1},\mathsf {acc}_{1},\mathsf {acc}_{2}) = 1\), \(\mathrm {V}(\mathsf {q}_2,\mathsf {acc}_{2},\mathsf {acc}_{3}) = 1\), and so on; and
-
the decider accepts the final accumulator: \(\mathrm {D}(\mathsf {acc}_{T})=1\).
Qualitatively, this replaces the naive cost \(T\cdot |\varPhi |\) with the new cost \(T\cdot |\mathrm {V}|+|\mathrm {D}|\). This is beneficial when the verifier is much cheaper than checking the predicate directly and the decider is not much costlier than checking the predicate directly. Crucially, the verifier and decider costs (and, in particular, the accumulator size) should not grow with the number \(T\) of accumulation steps (which need not be known in advance).
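As a minimal illustration, the following Rust sketch (the types and closures are hypothetical stand-ins for a concrete scheme; our libraries expose richer interfaces) checks a chain of \(T\) accumulation steps at cost \(T\cdot |\mathrm {V}|+|\mathrm {D}|\):

```rust
/// Checks T atomic accumulation steps followed by one decider call, i.e.,
/// cost T·|V| + |D|. `inputs[i]` is q_{i+1}; `accs` holds acc_1, ..., acc_{T+1}.
/// The types and closures are hypothetical stand-ins for a concrete scheme.
fn check_chain<Input, Acc>(
    inputs: &[Input],
    accs: &[Acc],
    verify: impl Fn(&Input, &Acc, &Acc) -> bool, // the accumulation verifier V
    decide: impl Fn(&Acc) -> bool,               // the decider D
) -> bool {
    assert_eq!(accs.len(), inputs.len() + 1);
    // T invocations of the (cheap) verifier, one per accumulation step ...
    let steps_ok = inputs
        .iter()
        .zip(accs.windows(2))
        .all(|(q, pair)| verify(q, &pair[0], &pair[1]));
    // ... plus a single invocation of the (expensive) decider at the end.
    steps_ok && decide(accs.last().expect("at least the initial accumulator"))
}
```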
The properties of an accumulation scheme are summarized in the following informal definition, which additionally includes an accumulation proof used to check an accumulation step (but is not passed on).
Definition 1 (informal)
An accumulation scheme for a predicate \(\varPhi :X\rightarrow \{0,1\}\) consists of a triple of algorithms \((\mathrm {P},\mathrm {V},\mathrm {D})\), known as the prover, verifier, and decider, that satisfies the following properties.
-
Completeness: For every accumulator \(\mathsf {acc}\) and predicate input \(\mathsf {q}\in X\), if \(\mathrm {D}(\mathsf {acc}) = 1\) and \(\varPhi (\mathsf {q}) = 1\), then for \((\mathsf {acc}^{\star },\mathsf {pf}^{\star }) \leftarrow \mathrm {P}(\mathsf {acc},\mathsf {q})\) it holds that \(\mathrm {V}(\mathsf {q},\mathsf {acc},\mathsf {acc}^{\star },\mathsf {pf}^{\star }) = 1\) and \(\mathrm {D}(\mathsf {acc}^{\star }) = 1\).
-
Soundness: For every efficiently-generated old accumulator \(\mathsf {acc}\), predicate input \(\mathsf {q}\in X\), new accumulator \(\mathsf {acc}^{\star }\), and accumulation proof \(\mathsf {pf}^{\star }\), if \(\mathrm {D}(\mathsf {acc}^{\star }) = 1\) and \(\mathrm {V}(\mathsf {q},\mathsf {acc},\mathsf {acc}^{\star },\mathsf {pf}^{\star }) = 1\) then, with all but negligible probability, \(\varPhi (\mathsf {q}) = 1\) and \(\mathrm {D}(\mathsf {acc}) = 1\).
The above definition omits many details, such as the ability to accumulate multiple accumulators \([\mathsf {acc}_{j}]_{j=1}^{m}\) and multiple predicate inputs \([\mathsf {q}_{i}]_{i=1}^n\) in one step, the optional property of zero knowledge (enabled by the accumulation proof \(\mathsf {pf}^{\star }\)), the fact that \(\mathrm {P},\mathrm {V},\mathrm {D}\) should receive keys \(\mathsf {apk},\mathsf {avk},\mathsf {dk}\) generated by an indexer algorithm that receives the specification of \(\varPhi \), and others. We refer the reader to [BCMS20] for more details.
The aspect that we wish to highlight here is the following: in order for the verifier to be much cheaper than the predicate (\(|\mathrm {V}| \ll |\varPhi |\)) it must be that the accumulator itself is much smaller than the predicate (\(|\mathsf {acc}| \ll |\varPhi |\)) because the verifier receives the accumulator as input. (And if the accumulator is accompanied by a validity proof \(\mathsf {pf}\) then this proof must also be small.)
We refer to this setting as atomic accumulation because the entirety of the accumulator is treated as one short monolithic string. In contrast, in this paper we consider a relaxation where this is not the case, which will enable us to obtain new instantiations that lead to new theoretical and practical results.
Split Accumulation for Relations. We propose a relaxed notion of accumulation: a split accumulation scheme for a relation predicate \(\varPhi :X\times W\rightarrow \{0,1\}\) is again a tuple of algorithms \((\mathrm {P},\mathrm {V},\mathrm {D})\) as before. Split accumulation differs from atomic accumulation in that: (a) an input to \(\varPhi \) consists of a short instance part \(\mathsf {qx}\) and a (possibly) long witness part \(\mathsf {qw}\); (b) an accumulator \(\mathsf {acc}\) is split into a short instance part \(\mathsf {acc}.\mathbbm {x}\) and a (possibly) long witness part \(\mathsf {acc}.\mathbbm {w}\); (c) the verifier only needs the short parts of inputs and accumulators to verify an accumulation step, along with a short validity proof instead of the long witness parts.
As before, the prover is used to accumulate a predicate input \(\mathsf {q}_{i} = (\mathsf {qx}_{i},\mathsf {qw}_{i})\) into a prior accumulator \(\mathsf {acc}_{i}\) to obtain a new accumulator and validity proof \((\mathsf {acc}_{i+1},\mathsf {pf}_{i+1}) \leftarrow \mathrm {P}(\mathsf {q}_{i},\mathsf {acc}_{i})\). Different from before, however, we wish to establish that given instances \(\mathsf {qx}_{1},\dots ,\mathsf {qx}_{T}\) there exist (more precisely, a party knows) witnesses \(\mathsf {qw}_{1},\dots ,\mathsf {qw}_{T}\) such that \(\varPhi (\mathsf {qx}_{1},\mathsf {qw}_{1}) \wedge \cdots \wedge \varPhi (\mathsf {qx}_{T},\mathsf {qw}_{T}) =1\). For this it suffices to check that:
-
the verifier accepts each accumulation step given only the short instance parts: \(\mathrm {V}(\mathsf {qx}_{1},\mathsf {acc}_{1}.\mathbbm {x},\mathsf {acc}_{2}.\mathbbm {x},\mathsf {pf}_{2}) = 1\), \(\mathrm {V}(\mathsf {qx}_{2},\mathsf {acc}_{2}.\mathbbm {x},\mathsf {acc}_{3}.\mathbbm {x},\mathsf {pf}_{3}) = 1\), and so on; and
-
the decider accepts the final accumulator (made of both the instance and witness part): \(\mathrm {D}(\mathsf {acc}_{T})=1\).
Again the naive cost \(T\cdot |\varPhi |\) is replaced with the new cost \(T\cdot |\mathrm {V}|+|\mathrm {D}|\), but now it could be that an accumulator is, e.g., as large as \(|\varPhi |\); we only need the instance part of the accumulator (and predicate inputs) to be short.
The security property of a split accumulation scheme involves an extractor that outputs a long witness part from a short instance part and proof, and is reminiscent of the knowledge soundness of a succinct non-interactive argument. Turning this high level description into a working definition requires some care, however, and we view this as a contribution of this paper. Informally the security definition could be summarized as follows.
Definition 2 (informal)
A split accumulation scheme for a predicate \(\varPhi :X\times W\rightarrow \{0,1\}\) consists of a triple of algorithms \((\mathrm {P},\mathrm {V},\mathrm {D})\) that satisfies the following properties.
-
Completeness: For every accumulator \(\mathsf {acc}\) and predicate input \(\mathsf {q}= (\mathsf {qx},\mathsf {qw}) \in X\times W\), if \(\mathrm {D}(\mathsf {acc}) = 1\) and \(\varPhi (\mathsf {q}) = 1\), then for \((\mathsf {acc}^{\star },\mathsf {pf}^{\star }) \leftarrow \mathrm {P}(\mathsf {q},\mathsf {acc})\) it holds that \(\mathrm {V}(\mathsf {qx},\mathsf {acc}.\mathbbm {x},\mathsf {acc}^{\star }.\mathbbm {x},\mathsf {pf}^{\star }) = 1\) and \(\mathrm {D}(\mathsf {acc}^{\star }) = 1\).
-
Knowledge: For every efficiently-generated old accumulator instance \(\mathsf {acc}.\mathbbm {x}\), old input instance \(\mathsf {qx}\), accumulation proof \(\mathsf {pf}^{\star }\), and new accumulator \(\mathsf {acc}^{\star }\), if \(\mathrm {D}(\mathsf {acc}^{\star }) = 1\) and \(\mathrm {V}(\mathsf {qx},\mathsf {acc}.\mathbbm {x},\mathsf {acc}^{\star }.\mathbbm {x},\mathsf {pf}^{\star }) = 1\) then, with all but negligible probability, an efficient extractor can find an old accumulator witness \(\mathsf {acc}.\mathbbm {w}\) and predicate witness \(\mathsf {qw}\) such that \(\varPhi (\mathsf {qx},\mathsf {qw}) = 1\) and \(\mathrm {D}((\mathsf {acc}.\mathbbm {x},\mathsf {acc}.\mathbbm {w})) = 1\).
One can verify that split accumulation is indeed a relaxation of atomic accumulation: any atomic accumulation scheme is (trivially) a split accumulation scheme with empty witnesses. Crucially, however, a split accumulation scheme alleviates a major restriction of atomic accumulation, namely, that accumulators and predicate inputs have to be short.
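To make the interface concrete, here is a minimal Rust sketch of split accumulation; the trait, associated types, and method names are illustrative and not those of our libraries:

```rust
/// A minimal sketch of a split accumulation scheme interface. The verifier
/// sees only the short instance parts; the decider sees full accumulators.
trait SplitAccumulation {
    type InputInstance; // qx: short
    type InputWitness;  // qw: possibly long
    type AccInstance;   // acc.x: short
    type AccWitness;    // acc.w: possibly long
    type Proof;         // pf: short accumulation proof

    /// P: fold a predicate input into an old accumulator.
    fn prove(
        old_acc: (&Self::AccInstance, &Self::AccWitness),
        input: (&Self::InputInstance, &Self::InputWitness),
    ) -> ((Self::AccInstance, Self::AccWitness), Self::Proof);

    /// V: checks one accumulation step given only the *short* parts.
    fn verify(
        input_instance: &Self::InputInstance,
        old_acc_instance: &Self::AccInstance,
        new_acc_instance: &Self::AccInstance,
        proof: &Self::Proof,
    ) -> bool;

    /// D: validates a full (instance, witness) accumulator.
    fn decide(acc: (&Self::AccInstance, &Self::AccWitness)) -> bool;
}
```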
Next, in Sect. 2.2 we show that split accumulation suffices for recursive composition (which has surprising theoretical consequences) and then in Sect. 2.3 we present a NARK with split accumulation scheme based on discrete logarithms.
2.2 PCD from Split Accumulation
We summarize the main ideas behind Theorem 1, which obtains proof-carrying data (PCD) from any NARK that has a split accumulation scheme. To ease exposition, in this summary we focus on IVC, which can be viewed as the special case where a circuit \(F\) is repeatedly applied. That is, we wish to incrementally prove a claim of the form “\(F^{T}(z_0) = z_{T}\)” where \(F^{T}\) denotes \(F\) composed with itself \(T\) times.
Prior Work: Recursion via Atomic Accumulation. Our starting point is a theorem from [BCMS20] that obtains PCD from any SNARK that has an atomic accumulation scheme. The IVC construction implied by that theorem is roughly as follows.
-
The IVC prover receives a previous instance \(z_{i}\), proof \(\pi _{i}\), and accumulator \(\mathsf {acc}_{i}\); accumulates \((z_{i},\pi _{i})\) with \(\mathsf {acc}_{i}\) to obtain a new accumulator \(\mathsf {acc}_{i+1}\) and accumulation proof \(\mathsf {pf}_{i+1}\); and generates a SNARK proof \(\pi _{i+1}\) of the following claim expressed as a circuit R (see Fig. 2, middle box): “\(z_{i+1} = F(z_{i})\), and there exist a SNARK proof \(\pi _{i}\), accumulator \(\mathsf {acc}_{i}\), and accumulation proof \(\mathsf {pf}_{i+1}\) such that the accumulation verifier accepts \(((z_{i},\pi _{i}),\mathsf {acc}_{i},\mathsf {acc}_{i+1},\mathsf {pf}_{i+1})\)”. The IVC proof for \(z_{i+1}\) is \((\pi _{i+1},\mathsf {acc}_{i+1})\).
-
The IVC verifier validates an IVC proof \((\pi _{i},\mathsf {acc}_{i})\) for \(z_{i}\) by running the SNARK verifier on the instance \((z_{i},\mathsf {acc}_{i})\) and proof \(\pi _{i}\), and running the accumulation scheme decider on the accumulator \(\mathsf {acc}_{i}\).
In each iteration we maintain the invariant that if \(\mathsf {acc}_{i}\) is a valid accumulator (according to the decider) and \(\pi _{i}\) is a valid SNARK proof, then the computation is correct up to the \(i\)-th step.
Note that while it would suffice to prove that “\(z_{i+1} = F(z_{i})\), \(\pi _{i}\) is a valid SNARK proof, and \(\mathsf {acc}_{i}\) is a valid accumulator”, we cannot afford to do so. Indeed: (i) proving that \(\pi _{i}\) is a valid proof requires proving a statement about the argument verifier, which may not be sublinear; and (ii) proving that \(\mathsf {acc}_{i}\) is a valid accumulator requires proving a statement about the decider, which may not be sublinear. Instead of proving this claim directly, we “defer” it by having the prover accumulate \((z_{i},\pi _{i})\) into \(\mathsf {acc}_{i}\) to obtain a new accumulator \(\mathsf {acc}_{i+1}\). The soundness property of the accumulation scheme ensures that if \(\mathsf {acc}_{i+1}\) is valid and the accumulation verifier accepts \(((z_{i},\pi _{i}),\mathsf {acc}_{i},\mathsf {acc}_{i+1},\mathsf {pf}_{i+1})\), then \(\pi _{i}\) is a valid SNARK proof and \(\mathsf {acc}_{i}\) is a valid accumulator. Thus all that remains to maintain the invariant is for the prover to prove that the accumulation verifier accepts; this is possible provided that the accumulation verifier is sublinear.
Our Construction: Recursion via Split Accumulation. Our construction naturally extends the above idea to the setting of NARKs with split accumulation schemes. Indeed, the only difference to the above construction is that the proof \(\pi _{i+1}\) generated by the IVC prover is for the statement “\(z_{i+1} = F(z_{i})\), and there exist a NARK proof instance \(\pi _{i}.\mathbbm {x}\), an accumulator instance \(\mathsf {acc}_{i}.\mathbbm {x}\), and an accumulation proof \(\mathsf {pf}_{i+1}\) such that the accumulation verifier accepts \(((z_{i},\pi _{i}.\mathbbm {x}),\mathsf {acc}_{i}.\mathbbm {x},\mathsf {acc}_{i+1}.\mathbbm {x},\mathsf {pf}_{i+1})\)”, and accordingly the IVC verifier runs the NARK verifier on \(((z_{i},\mathsf {acc}_{i}.\mathbbm {x}),\pi _{i})\) (in addition to running the accumulation scheme decider on the accumulator \(\mathsf {acc}_{i}\)). This is illustrated in Fig. 2 (lower box). Note that the circuit R itself is unchanged from the atomic case; the difference is in whether we pass the entire proof and accumulators or just the \(\mathbbm {x}\) part.
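In code, one step of the resulting IVC prover can be sketched as follows (Rust; every type and closure is a hypothetical stand-in for \(F\), the NARK prover, and the accumulation prover):

```rust
/// One step of the IVC prover with split accumulation (a sketch). A proof is
/// a pair (instance, witness) = (PiX, PiW); an accumulator is (AccX, AccW).
fn ivc_prover_step<Z, PiX, PiW, AccX, AccW, Pf>(
    z_i: Z,
    pi_i: (PiX, PiW),
    acc_i: (AccX, AccW),
    step: impl Fn(&Z) -> Z,                                    // the function F
    accumulate: impl Fn(&(PiX, PiW), &(AccX, AccW)) -> ((AccX, AccW), Pf),
    prove: impl Fn(&Z, &PiX, &AccX, &AccX, &Pf) -> (PiX, PiW), // NARK prover for R
) -> (Z, (PiX, PiW), (AccX, AccW)) {
    let z_next = step(&z_i);
    // Fold the old (proof, accumulator) pair into a new accumulator.
    let (acc_next, pf) = accumulate(&pi_i, &acc_i);
    // Prove the claim of the circuit R: "z_next = F(z_i), and the accumulation
    // verifier accepts ((z_i, pi_i.x), acc_i.x, acc_next.x, pf)". Only the
    // short instance parts (the .0 components) enter the circuit.
    let pi_next = prove(&z_i, &pi_i.0, &acc_i.0, &acc_next.0, &pf);
    (z_next, pi_next, acc_next)
}
```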
Proving that this relaxation yields a secure construction is more complex. Similar to prior work, the proof of security proceeds via a recursive extraction argument, as we explain next.
For an atomic accumulation scheme ([BCMS20]), one maintains the following extraction invariant: the \(i\)-th extractor outputs \((z_{i},\pi _{i},\mathsf {acc}_{i})\) such that \(\pi _{i}\) is valid according to the SNARK, \(\mathsf {acc}_{i}\) is valid according to the decider, and \(F^{T-i}(z_{i}) = z_{T}\). The \(T\)-th “extractor” is simply the malicious prover, and we can obtain the \(i\)-th extractor by applying the knowledge guarantee of the SNARK to the \((i+1)\)-th extractor. That the invariant is maintained is implied by the soundness guarantee of the atomic accumulation scheme.
For a split accumulation scheme, we want to maintain the same extraction invariant; however, the extractor for the NARK will only yield \((z_{i},\pi _{i}.\mathbbm {x},\mathsf {acc}_{i}.\mathbbm {x})\), and not the corresponding witnesses. This is where we make use of the extraction property of the split accumulation scheme itself. Specifically, we interleave the knowledge guarantees of the NARK and accumulation scheme as follows: the \(i\)-th NARK extractor is obtained from the \((i+1)\)-th accumulation extractor using the knowledge guarantee of the NARK, and the \(i\)-th accumulation extractor is obtained from the \(i\)-th NARK extractor using the knowledge guarantee of the accumulation scheme. We take the malicious prover to be the \(T\)-th accumulation extractor.
From Sketch to Proof. In the full version we give the formal details of our construction and a proof of correctness. In particular, we show how to construct PCD, a more general primitive than IVC. In the PCD setting, rather than each computation step having a single input \(z_{i}\), it receives \(m\) inputs from different nodes. Proving correctness hence requires proving that all of these inputs were computed correctly. For our construction, this entails checking \(m\) proofs and \(m\) accumulators. To do this, we extend the definition of an accumulation scheme to allow accumulating multiple instance-proof pairs and multiple “old” accumulators.
We also note that the application to PCD leads to other definitional considerations, which are similar to those that have appeared in previous works [COS20, BCMS20]. In particular, the knowledge soundness guarantee for both the NARK and the accumulation scheme should be of the stronger “multi-instance witness-extended emulation with auxiliary input and output” type used in previous work. Additionally, the underlying construction of split accumulation achieves only expected polynomial-time extraction (in the ROM), and so the recursive extraction technique requires that we are able to extract from expected-time adversaries.
Remark 2 (knowledge soundness for PCD vs. IVC)
The proof of security for PCD extracts a transcript one full layer at a time. Since a layer consists of many nodes, each with an independently-generated proof and accumulator, a standard “single-instance” extraction guarantee is insufficient in general. However, in the special case of IVC, every layer consists of exactly one node, and so single-instance extraction does suffice.
Remark 3 (flavors of PCD)
The recent advances in PCD from accumulation achieve weaker efficiency guarantees than PCD from succinct verification, and formally these results are incomparable. (Starting from weaker assumptions they obtain weaker conclusions.) The essential feature that all these works achieve is that the efficiency of PCD algorithms is independent of the number of nodes in the PCD computation, which is how PCD is defined. That said, prior work on PCD from succinct verification [BCCT13, BCTV14, COS20] additionally guarantees that verifying a PCD proof is sublinear in a node’s computation; and prior work on PCD from atomic accumulation [BCMS20] merely ensures that a PCD proof has size (but not necessarily verification time) that is sublinear in a node’s computation. The PCD scheme obtained in this paper does not have these additional features: a PCD proof has size that is linear in a node’s computation.
2.3 NARK with Split Accumulation Based on DL
We summarize the main ideas behind Theorem 2, which provides, in the discrete logarithm setting with random oracles, a (zero knowledge) NARK for R1CS that has a (zero knowledge) split accumulation scheme whose accumulation verifier has constant size (more precisely, performs a constant number of group scalar multiplications, field operations, and random oracle calls).
Recall that R1CS is a standard generalization of arithmetic circuit satisfiability where the “circuit description” is given by coefficient matrices, as specified below. (“\(\circ \)” denotes the entry-wise product.)
Definition 3 (R1CS problem)
Given a finite field \(\mathbb {F}\), coefficient matrices \(A,B,C\in \mathbb {F}^{\mathsf {M}\times \mathsf {N}}\), and an instance vector \(x\in \mathbb {F}^{\mathsf {n}}\), is there a witness vector \(w\in \mathbb {F}^{\mathsf {N}-\mathsf {n}}\) such that \(Az\circ Bz= Cz\) for \(z:= (x,w) \in \mathbb {F}^{\mathsf {N}}\)?
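For concreteness, the following self-contained Rust snippet checks this condition over a toy prime field; the modulus and the sparse-row representation are illustrative choices, not those of our implementation:

```rust
const P: u128 = 2013265921; // a toy prime modulus, for illustration only

type Row = Vec<(usize, u128)>; // sparse row: (column index, coefficient)

/// Inner product of a sparse matrix row with the full assignment z, mod P.
fn dot(row: &Row, z: &[u128]) -> u128 {
    row.iter().fold(0, |acc, &(j, v)| (acc + v * z[j]) % P)
}

/// Checks Az ∘ Bz = Cz entry-wise for z = (x, w).
fn r1cs_satisfied(a: &[Row], b: &[Row], c: &[Row], x: &[u128], w: &[u128]) -> bool {
    let z: Vec<u128> = x.iter().chain(w.iter()).copied().collect();
    a.iter()
        .zip(b)
        .zip(c)
        .all(|((ra, rb), rc)| dot(ra, &z) * dot(rb, &z) % P == dot(rc, &z))
}
```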
We explain our construction incrementally. In Sect. 2.3.1 we begin by describing a NARK for R1CS that is not zero knowledge, and a “basic” split accumulation scheme for it that is also not zero knowledge. In Sect. 2.3.2 we show how to extend the NARK and its split accumulation scheme to both be zero knowledge. In Sect. 2.3.3 we explain why the accumulation scheme described so far is limited to the special case of 1 old accumulator and 1 predicate input (which suffices for IVC), and sketch how to obtain accumulation for m old accumulators and n predicate inputs (which is required for PCD); this motivates the problem of accumulating Hadamard products, which we subsequently address in Sect. 2.5.
We highlight here that both the NARK and the accumulation scheme are particularly simple compared to other protocols in the SNARK literature (especially with regard to constructions that enable recursion!), and view this as a significant advantage for potential deployments of these ideas in the real world.
2.3.1 Without Zero Knowledge
Let \(\mathsf {ck}= (G_1,\dots ,G_{\mathsf {M}}) \in \mathbb {G}^{\mathsf {M}}\) be a commitment key for the Pedersen commitment scheme with message space \(\mathbb {F}^{\mathsf {M}}\), and let \(\mathsf {CM}.\mathsf {Commit}(\mathsf {ck},\cdot ):\mathbb {F}^{\mathsf {M}}\rightarrow \mathbb {G}\) denote its commitment function. Consider the following non-interactive argument for R1CS:

The NARK’s security follows from the binding property of Pedersen commitments. (At this point we are not using any homomorphic properties, but we will in the accumulation scheme.) Moreover, denoting by \(\mathsf {K}= \varOmega (\mathsf {M})\) the number of non-zero entries in the coefficient matrices, the NARK’s efficiency is as follows:

The NARK may superficially appear useless because it has linear argument size and is not zero knowledge. Nevertheless, we can obtain an efficient split accumulation scheme for it, as we describe next.
The predicate to be accumulated is the NARK verifier with a suitable split between predicate instance and predicate witness: \(\varPhi \) takes as input a predicate instance \(\mathsf {qx}= (x,C_{\scriptscriptstyle A},C_{\scriptscriptstyle B},C_{\scriptscriptstyle C})\) and a predicate witness \(\mathsf {qw}= w\), and then runs the NARK verifier with R1CS instance \(x\) and proof \(\pi = (C_{\scriptscriptstyle A},C_{\scriptscriptstyle B},C_{\scriptscriptstyle C},w)\).
An accumulator \(\mathsf {acc}\) is split into an accumulator instance \(\mathsf {acc}.\mathbbm {x} = (x,C_{\scriptscriptstyle A},C_{\scriptscriptstyle B},C_{\scriptscriptstyle C},C_{\scriptscriptstyle \circ }) \in \mathbb {F}^{\mathsf {n}} \times \mathbb {G}^{4}\) and an accumulator witness \(\mathsf {acc}.\mathbbm {w} = w\in \mathbb {F}^{\mathsf {N}- \mathsf {n}}\). The accumulation decider \(\mathrm {D}\) validates a split accumulator \(\mathsf {acc}= (\mathsf {acc}.\mathbbm {x},\mathsf {acc}.\mathbbm {w})\) as follows: set \(z:= (x,w)\); compute the vectors \(z_{\scriptscriptstyle A}:= Az\), \(z_{\scriptscriptstyle B}:= Bz\), and \(z_{\scriptscriptstyle C}:= Cz\); and check that the following conditions hold: \(C_{\scriptscriptstyle A}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},z_{\scriptscriptstyle A})\), \(C_{\scriptscriptstyle B}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},z_{\scriptscriptstyle B})\), \(C_{\scriptscriptstyle C}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},z_{\scriptscriptstyle C})\), and \(C_{\scriptscriptstyle \circ }= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},z_{\scriptscriptstyle A}\circ z_{\scriptscriptstyle B})\).
Note that the accumulation decider \(\mathrm {D}\) is similar, but not equal, to the NARK verifier.
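A minimal Rust sketch of this decider follows; the group trait bounds are hypothetical (with `Default` standing in for the group identity), `mat_vec` is a stand-in for multiplication by \(A,B,C\), and the Pedersen commitments here are deterministic since this scheme is not yet zero knowledge:

```rust
use std::ops::{Add, Mul};

/// Deterministic Pedersen commitment Commit(ck, v) = Σ_i v_i·G_i; `Default`
/// stands in for the identity of the hypothetical group type `G`.
fn pedersen_commit<G, F>(ck: &[G], v: &[F]) -> G
where
    G: Add<Output = G> + Mul<F, Output = G> + Copy + Default,
    F: Copy,
{
    v.iter().zip(ck).fold(G::default(), |acc, (&s, &g)| acc + g * s)
}

/// The accumulation decider D: recompute z := (x, w) and z_A, z_B, z_C, then
/// check the four commitments in acc.x. `mat_vec('A', z)` is a stand-in for
/// the product Az (and likewise for B and C).
fn decide<G, F>(
    ck: &[G],
    acc_x: (&[F], G, G, G, G), // (x, C_A, C_B, C_C, C_∘)
    acc_w: &[F],               // w
    mat_vec: impl Fn(char, &[F]) -> Vec<F>,
) -> bool
where
    G: Add<Output = G> + Mul<F, Output = G> + Copy + Default + PartialEq,
    F: Copy + Mul<Output = F>,
{
    let (x, c_a, c_b, c_c, c_had) = acc_x;
    let z: Vec<F> = x.iter().chain(acc_w).copied().collect();
    let (za, zb, zc) = (mat_vec('A', &z), mat_vec('B', &z), mat_vec('C', &z));
    let had: Vec<F> = za.iter().zip(&zb).map(|(&u, &v)| u * v).collect();
    pedersen_commit(ck, &za) == c_a
        && pedersen_commit(ck, &zb) == c_b
        && pedersen_commit(ck, &zc) == c_c
        && pedersen_commit(ck, &had) == c_had
}
```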
We are left to describe the accumulation prover and accumulation verifier. Both have access to a random oracle \(\rho \). For adaptive security, queries to the random oracle should include a hash \(\tau \) of the coefficient matrices \(A,B,C\) and instance size \(\mathsf {n}\), which can be precomputed in an offline phase. (Formally, this is done via the indexer algorithm of the accumulation scheme, which receives the coefficient matrices and instance size, performs all one-time computations such as deriving \(\tau \), and produces an accumulator proving key \(\mathsf {apk}\), an accumulator verification key \(\mathsf {avk}\), and a decision key \(\mathsf {dk}\) for \(\mathrm {P}\), \(\mathrm {V}\), and \(\mathrm {D}\) respectively.)
The intuition for accumulation is to set the new accumulator to be a random linear combination of the old accumulator and predicate input, and use the accumulation proof to collect cross terms that arise from the Hadamard product (a bilinear, not linear, operation). This naturally leads to the following simple construction.

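In equations: folding one old accumulator (with full assignment \(z:= (x,w)\)) and one predicate input (with assignment \(z'\)) via a random challenge \(\beta \) gives
$$\begin{aligned} A(z+ \beta z') \circ B(z+ \beta z') = Az\circ Bz+ \beta \,\bigl (Az\circ Bz' + Az' \circ Bz\bigr ) + \beta ^{2}\, Az' \circ Bz' . \end{aligned}$$
The middle (cross) term is what the accumulation proof \(\mathsf {pf}\) commits to; and since a fresh predicate input satisfies \(Az' \circ Bz' = Cz'\), the new accumulator's commitment \(C_{\scriptscriptstyle \circ }\) is obtained by combining the old \(C_{\scriptscriptstyle \circ }\), the cross-term commitment in \(\mathsf {pf}\), and the input's \(C_{\scriptscriptstyle C}\) with coefficients \(1,\beta ,\beta ^{2}\).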
The efficiency of the split accumulation scheme can be summarized by the following table:

The key efficiency feature is that the accumulation verifier performs only 1 call to the random oracle, a constant number of group scalar multiplications, and a constant number of field operations. (More precisely, the verifier performs \(\mathsf {n}\) field operations, but this count does not grow with circuit size and, more fundamentally, is inevitable because the accumulation verifier must receive the R1CS instance \(x\in \mathbb {F}^{\mathsf {n}}\) as input.)
2.3.2 With Zero Knowledge
We explain how to add zero knowledge to the approach described in the previous section.
First, we extend the NARK to additionally achieve zero knowledge. For this we construct a sigma protocol for R1CS based on Pedersen commitments, which is summarized in Fig. 3; then we apply the Fiat–Shamir transformation to it to obtain a corresponding zkNARK for R1CS. Here the commitment key for the Pedersen commitment scheme is \(\mathsf {ck}= (G_1,\dots ,G_{\mathsf {M}},H) \in \mathbb {G}^{\mathsf {M}+1}\), as we need a spare group element \(H\) for the commitment randomness. The blue text in the figure represents the “diff” compared to the non-zero-knowledge version, and indeed if all such text were removed the protocol would collapse to the previous one.
Second, we extend the split accumulation scheme to accumulate the modified protocol for R1CS. Again the predicate being accumulated is the NARK verifier but now since the NARK verifier has changed so does the predicate. A zkNARK proof \(\pi \) now can be viewed as a pair \((\pi _{1},\pi _{2})\) denoting the prover’s commitment and response in the sigma protocol. Then the predicate \(\varPhi \) takes as input a predicate instance \(\mathsf {qx}= (x,\pi _{1}) \in \mathbb {F}^{\mathsf {n}} \times \mathbb {G}^{8}\) and a predicate witness \(\mathsf {qw}= \pi _{2} \in \mathbb {F}^{\mathsf {N}-\mathsf {n}+4}\), and then runs the NARK verifier with R1CS instance \(x\) and proof \(\pi = (\pi _{1},\pi _{2})\).
An accumulator \(\mathsf {acc}\) is split into an accumulator instance \(\mathsf {acc}.\mathbbm {x} = (x,C_{\scriptscriptstyle A},C_{\scriptscriptstyle B},C_{\scriptscriptstyle C},C_{\scriptscriptstyle \circ }) \in \mathbb {F}^{\mathsf {n}} \times \mathbb {G}^{4}\) (the same as before) and an accumulator witness \(\mathsf {acc}.\mathbbm {w} = (w,\sigma _{\scriptscriptstyle A},\sigma _{\scriptscriptstyle B},\sigma _{\scriptscriptstyle C},\sigma _{\circ }) \in \mathbb {F}^{\mathsf {N}- \mathsf {n}+4}\). The decider is essentially the same as in Sect. 2.3.1, except that now the four commitments are computed using the corresponding randomness in \(\mathsf {acc}.\mathbbm {w}\).
The accumulation prover and accumulation verifier can be extended, in a straightforward way, to support the new zkNARK protocol; we provide these in Fig. 4, with text in blue to denote the “diff” to accumulate the zero knowledge features of the NARK and with text in red to denote the features to make accumulation itself zero knowledge. There we use \(\rho _{\scriptscriptstyle \mathsf {NARK}}\) to denote the oracle used for the zkNARK for R1CS, which is obtained via the Fiat–Shamir transformation applied to a sigma protocol (as mentioned above); for adaptive security, the Fiat–Shamir query includes, in addition to \(\pi _{1}\), a hash of the coefficient matrices and the R1CS input \(x\in \mathbb {F}^{\mathsf {n}}\) (this means that the Fiat–Shamir query equals \((\tau ,\mathsf {qx})=(\tau ,x,\pi _{1})\)).
Note that now the accumulation prover and accumulation verifier are each making 2 calls to the random oracle, rather than 1 as before, because they have to additionally compute the sigma protocol’s challenge.
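To illustrate how these challenges are derived non-interactively, here is a minimal sketch using SHA-256 (via the `sha2` crate) as a stand-in for the random oracle; the serialization and domain separation shown are simplified assumptions, not our implementation's actual encoding:

```rust
use sha2::{Digest, Sha256}; // requires the `sha2` crate

/// Derives a Fiat–Shamir challenge from (domain, τ, message bytes); a
/// simplified stand-in for the oracles ρ_NARK and ρ used in the text.
fn challenge(domain: &[u8], tau: &[u8], msg: &[u8]) -> [u8; 32] {
    let mut h = Sha256::new();
    h.update(domain); // separates the two oracles, e.g. b"NARK" vs b"ACC"
    h.update(tau);    // hash of the coefficient matrices and instance size
    h.update(msg);    // e.g. (x, π_1) for the sigma protocol's challenge
    h.finalize().into()
}
```

The 32 output bytes would then be reduced modulo the field size to obtain a challenge in \(\mathbb {F}\).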
2.3.3 Towards General Accumulation
The accumulation schemes described in Sects. 2.3.1 and 2.3.2 are limited to a special case, which we could call the “IVC setting”, where accumulation involves 1 old accumulator and 1 predicate input. However, the definition of accumulation requires supporting m old accumulators \([\mathsf {acc}_j]_{j=1}^m = [(\mathsf {acc}_{j}.\mathbbm {x}, \mathsf {acc}_{j}.\mathbbm {w})]_{j=1}^m\) and n predicate inputs \({[(\mathsf {qx}_i, \mathsf {qw}_i)]}_{i=1}^n\), for any m and n. (E.g., to construct PCD we set both m and n equal to the “arity” of the compliance predicate.) How can we extend the ideas described so far to this more general case?
The zkNARK verifier performs two types of computations: linear checks and a Hadamard product check. We describe how to accumulate each of these in the general case.
-
Linear checks. A split accumulator \(\mathsf {acc}=(\mathsf {acc}.\mathbbm {x},\mathsf {acc}.\mathbbm {w})\) in Sect. 2.3.2 included sub-accumulators for different linear checks: \(x,C_{\scriptscriptstyle A},C_{\scriptscriptstyle B},C_{\scriptscriptstyle C}\) in \(\mathsf {acc}.\mathbbm {x}\) and \(w,\sigma _{\scriptscriptstyle A},\sigma _{\scriptscriptstyle B},\sigma _{\scriptscriptstyle C}\) in \(\mathsf {acc}.\mathbbm {w}\). We can keep these components and simply use more random coefficients or, as we do, further powers of the element \(\beta \). For example, in the accumulation prover \(\mathrm {P}\) a computation such as \(w^{\star }:= w+ \beta \cdot w'\) is replaced by a computation such as \(w^{\star }:= \sum _{j=1}^{m} \beta ^{j-1} \cdot w_{j} + \sum _{i=1}^{n} \beta ^{m+i-1} \cdot w'_{i}\).
-
Hadamard product check. A split accumulator \(\mathsf {acc}=(\mathsf {acc}.\mathbbm {x},\mathsf {acc}.\mathbbm {w})\) in Sect. 2.3.2 also included a sub-accumulator for the Hadamard product check: \(C_{\scriptscriptstyle \circ }\) in \(\mathsf {acc}.\mathbbm {x}\) and \(\sigma _{\circ }\) in \(\mathsf {acc}.\mathbbm {w}\). Because a Hadamard product is a bilinear operation, combining two Hadamard products via a random coefficient led to a quadratic polynomial whose coefficients include the two original Hadamard products and a cross term. This is indeed why we stored the cross term in the accumulation proof \(\mathsf {pf}\). However, if we consider the cross terms that arise from combining more than two Hadamard products (i.e., when \(m+n>2\)) then the corresponding polynomials do not lend themselves to accumulation because the original Hadamard products appear together with other cross terms (see the calculation after this list). To handle this issue, we introduce in Sect. 2.5 a new subroutine that accumulates Hadamard products via an additional round of interaction.
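To see the issue concretely, fold \(k = m+n\) vectors \(z_{1},\dots ,z_{k}\) using powers of a single challenge \(\beta \) and expand:
$$\begin{aligned} A\Bigl ({\textstyle \sum _{i=1}^{k}} \beta ^{i-1} z_{i}\Bigr ) \circ B\Bigl ({\textstyle \sum _{j=1}^{k}} \beta ^{j-1} z_{j}\Bigr ) = {\textstyle \sum _{i,j \in [k]}} \beta ^{i+j-2} \, (Az_{i} \circ Bz_{j}) . \end{aligned}$$
For \(k=2\) every power of \(\beta \) carries either one original product or the single cross term, but for \(k>2\) distinct terms collide at the same power (e.g., \(Az_{1} \circ Bz_{3}\) and \(Az_{2} \circ Bz_{2}\) both appear at \(\beta ^{2}\)), so the original Hadamard products can no longer be isolated from the cross terms.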
2.4 On Proving Knowledge Soundness
In order to construct accumulation schemes that fulfill the type of knowledge soundness that we ultimately need for PCD (see Sect. 2.2), we formulate a new expected-time forking lemma in the random oracle model, which is informally stated below. In our setting, \((\mathfrak {q},\mathfrak {b},\mathsf {o}) \in L\) if \(\mathsf {o}= ([\mathsf {qx}_{i}]_{i=1}^n,\mathsf {acc},\mathsf {pf})\) is such that \(\mathrm {D}(\mathsf {acc}) = 1\) and, given that \(\rho (\mathfrak {q}) = \mathfrak {b}\), the accumulation verifier accepts: \(\mathrm {V}^{\rho }([\mathsf {qx}_{i}]_{i=1}^n,\mathsf {acc}.\mathbbm {x},\mathsf {pf}) = 1\).
Lemma 1
(informal). Let L be an efficiently recognizable set. There exists an algorithm \(\mathsf {Fork}\) such that for every expected polynomial time algorithm \(A\) and integer \(N\in \mathbb {N}\) the following holds. With all but negligible probability over the choice of random oracle \(\rho \), randomness r of \(A\), and randomness of \(\mathsf {Fork}\), if \(A^{\rho }(r)\) outputs a tuple \((\mathfrak {q},\mathfrak {b},\mathsf {o}) \in L\) with \(\rho (\mathfrak {q})=\mathfrak {b}\), then \(\mathsf {Fork}^{A,\rho }(1^{N},\mathfrak {q},\mathfrak {b},\mathsf {o},r)\) outputs \([(\mathfrak {b}_{j},\mathsf {o}_{j})]_{j=1}^{N}\) such that \(\mathfrak {b}_{1},\dots ,\mathfrak {b}_{N}\) are pairwise distinct and for each \(j\in [N]\) it holds that \((\mathfrak {q},\mathfrak {b}_{j},\mathsf {o}_{j}) \in L\).
This forking lemma differs from prior forking lemmas in three significant ways. First, it is in the random oracle model rather than the interactive setting (unlike [BCC+16]). Second, we can obtain any polynomial number of accepting transcripts in expected polynomial time with only negligible loss in success probability (unlike forking lemmas for signature schemes, which typically extract two transcripts in strict polynomial time [BN06]). Finally, it holds even if the adversary itself runs in expected (as opposed to strict) polynomial time. This is important for our application to PCD where the extractor in one recursive step becomes the adversary in the next. This last feature requires some care, since the running time of the adversary, and in particular the length of its random tape, may not be bounded.
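A minimal Rust sketch of the forking algorithm follows; the closures are hypothetical stand-ins for sampling fresh oracle answers and replaying the adversary, and the expected-time and success-probability bookkeeping of the actual proof is elided:

```rust
/// A sketch of Fork: replay the adversary with the random oracle reprogrammed
/// at the query point q, collecting N accepting outputs under pairwise
/// distinct oracle answers. This loop terminates quickly (in expectation)
/// only if the adversary's success probability is non-negligible.
fn fork<B: PartialEq, O>(
    n: usize,
    mut sample_b: impl FnMut() -> B,      // a fresh uniform answer for ρ(q)
    mut run: impl FnMut(&B) -> Option<O>, // replay A^ρ(r) with ρ(q) := b
) -> Vec<(B, O)> {
    let mut out: Vec<(B, O)> = Vec::new();
    while out.len() < n {
        let b = sample_b();
        // Challenges must be pairwise distinct, as required by the lemma.
        if out.iter().any(|(prev, _)| *prev == b) {
            continue;
        }
        if let Some(o) = run(&b) {
            out.push((b, o));
        }
    }
    out
}
```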
Moreover, in our security proofs we at times additionally rely on an expected-time variant of the zero-finding game lemma from [BCMS20] to show that if a particular polynomial equation holds at a point obtained from the random oracle via a “commitment” to the equation, then it must with overwhelming probability be a polynomial identity. For more details, see the full version.
2.5 Split Accumulation for Hadamard Products
We construct a split accumulation scheme for a predicate \(\varPhi _{\scriptscriptstyle \mathsf {HP}}\) that considers the Hadamard product of committed vectors. For a commitment key \(\mathsf {ck}\) for messages in \(\mathbb {F}^{\ell }\), the predicate \(\varPhi _{\scriptscriptstyle \mathsf {HP}}\) takes as input a predicate instance \(\mathsf {qx}=(C_{1},C_{2},C_{3}) \in \mathbb {G}^{3}\) consisting of three Pedersen commitments, a predicate witness \(\mathsf {qw}=(a,b,\omega _{1},\omega _{2},\omega _{3})\) consisting of two vectors \(a,b\in \mathbb {F}^{\ell }\) and three opening randomness elements \(\omega _{1},\omega _{2},\omega _{3}\in \mathbb {F}\), and checks that \(C_{1}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},a;\omega _{1})\), \(C_{2}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},b;\omega _{2})\), and \(C_{3}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},a\circ b;\omega _{3})\). In other words, \(C_{3}\) is a commitment to the Hadamard product of the vectors committed in \(C_{1}\) and \(C_{2}\).
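A direct Rust transcription of \(\varPhi _{\scriptscriptstyle \mathsf {HP}}\) (a sketch; the group trait bounds are hypothetical, and the commitment uses the spare generator \(H\) for hiding):

```rust
use std::ops::{Add, Mul};

/// Hiding Pedersen commitment Commit(ck, v; ω) = ω·H + Σ_i v_i·G_i, over a
/// hypothetical group type `G` with scalar multiplication by `F`.
fn commit<G, F>(ck: &[G], h: G, v: &[F], omega: F) -> G
where
    G: Add<Output = G> + Mul<F, Output = G> + Copy,
    F: Copy,
{
    v.iter().zip(ck).fold(h * omega, |acc, (&s, &g)| acc + g * s)
}

/// The Hadamard product predicate Φ_HP: C_3 must commit to a ∘ b, where a and
/// b are the vectors committed in C_1 and C_2.
fn phi_hp<G, F>(ck: &[G], h: G, qx: (G, G, G), qw: (&[F], &[F], F, F, F)) -> bool
where
    G: Add<Output = G> + Mul<F, Output = G> + Copy + PartialEq,
    F: Copy + Mul<Output = F>,
{
    let (c1, c2, c3) = qx;
    let (a, b, w1, w2, w3) = qw;
    let ab: Vec<F> = a.iter().zip(b).map(|(&x, &y)| x * y).collect();
    commit(ck, h, a, w1) == c1
        && commit(ck, h, b, w2) == c2
        && commit(ck, h, &ab, w3) == c3
}
```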
Theorem 3
(informal). The Hadamard product predicate \(\varPhi _{\scriptscriptstyle \mathsf {HP}}\) has a split accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {HP}}\) that is secure in the random oracle model (and assuming the hardness of the discrete logarithm problem) where verifying accumulation requires 5 group scalar multiplications and O(1) field operations per claim, and results in an accumulator whose instance part is 3 group elements and witness part is \(O(\ell )\) field elements. Moreover, the accumulation scheme can be made zero knowledge at a sub-constant overhead per claim.
Below we summarize the ideas behind this result. Our construction directly extends to accumulate any bilinear function (see Remark 4).
A Bivariate Identity. The accumulation scheme is based on a bivariate polynomial identity, and is the result of turning a public-coin two-round reduction into a non-interactive scheme by using the random oracle. Given n pairs of vectors \([(a_{i},b_{i})]_{i=1}^{n}\), consider the following two polynomials with coefficients in \(\mathbb {F}^{\ell }\):
$$\begin{aligned} a(X,Y) := {\textstyle \sum _{i=1}^{n}} X^{i-1} Y^{i-1} a_{i} \qquad \text {and}\qquad b(X) := {\textstyle \sum _{i=1}^{n}} X^{n-i} b_{i} . \end{aligned}$$
The Hadamard product of the two polynomials can be written as
$$\begin{aligned} a(X,Y) \circ b(X) = {\textstyle \sum _{i=1}^{2n-1}} X^{i-1} \, t_{i}(Y) \qquad \text {with}\qquad t_{n}(Y) = {\textstyle \sum _{i=1}^{n}} Y^{i-1} \, (a_{i} \circ b_{i}) . \end{aligned}$$
The expression of the coefficient polynomials \(\{t_{i}(Y)\}_{i \ne n}\) is not important; instead, the important aspect here is that a coefficient polynomial, namely \(t_{n}(Y)\), includes the Hadamard products of all n pairs of vectors as different coefficients. This identity is the starting point of the accumulation scheme, which informally evaluates this expression at random points to reduce the n Hadamard products to 1 Hadamard product. Similar ideas are used to reduce several Hadamard products to a single inner product in [BCC+16, BBB+18].
Batching Hadamard Products. We describe a public-coin two-round reduction from n Hadamard product claims to 1 Hadamard product claim. The verifier receives n predicate instances \([\mathsf {qx}_{i}]_{i=1}^n=[(C_{1,i},C_{2,i},C_{3,i})]_{i=1}^{n}\) each consisting of three Pedersen commitments, and the prover receives corresponding predicate witnesses \([\mathsf {qw}_{i}]_{i=1}^n=[(a_{i},b_{i},\omega _{1,i},\omega _{2,i},\omega _{3,i})]_{i=1}^{n}\) containing the corresponding openings.
-
The verifier sends a first challenge \(\mu \in \mathbb {F}\).
-
The prover computes the product polynomial \(a(X,\mu ) \circ b(X) = \sum _{i=1}^{2n-1} X^{i-1} t_{i}(\mu ) \in \mathbb {F}^{\ell }[X]\); for each \(i \in [2n-1]\setminus \{n\}\), computes the commitment \(C_{t,i} := \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},t_{i}(\mu ))\); and sends to the verifier an accumulation proof \(\mathsf {pf}:= [C_{t,i}]_{i \in [2n-1]\setminus \{n\}}\).
-
The verifier sends a second challenge \(\nu \in \mathbb {F}\).
-
The verifier computes and outputs a new predicate instance \(\mathsf {qx}=(C_{1},C_{2},C_{3})\):
$$\begin{aligned} C_{1}&= {\textstyle \sum _{i=1}^{n}} \nu ^{i-1} \mu ^{i-1} C_{1,i} ,\\ C_{2}&= {\textstyle \sum _{i=1}^{n}} \nu ^{n-i} C_{2,i},\\ C_{3}&= {\textstyle \sum _{i=1}^{n-1}} \nu ^{i-1} C_{t,i} + \nu ^{n-1} {\textstyle \sum _{i=1}^{n}} \mu ^{i-1} C_{3,i} + {\textstyle \sum _{i=1}^{n-1}} \nu ^{n+i-1} C_{t,n+i} . \end{aligned}$$
-
The prover computes and outputs a corresponding predicate witness \(\mathsf {qw}=(a,b,\omega _{1},\omega _{2},\omega _{3})\): \(a:= a(\nu ,\mu )\), \(b:= b(\nu )\), \(\omega _{1}:= \sum _{i=1}^{n} \nu ^{i-1}\mu ^{i-1}\omega _{1,i}\), \(\omega _{2}:= \sum _{i=1}^{n} \nu ^{n-i}\omega _{2,i}\), and \(\omega _{3}\) the analogous linear combination of \([\omega _{3,i}]_{i=1}^{n}\) and the randomness of the cross-term commitments.
Observe that the new predicate instance \(\mathsf {qx}=(C_{1},C_{2},C_{3})\) consists of commitments to \(a(\nu ,\mu ), b(\nu ),a(\nu ,\mu ) \circ b(\nu )\) respectively, and the predicate witness \(\mathsf {qw}=(a,b,\omega _{1},\omega _{2},\omega _{3})\) consists of corresponding opening information. The properties of low-degree polynomials imply that if any of the n claims is incorrect (there is \(i \in [n]\) such that \(\varPhi _{\scriptscriptstyle \mathsf {HP}}(\mathsf {qx}_i,\mathsf {qw}_i)=0\)) then, with high probability, so is the output claim (\(\varPhi _{\scriptscriptstyle \mathsf {HP}}(\mathsf {qx},\mathsf {qw})=0\)).
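As a sanity check on the displayed formulas, the verifier's combination step can be sketched in Rust as follows (generic over hypothetical group and field types; `one` is the field's multiplicative identity, and the sketch assumes \(n \ge 2\)):

```rust
use std::ops::{Add, Mul};

/// Returns [s^0, s^1, ..., s^(k-1)]; `one` is the multiplicative identity.
fn powers<F: Copy + Mul<Output = F>>(one: F, s: F, k: usize) -> Vec<F> {
    let mut out = Vec::with_capacity(k);
    let mut cur = one;
    for _ in 0..k {
        out.push(cur);
        cur = cur * s;
    }
    out
}

/// Computes the batched instance (C_1, C_2, C_3) from n claims and the
/// cross-term commitments, following the displayed formulas; assumes n >= 2.
fn batch_instance<G, F>(
    one: F,
    mu: F,
    nu: F,
    qx: &[(G, G, G)], // [(C_{1,i}, C_{2,i}, C_{3,i})] for i = 1..n
    pf: &[G],         // [C_{t,1}, ..., C_{t,n-1}, C_{t,n+1}, ..., C_{t,2n-1}]
) -> (G, G, G)
where
    G: Add<Output = G> + Mul<F, Output = G> + Copy,
    F: Copy + Mul<Output = F>,
{
    let n = qx.len();
    let nu_pow = powers(one, nu, 2 * n);
    let mu_pow = powers(one, mu, n);
    let sum = |terms: Vec<G>| -> G {
        let mut it = terms.into_iter();
        let first = it.next().expect("non-empty sum");
        it.fold(first, |acc, t| acc + t)
    };
    // C_1 = sum_i nu^{i-1} mu^{i-1} C_{1,i}
    let c1 = sum((0..n).map(|i| qx[i].0 * (nu_pow[i] * mu_pow[i])).collect());
    // C_2 = sum_i nu^{n-i} C_{2,i}
    let c2 = sum((0..n).map(|i| qx[i].1 * nu_pow[n - 1 - i]).collect());
    // C_3: low cross terms, nu^{n-1} * sum_i mu^{i-1} C_{3,i}, high cross terms.
    let c3_lo = sum((0..n - 1).map(|i| pf[i] * nu_pow[i]).collect());
    let c3_mid = sum((0..n).map(|i| qx[i].2 * mu_pow[i]).collect()) * nu_pow[n - 1];
    let c3_hi = sum((0..n - 1).map(|i| pf[n - 1 + i] * nu_pow[n + i]).collect());
    (c1, c2, c3_lo + c3_mid + c3_hi)
}
```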
Split Accumulation. The batching protocol described above yields a split accumulation scheme for \(\varPhi _{\scriptscriptstyle \mathsf {HP}}\) in the random oracle model. An accumulator \(\mathsf {acc}\) has the same form as a predicate input \((\mathsf {qx},\mathsf {qw})\): \(\mathsf {acc}.\mathbbm {x}\) has the same form as a predicate instance \(\mathsf {qx}\), and \(\mathsf {acc}.\mathbbm {w}\) has the same form as a predicate witness \(\mathsf {qw}\). The accumulation decider \(\mathrm {D}\) simply equals \(\varPhi _{\scriptscriptstyle \mathsf {HP}}\) (this is well-defined due to the prior sentence). The accumulation prover and accumulation verifier are as follows.
-
The accumulation prover \(\mathrm {P}\) runs the interactive reduction by relying on the random oracle to generate the random verifier messages (i.e., it applies the Fiat–Shamir transformation to the reduction), in order to produce an accumulation proof \(\mathsf {pf}\) as well as an accumulator \(\mathsf {acc}= (\mathsf {qx},\mathsf {qw})\) whose instance part is computed like the verifier of the reduction and witness part is computed like the prover of the reduction.
-
The accumulation verifier \(\mathrm {V}\) re-derives the challenges using the random oracle, and checks that \(\mathsf {qx}\) was correctly derived from \([\mathsf {qx}_{i}]_{i=1}^n\) (also via the help of the accumulation proof \(\mathsf {pf}\)).
The construction described above is not zero knowledge. One way to achieve zero knowledge is for the accumulation prover to sample a random predicate input that satisfies the predicate, accumulate it, and include it as part of the accumulation proof \(\mathsf {pf}\). In our construction we opt for a more efficient solution, leveraging the fact that we are not actually interested in accumulating the random predicate input.
Efficiency. The efficiency claimed in Theorem 3 is evident from the construction. The (short) instance part of an accumulator consists of 3 group elements, while the (long) witness part of an accumulator consists of \(O(\ell )\) field elements. The accumulation verifier \(\mathrm {V}\) performs 2 random oracle calls, 5 group scalar multiplications, and O(1) field operations per accumulated claim.
Security. Given an adversary that produces Hadamard product claims \([\mathsf {qx}_{i}]_{i=1}^n= [(C_{1,i},C_{2,i},C_{3,i}) ]_{i=1}^{n}\), a single Hadamard product claim \(\mathsf {qx}= (C_{1},C_{2},C_{3})\) and corresponding witness \(\mathsf {qw}=(a,b,\omega _{1},\omega _{2},\omega _{3})\), and an accumulation proof \(\mathsf {pf}\) that makes the accumulation verifier accept, we need to extract witnesses \([\mathsf {qw}_{i}]_{i=1}^n=[(a_{i},b_{i},\omega _{1,i},\omega _{2,i},\omega _{3,i})]_{i=1}^{n}\) for the instances \([\mathsf {qx}_{i}]_{i=1}^n\). Our security proof works in the random oracle model, assuming hardness of the discrete logarithm problem.
In the proof we apply our expected-time forking lemma twice (see Sect. 2.4 for a discussion of this lemma and the full version for details including a corollary that summarizes its double invocation). This lets us construct a two-level tree of transcripts with branching factor n on the first challenge \(\mu \) and branching factor \(2n-1\) on the second challenge \(\nu \). Given such a transcript tree, the extractor works as follows:
-
1.
Using the transcripts corresponding to challenges \(\{(\mu _1,\nu _{1,k})\}_{k \in [n]}\) we extract \(\ell \)-element vectors \([a_{i}]_{i=1}^n,[b_{i}]_{i=1}^n\) and field elements \([\omega _{1,i}]_{i=1}^{n}, [\omega _{2,i}]_{i=1}^{n}\) such that \([a_{i}]_{i=1}^n\) and \([b_{i}]_{i=1}^n\) are committed in \([C_{1,i}]_{i=1}^n\) and \([C_{2,i}]_{i=1}^n\) under randomness \([\omega _{1,i}]_{i=1}^n\) and \([\omega _{2,i}]_{i=1}^n\), respectively.
-
2.
Define \(a(X,Y) := \sum _{i=1}^{n} X^{i-1} Y^{i-1} a_{i}\) and \(b(X) := \sum _{i=1}^{n} X^{n-i} b_{i}\), using the vectors extracted above; then let \(t_{i}(Y)\) be the coefficient of \(X^{i-1}\) in \(a(X,Y) \circ b(X)\). For each \(j \in [n]\), using the transcripts corresponding to challenges \(\{(\mu _j,\nu _{j,k})\}_{k \in [2n-1]}\), we extract field elements \([\tau _{i}^{(j)}]_{i=1}^{2n-1}\) such that \(t_{n}(\mu _{j})\) is committed in \(\sum _{i=1}^{n} \mu _j^{i-1} C_{3,i}\) under randomness \(\tau _{n}^{(j)}\) and \([t_{i}(\mu _{j}),t_{n+i}(\mu _{j})]_{i=1}^{n-1}\) are committed in \([C_{t,i},C_{t,n+i}]_{i=1}^{n-1}\) under randomness \([\tau _{i}^{(j)},\tau _{n+i}^{(j)}]_{i=1}^{n-1}\), respectively.
-
3.
Compute the solution \([\omega _{3,i}]_{i=1}^{n}\) to the linear system \(\{\tau _{n}^{(j)} = \sum _{i=1}^{n} \mu _j^{i-1} \omega _{3,i}\}_{j \in [n]}\). Together with the relation \(\{t_{n}(\mu _j) = \sum _{i=1}^{n} \mu _j^{i-1} a_i \circ b_i\}_{j \in [n]}\), we deduce that \(C_{3,i}\) is a commitment to \(a_{i} \circ b_{i}\) under randomness \(\omega _{3,i}\) for all \(i \in [n]\) (see the sketch after this list).
-
4.
For each \(i \in [n]\), output \(\mathsf {qw}_{i} := (a_{i},b_{i},\omega _{1,i},\omega _{2,i},\omega _{3,i})\).
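Step 3 is a Vandermonde solve over \(\mathbb {F}\). The following Python sketch (toy field; the helper solve_mod is our own, not from the paper) recovers the randomness \([\omega _{3,i}]\) from the extracted \([\tau _{n}^{(j)}]\) exactly as the extractor does:

```python
import random

p, n = 1019, 4                      # toy prime field and number of claims

def solve_mod(M, rhs):              # Gauss-Jordan elimination over F_p
    rows = [row[:] + [r] for row, r in zip(M, rhs)]
    for col in range(n):
        piv = next(r for r in range(col, n) if rows[r][col] % p)
        rows[col], rows[piv] = rows[piv], rows[col]
        inv = pow(rows[col][col], p - 2, p)   # field inverse via Fermat
        rows[col] = [x * inv % p for x in rows[col]]
        for r in range(n):
            if r != col and rows[r][col]:
                f = rows[r][col]
                rows[r] = [(x - f * y) % p for x, y in zip(rows[r], rows[col])]
    return [row[n] for row in rows]

w3 = [random.randrange(p) for _ in range(n)]         # randomness to be recovered
mus = random.sample(range(1, p), n)                  # n distinct challenges mu_j
V = [[pow(m, i, p) for i in range(n)] for m in mus]  # Vandermonde matrix (mu_j^{i-1})
taus = [sum(V[j][i] * w3[i] for i in range(n)) % p for j in range(n)]
assert solve_mod(V, taus) == w3                      # extractor recovers [w3_i]
```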
Remark 4 (extension to any bilinear operation)
The ideas described above extend, in a straightforward way, to accumulating any bilinear operation of committed vectors. Let \(f :\mathbb {F}^{\ell } \times \mathbb {F}^{\ell } \rightarrow \mathbb {F}^{m}\) be a bilinear operation, i.e., such that: (a) \(f(a+a',b)=f(a,b)+f(a',b)\); (b) \(f(a,b+b')=f(a,b)+f(a,b')\); (c) \(\alpha \cdot f(a,b) = f(\alpha a,b) = f(a,\alpha b)\). Let \(\varPhi _{f}\) be the predicate that takes as input a predicate instance \(\mathsf {qx}=(C_{1},C_{2},C_{3}) \in \mathbb {G}^{3}\) consisting of three Pedersen commitments, a predicate witness \(\mathsf {qw}=(a,b,\omega _{1},\omega _{2},\omega _{3})\) consisting of two vectors \(a,b\in \mathbb {F}^{\ell }\) and three opening randomness elements \(\omega _{1},\omega _{2},\omega _{3}\in \mathbb {F}\), and checks that \(C_{1}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck}_{\ell },a;\omega _{1})\), \(C_{2}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck}_{\ell },b;\omega _{2})\), and \(C_{3}= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck}_{m},f(a,b);\omega _{3})\). The Hadamard product \(\circ :\mathbb {F}^{\ell } \times \mathbb {F}^{\ell } \rightarrow \mathbb {F}^{\ell }\) is a bilinear operation, as is the scalar product \(\langle \cdot ,\cdot \rangle :\mathbb {F}^{\ell } \times \mathbb {F}^{\ell } \rightarrow \mathbb {F}\). Our accumulation scheme for Hadamard products works the same way, mutatis mutandis, for a general bilinear map f.
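For concreteness, a minimal sketch of the generalized predicate \(\varPhi _{f}\), here instantiated with the scalar (inner) product \(f = \langle \cdot ,\cdot \rangle \) with \(m = 1\); the toy group and encodings are illustrative assumptions, as before:

```python
import random

p, q, ELL = 1019, 2039, 3          # toy field; group: order-p subgroup of Z_q^*
gl = [pow(random.randrange(2, q), 2, q) for _ in range(ELL)]  # key ck_ell
gm = [pow(random.randrange(2, q), 2, q)]                      # key ck_m (m = 1)
h = pow(random.randrange(2, q), 2, q)                         # blinding generator

def commit(key, vec, w):           # Pedersen commitment under the given key
    c = pow(h, w, q)
    for gk, vk in zip(key, vec):
        c = c * pow(gk, vk, q) % q
    return c

def inner(a, b):                   # a bilinear map f: F^ELL x F^ELL -> F^1
    return [sum(x * y for x, y in zip(a, b)) % p]

def phi_f(f, qx, qw):              # the predicate Phi_f from the remark
    C1, C2, C3 = qx
    a, b, w1, w2, w3 = qw
    return (C1 == commit(gl, a, w1) and C2 == commit(gl, b, w2)
            and C3 == commit(gm, f(a, b), w3))

a = [random.randrange(p) for _ in range(ELL)]
b = [random.randrange(p) for _ in range(ELL)]
w1, w2, w3 = (random.randrange(p) for _ in range(3))
qx = (commit(gl, a, w1), commit(gl, b, w2), commit(gm, inner(a, b), w3))
assert phi_f(inner, qx, (a, b, w1, w2, w3))
```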
2.6 Split Accumulation for Pedersen Polynomial Commitments
We construct an efficient split accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\) for a predicate \(\varPhi _{\scriptscriptstyle \mathsf {PC}}\) that checks a polynomial evaluation claim for a “trivial” polynomial commitment scheme \(\mathsf {PC}_{\scriptscriptstyle \mathsf {Ped}}\) based on Pedersen commitments (see Fig. 5). In more detail, for a Pedersen commitment key \(\mathsf {ck}\) for messages in \(\mathbb {F}^{d+ 1}\), the predicate \(\varPhi _{\scriptscriptstyle \mathsf {PC}}\) takes as input a predicate instance \(\mathsf {qx}=(C, z, v) \in \mathbb {G}\times \mathbb {F}\times \mathbb {F}\) and a predicate witness \(\mathsf {qw}=p\in \mathbb {F}^{\le d}[X]\), and checks that \(C= \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},p)\), \(p(z) = v\), and \(\mathrm {deg}(p) \le d\). In other words, the predicate \(\varPhi _{\scriptscriptstyle \mathsf {PC}}\) checks that the polynomial \(p\) of degree at most \(d\) committed in \(C\) evaluates to \(v\) at \(z\).
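A minimal sketch of this "trivial" predicate, assuming a toy multiplicative group in place of the paper's Pedersen group and a deterministic (non-hiding) commitment, as suggested by the fact that the predicate witness is just the polynomial \(p\):

```python
import random

p, q, D = 1019, 2039, 4            # scalar field F_p; group: order-p subgroup of Z_q^*
g = [pow(random.randrange(2, q), 2, q) for _ in range(D + 1)]  # ck for F_p^{d+1}

def commit(coeffs):                # deterministic Pedersen commitment to coefficients
    c = 1
    for gk, ck in zip(g, coeffs):
        c = c * pow(gk, ck, q) % q
    return c

def evaluate(coeffs, z):           # Horner evaluation of p at z over F_p
    v = 0
    for c in reversed(coeffs):
        v = (v * z + c) % p
    return v

def phi_pc(qx, qw):                # the predicate Phi_PC
    C, z, v = qx                   # instance: commitment, point, claimed value
    return len(qw) <= D + 1 and C == commit(qw) and evaluate(qw, z) == v

poly = [random.randrange(p) for _ in range(D + 1)]
z = random.randrange(p)
assert phi_pc((commit(poly), z, evaluate(poly, z)), poly)
```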
Theorem 4
(informal). The (Pedersen) polynomial commitment predicate \(\varPhi _{\scriptscriptstyle \mathsf {PC}}\) has a split accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\) that is secure in the random oracle model (and assuming the hardness of the discrete logarithm problem). Verifying accumulation requires 2 group scalar multiplications and O(1) field additions/multiplications per claim, and results in an accumulator whose instance part is 1 group element and 2 field elements and whose witness part is d field elements (see Table 1).
One can use \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\) to obtain a split accumulation scheme for a different NARK; see Remark 5 for details.
In Table 1 we compare the efficiency of our split accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\) for the predicate \(\varPhi _{\scriptscriptstyle \mathsf {PC}}\) with the efficiency of the atomic accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {IPA}}\) [BCMS20] for the equivalent predicate defined by the check algorithm of the (succinct) PC scheme \(\mathsf {PC}_{\scriptscriptstyle \mathsf {IPA}}\) based on the inner-product argument on cyclic groups [BCC+16, BBB+18, WTS+18]. The takeaway is that the accumulation verifier for \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\) is significantly cheaper than the accumulation verifier for \(\mathsf {AS}_{\scriptscriptstyle \mathsf {IPA}}\).
Technical details are in the full version; in the rest of this section we sketch the ideas behind Theorem 4.
First we describe a simple public-coin interactive reduction for combining two or more evaluation claims into a single evaluation claim, and then explain how this interactive reduction gives rise to the split accumulation scheme. We prove security in the random oracle model, using an expected-time extractor.
Batching Evaluation Claims. First consider two evaluation claims \((C_{1},z,v_{1})\) and \((C_{2},z,v_{2})\) for the same evaluation point \(z\) (and degree \(d\)). We can use a random challenge \(\alpha \in \mathbb {F}\) to combine these claims into one claim \((C',z,v')\) where \(C' := C_{1} + \alpha C_{2}\) and \(v' := v_{1} + \alpha v_{2}\). If either of the original claims does not hold then, with high probability over the choice of \(\alpha \), neither does the new claim. This idea extends to any number of claims for the same evaluation point, by taking \(C' := \sum _{i=1}^{n} \alpha ^{i-1} C_{i}\) and \(v' := \sum _{i=1}^{n} \alpha ^{i-1} v_{i}\).
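The following sketch (toy parameters as above, deterministic commitments) checks that the combined claim remains valid when every original claim is valid: the combined commitment, value, and polynomial are all formed with the same powers of \(\alpha \):

```python
import random

p, q, D, n = 1019, 2039, 4, 3      # toy field/group, degree bound, claim count
g = [pow(random.randrange(2, q), 2, q) for _ in range(D + 1)]

def commit(coeffs):                 # deterministic Pedersen commitment
    c = 1
    for gk, ck in zip(g, coeffs):
        c = c * pow(gk, ck, q) % q
    return c

def evaluate(coeffs, z):            # Horner evaluation over F_p
    v = 0
    for c in reversed(coeffs):
        v = (v * z + c) % p
    return v

polys = [[random.randrange(p) for _ in range(D + 1)] for _ in range(n)]
z = random.randrange(p)
claims = [(commit(pi), z, evaluate(pi, z)) for pi in polys]  # (C_i, z, v_i)
alpha = random.randrange(1, p)

Cp, vp, pp = 1, 0, [0] * (D + 1)    # combined claim and combined polynomial
for i, (C, _, v) in enumerate(claims):
    ai = pow(alpha, i, p)           # alpha^{i-1} in the 1-based notation
    Cp = Cp * pow(C, ai, q) % q     # C' += alpha^{i-1} C_i, written multiplicatively
    vp = (vp + ai * v) % p
    pp = [(c + ai * ci) % p for c, ci in zip(pp, polys[i])]

assert Cp == commit(pp) and evaluate(pp, z) == vp   # (C', z, v') holds for p'
```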
Next consider two evaluation claims \((C_{1},z_{1},v_{1})\) and \((C_{2},z_{2},v_{2})\) at (possibly) different evaluation points \(z_{1}\) and \(z_{2}\). We explain how these can be combined into four claims all at the same point. Below we use the fact that \(p(z)=v\) if and only if there exists a polynomial \(w(X)\) such that \(p(X) = w(X) \cdot (X-z) +v\).
Let \(p_{1}(X)\) and \(p_{2}(X)\) be the polynomials “inside” \(C_{1}\) and \(C_{2}\), respectively, that are known to the prover.
-
1.
The prover computes the witness polynomials \(w_{1}(X) := \frac{p_{1}(X)-v_{1}}{X-z_{1}}\) and \(w_{2}(X) := \frac{p_{2}(X)-v_{2}}{X-z_{2}}\), and sends the commitments \(W_{1} := \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},w_{1})\) and \(W_{2} := \mathsf {CM}.\mathsf {Commit}(\mathsf {ck},w_{2})\).
-
2.
The verifier sends a random evaluation point \(z^* \in \mathbb {F}\).
-
3.
The prover computes and sends the evaluations \(y_{1} := p_{1}(z^*)\), \(y_{2} := p_{2}(z^*)\), \(y'_{1} := w_{1}(z^*)\), and \(y'_{2} := w_{2}(z^*)\).
-
4.
The verifier checks the relation between each witness polynomial and the original polynomial at the random evaluation point \(z^*\):
$$\begin{aligned} y_{1}=y'_{1} \cdot (z^*-z_{1})+v_{1} \quad \text {and}\quad y_{2}=y'_{2} \cdot (z^*-z_{2})+v_{2} . \end{aligned}$$Next, the verifier outputs four evaluation claims for \(p_{1}(z^*)=y_{1},p_{2}(z^*)=y_{2},w_{1}(z^*)=y'_{1},w_{2}(z^*)=y'_{2}\):
$$\begin{aligned} (C_{1},z^*,y_{1})\;,\; (C_{2},z^*,y_{2})\;,\; (W_{1},z^*,y'_{1})\;,\; (W_{2},z^*,y'_{2})\;. \end{aligned}$$
More generally, we can reduce m evaluation claims at m points to 2m evaluation claims all at the same point.
By combining the two techniques, one obtains a public-coin interactive reduction from any number of evaluation claims (regardless of evaluation points) to a single evaluation claim.
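The key identity is that \(w(X) = (p(X)-v)/(X-z)\) is a polynomial exactly when \(p(z)=v\), and it is computable by synthetic division. The sketch below (toy field; commitments omitted to isolate the identity) computes a witness polynomial and verifies the verifier's check at a random \(z^*\):

```python
import random

p, D = 1019, 4                     # toy field and degree bound

def evaluate(coeffs, z):           # Horner evaluation over F_p
    v = 0
    for c in reversed(coeffs):
        v = (v * z + c) % p
    return v

def witness(coeffs, z):
    """Synthetic division: quotient w with p(X) = w(X) * (X - z) + p(z)."""
    out, acc = [], 0
    for c in reversed(coeffs):     # process from the leading coefficient down
        acc = (acc * z + c) % p
        out.append(acc)
    out.pop()                      # the last value is the remainder p(z)
    return list(reversed(out))     # quotient coefficients, lowest degree first

p1 = [random.randrange(p) for _ in range(D + 1)]
z1, zstar = random.randrange(p), random.randrange(p)
v1 = evaluate(p1, z1)              # the claimed value
w1 = witness(p1, z1)               # the witness polynomial
y1, y1p = evaluate(p1, zstar), evaluate(w1, zstar)
assert y1 == (y1p * (zstar - z1) + v1) % p   # the verifier's check at z*
```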
Split Accumulation. The batching protocol described above yields a split accumulation scheme for \(\varPhi _{\scriptscriptstyle \mathsf {PC}}\) in the random oracle model. An accumulator \(\mathsf {acc}\) has the same form as a predicate input: the instance part is an evaluation claim and the witness part is a polynomial. Next we describe the algorithms of the accumulation scheme.
-
The accumulation prover \(\mathrm {P}\) runs the interactive reduction by relying on the random oracle to generate the random verifier messages (i.e., it applies the Fiat–Shamir transformation to the reduction; see the sketch after this list), in order to combine the instance parts of old accumulators and inputs to obtain the instance part of a new accumulator. Then \(\mathrm {P}\) also combines the committed polynomials using the same linear combinations in order to derive the new committed polynomial, which is the witness part of the new accumulator. The accumulation proof \(\mathsf {pf}\) consists of the messages to the verifier in the reduction, which includes the commitments to the witness polynomials \(W_{i}\) and the evaluations \(y_{i},y'_{i}\) at \(z^*\) of \(p_{i},w_{i}\) (that is, \(\mathsf {pf}= ([W_{i}]_{i},[y_{i},y'_{i}]_{i})\)).
-
The accumulation verifier \(\mathrm {V}\) checks that the challenges were correctly computed from the random oracle, and performs the checks of the reduction (that the claims were correctly combined, and that the proper relation between each \(y_{i},y'_{i},z_{i},z^*\) holds).
-
The accumulation decider \(\mathrm {D}\) reads the accumulator in its entirety and checks that the polynomial (the witness part) satisfies the evaluation claim (the instance part). (Here the random oracle is not used.)
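A minimal sketch of the challenge derivation used by \(\mathrm {P}\) and \(\mathrm {V}\), with SHA-256 standing in for the random oracle; the label and the integer encodings are illustrative assumptions, not the paper's concrete instantiation:

```python
import hashlib

def ro_challenge(label, transcript, p):
    """Fiat-Shamir: hash the transcript so far into a field element of F_p."""
    h = hashlib.sha256(label)
    for x in transcript:
        h.update(int(x).to_bytes(64, "big"))
    return int.from_bytes(h.digest(), "big") % p

# e.g. P and V both derive the evaluation point z* from everything sent so far:
#   zstar = ro_challenge(b"zstar", [C1, z1, v1, C2, z2, v2, W1, W2], p)
```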
Efficiency. The efficiency claimed in Theorem 4 (and Table 1) is evident from the construction. The accumulation prover \(\mathrm {P}\) computes \(n+m\) commitments to polynomials when combining n old accumulators and m predicate inputs (all polynomials have degree at most d). The (short) instance part of an accumulator consists of 1 group element and 2 field elements, while the (long) witness part of an accumulator consists of O(d) field elements. The accumulation decider \(\mathrm {D}\) computes 1 commitment (and 1 polynomial evaluation at 1 point) in order to validate an accumulator. Finally, the cost of running the accumulation verifier \(\mathrm {V}\) is dominated by the \(2(n+m)\) group scalar multiplications used to combine the commitments.
Security. Given an adversary that produces evaluation claims \([\mathsf {qx}_{i}]_{i=1}^n= [(C_{i},z_{i},v_{i})]_{i=1}^{n}\), a single claim \(\mathsf {qx}= (C,z,v)\) and polynomial \(\mathsf {qw}= s(X)\) with \(s(z) = v\) to which \(C\) is a commitment, and an accumulation proof \(\mathsf {pf}\) that makes the accumulation verifier accept, we need to extract polynomials \([\mathsf {qw}_{i}]_{i=1}^n= [p_{i}(X)]_{i=1}^{n}\) with \(p_{i}(z_{i}) = v_{i}\) to which \(C_{i}\) is a commitment. Our security proof (in the full version) works in the random oracle model, assuming hardness of the discrete logarithm problem.
In the proof, we apply our expected-time forking lemma (see Sect. 2.4) to obtain 2n polynomials \([s^{(j)}]_{j=1}^{2n}\) for the same evaluation point \(z^*\) but distinct challenges \(\alpha _j\), where n is the number of evaluation claims. The checks in the reduction procedure imply that \(s^{(j)}(X) = \sum _{i=1}^{n} \alpha _j^i p_{i}(X) + \sum _{i=1}^{n} \alpha _j^{n+i} w_{i}(X)\), where \(w_{i}(X)\) is the witness corresponding to \(p_{i}(X)\); hence we can recover the \(p_{i}(X),w_{i}(X)\) by solving a linear system (given by the Vandermonde matrix in the challenges \([\alpha _j]_{j=1}^{2n}\)). We then use an expected-time variant of the zero-finding game lemma from [BCMS20] (see the full version) to show that if a particular polynomial equation on \(p_{i}(X),w_{i}(X)\) holds at the point \(z^*\) obtained from the random oracle, it must with overwhelming probability be an identity. Applying this to the equation induced by the reduction shows that, with high probability, each extracted polynomial \(p_{i}\) satisfies the corresponding evaluation claim \((C_{i},z_{i},v_{i})\).
Remark 5
(from \(\mathsf {PC}_{\scriptscriptstyle \mathsf {Ped}}\) to an accumulatable NARK). If one replaced the (succinct) polynomial commitment scheme that underlies the preprocessing zkSNARK in [CHM+20] with the aforementioned (non-succinct) trivial Pedersen polynomial commitment scheme, then (after some adjustments and using our Theorem 4) one would obtain a zkNARK for R1CS with a split accumulation scheme whose accumulation verifier is of constant size, but whose other asymptotics would be worse than those of Theorem 2.
First, the cryptographic costs and the quasilinear costs of the NARK and accumulation scheme would also grow with the number \(\mathsf {K}\) of non-zero entries in the coefficient matrices, which can be much larger than \(\mathsf {M}\) and \(\mathsf {N}\) (asymptotically and concretely). Second, the NARK prover would additionally use a quasilinear number of field operations due to FFTs. Finally, in addition to poorer asymptotics, this approach would lead to a concretely more expensive accumulation verifier and overall a more complex protocol.
Nevertheless, one can design a concretely efficient zkNARK for R1CS based on the Pedersen PC scheme and our accumulation scheme for it. This naturally leads to an alternative construction to the one in Sect. 2.3 (which is instead based on accumulation of Hadamard products), and would lead to a slightly more expensive prover (which now would use FFTs) and a slightly cheaper accumulation verifier (a smaller number of group scalar multiplications). We leave this as an exercise for the interested reader.
2.7 Implementation and Evaluation
We elaborate on our implementation and evaluation of accumulation schemes and their application to PCD.
The Case for a PCD Framework. Different PCD constructions offer different trade-offs. The trade-offs are both about asymptotics (see Remark 3) and about practical concerns, as we review below.
-
PCD from sublinear verification [BCCT13, BCTV14, COS20] is typically instantiated via preprocessing SNARKs based on pairings (Footnote 6). This route offers excellent verifier time (a few milliseconds regardless of the computation at a PCD node), but requires a private-coin setup (which complicates deployment) and cycles of pairing-friendly elliptic curves (which are costly in terms of group arithmetic and size).
-
PCD from atomic accumulation [BCMS20] can, e.g., be instantiated via SNARKs based on cyclic groups [BGH19]. This route offers a transparent setup (easy to deploy) and logarithmic-size arguments (a few kilobytes even for large computations), using cycles of standard elliptic curves (more efficient than their pairing-friendly counterparts). On the other hand, this route yields linear verification times (expensive for large computations) and logarithmic costs for accumulation (increasing the cost of recursion).
-
PCD from split accumulation (this work) can, e.g., be instantiated via NARKs based on cyclic groups. This route still offers a transparent setup and allows using cycles of standard elliptic curves. Moreover, it offers constant costs for accumulation, but at the expense of argument size, which is now linear.
It would be desirable to have a single framework that supports different PCD constructions via a modular composition of simpler building blocks. Such a framework would enable a number of desirable features: (a) ease of replacing older building blocks with new ones; (b) ease of prototyping different PCD constructions for different applications (which may have different needs), thereby enabling practitioners to make informed choices about which PCD construction is best for them; (c) simpler and more efficient auditing of complex cryptographic systems with many intermixed layers (realizing even a single PCD construction is a substantial implementation task); and (d) separation of "application" logic from the underlying recursion via a common PCD interface. Together, these features would enable further industrial deployment of PCD, as well as making future research and comparisons simpler.
Implementation. The above considerations motivated our implementation efforts for PCD. Our code base has two main parts, one for realizing accumulation schemes and another for realizing PCD from accumulation (the latter is integrated with PCD from succinct verification under a unified PCD interface).
-
Framework for accumulation. We designed a modular framework for (atomic and split) accumulation schemes, and use it to implement, under a common interface, several accumulation schemes: (a) the atomic accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {AGM}}\) in [BCMS20] for the PC scheme \(\mathsf {PC}_{\scriptscriptstyle \mathsf {AGM}}\); (b) the atomic accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {IPA}}\) in [BCMS20] for the PC scheme \(\mathsf {PC}_{\scriptscriptstyle \mathsf {IPA}}\); (c) the split accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\) in this paper for the PC scheme \(\mathsf {PC}_{\scriptscriptstyle \mathsf {Ped}}\); (d) the split accumulation scheme \(\mathsf {AS}_{\scriptscriptstyle \mathsf {HP}}\) in this paper for the Hadamard product predicate \(\varPhi _{\scriptscriptstyle \mathsf {HP}}\); (e) the split accumulation scheme for our NARK for R1CS. Our framework also provides a generic method for defining R1CS constraints for the verifiers of these accumulation schemes; we leverage this to implement R1CS constraints for all of these accumulation schemes.
-
PCD from accumulation. We use the foregoing framework to implement a generic construction of PCD from accumulation. We support the PCD construction of [BCMS20] (which uses atomic accumulation) and the PCD construction in this paper (which uses split accumulation). Our code builds on, and extends, an existing PCD library (Footnote 7). Our implementation is modular: it takes as ingredients an implementation of any NARK, an implementation of any accumulation scheme for that NARK, and constraints for the accumulation verifier, and produces a concrete PCD construction. This allows us, for example, to obtain a PCD instantiation based on our NARK for R1CS and its split accumulation scheme.
Evaluation for DL Setting. When realizing PCD in practice the main goal is to “minimize the cost of recursion”, that is, to minimize the number of constraints that need to be recursively proved in each PCD step (excluding the constraints for the application) without hurting other parameters too much (prover time, argument size, and so on). We evaluate our implementation with respect to this goal, with a focus on understanding the trade-offs between atomic and split accumulation in the discrete logarithm setting.
The DL setting is of particular interest to practitioners, as it leads to systems with a transparent (public-coin) setup that can be based on efficient cycles of (standard) elliptic curves [BGH19, Hop20]; indeed, some projects are developing real-world systems that use PCD in the DL setting [Halo20, Pickles20]. The main drawback of the DL setting is that verification time (and sometimes argument size) is linear in a PCD node’s computation. This inefficiency is, however, tolerable if a PCD node’s computation is not too large, as is the case in the aforementioned projects. (Especially so when taking into account the disadvantages of PCD based on pairings, which involves relying on a private-coin setup and more expensive curve cycles.)
We evaluate our implementation to answer two questions: (a) how efficient is recursion with split accumulation for our simple zkNARK for R1CS? (b) what is the constraint cost of split accumulation for \(\mathsf {PC}_{\scriptscriptstyle \mathsf {Ped}}\) compared to atomic accumulation for \(\mathsf {PC}_{\scriptscriptstyle \mathsf {IPA}}\)? All our experiments are performed over the 255-bit Pallas curve in the Pasta cycle of curves [Hop20], which is used by real-world deployments.
-
Split accumulation for R1CS. Our evaluation demonstrates that the cost of recursion for IVC with our split accumulation scheme for the simple NARK for R1CS is low, both with zero knowledge (\(\sim 99 \times 10^3\) constraints) and without (\(\sim 52 \times 10^3\) constraints). In fact, this cost is even lower than the cost of IVC based on highly efficient pairing-based circuit-specific SNARKs. Furthermore, like in the pairing-based case, this cost does not grow with the size of computation being checked. This is much better than prior constructions of IVC based on atomic accumulation for \(\mathsf {PC}_{\scriptscriptstyle \mathsf {IPA}}\) in the DL setting, as we will see next.
-
Comparison of accumulation for PC schemes. Several (S)NARKs are built from PC schemes, and the primary cost of recursion for these is determined by the cost of accumulation for the PC scheme. In light of this we compare the costs of two accumulation schemes:
-
the atomic accumulation scheme for the PC scheme \(\mathsf {PC}_{\scriptscriptstyle \mathsf {IPA}}\) [BCMS20];
-
the new split accumulation scheme for \(\mathsf {PC}_{\scriptscriptstyle \mathsf {Ped}}\).
Our evaluation demonstrates that the constraint cost of the \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\) accumulation verifier is 8 to 20 times cheaper than that of the \(\mathsf {AS}_{\scriptscriptstyle \mathsf {IPA}}\) accumulation verifier. In Fig. 6 we report the asymptotic cost of \(|\mathrm {V}|\) (the constraint cost of \(\mathrm {V}\)) in \(\mathsf {AS}_{\scriptscriptstyle \mathsf {IPA}}\), \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\), and \(\mathsf {AS}_{\scriptscriptstyle \mathsf {R1CS}}\) (Footnote 8).
Fig. 6 (caption): Comparison of the constraint cost of the accumulation verifier \(\mathrm {V}\) in \(\mathsf {AS}_{\scriptscriptstyle \mathsf {IPA}}\), \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\), and \(\mathsf {AS}_{\scriptscriptstyle \mathsf {R1CS}}\) when varying the number of constraints (for \(\mathsf {AS}_{\scriptscriptstyle \mathsf {R1CS}}\)) or the degree of the accumulated polynomial (for \(\mathsf {AS}_{\scriptscriptstyle \mathsf {IPA}}\) and \(\mathsf {AS}_{\scriptscriptstyle \mathsf {PC}}\)) from \(2^{10}\) to \(2^{20}\). Note that the cost of accumulating \(\mathsf {PC}_{\scriptscriptstyle \mathsf {IPA}}\) and \(\mathsf {PC}_{\scriptscriptstyle \mathsf {Ped}}\) is a lower bound on the cost of accumulating any SNARK built atop those, and this enables comparing against the cost of \(\mathsf {AS}_{\scriptscriptstyle \mathsf {R1CS}}\).
We note that the cost of all the aforementioned accumulation schemes is dominated by the cost of many common subcomponents, and so improvements in these subcomponents will preserve the relative cost. For example, applying existing techniques [Halo20, Pickles20] for optimizing the constraint cost of elliptic curve scalar multiplications should benefit all our schemes in a similar way.
Notes
- 1.
- 2. By "working definition" we mean a definition that we can provably fulfill under concrete hardness assumptions in the random oracle model, and, separately, that provably suffices for recursive composition in the plain model without random oracles.
- 3. We could even "re-arrange" computation between the NARK and the accumulation scheme, and simplify the NARK further to be the NP decider (the verifier receives just the witness \(w\) and checks that the R1CS condition holds). We do not do so because this does not lead to any savings in the accumulation verifier (the main efficiency metric of interest), and also because the current presentation more naturally leads to the zero knowledge variant described in Sect. 2.3.2. (We note that the foregoing rearrangement is a general transformation that does not preserve zero knowledge or succinctness of the given NARK.)
- 4. For now we view the commitment key \(\mathsf {ck}\) and coefficient matrices \(A,B,C\) as hardcoded in the accumulation predicate \(\varPhi \); our definitions later handle this more precisely.
- 5. The verifier performs 4 group scalar multiplications by computing \(\beta \cdot \mathsf {qx}.C_{\scriptscriptstyle C}\) and then \(\beta \cdot \mathsf {pf}+ \beta ^2 \cdot \mathsf {qx}.C_{\scriptscriptstyle C}= \beta \cdot (\mathsf {pf}+ \beta \cdot \mathsf {qx}.C_{\scriptscriptstyle C})\) via another group scalar multiplication. Further, it is possible to combine \(C_{\scriptscriptstyle A}\) and \(C_{\scriptscriptstyle B}\) into one commitment in both the NARK and the accumulation scheme. This reduces the group scalar multiplications in the verifier to 3, and the accumulator size to \(3\;\mathbb {G}+ \mathsf {n}\;\mathbb {F}\).
- 6. Instantiations based on hashes are also possible [COS20] but are (post-quantum and) less efficient.
- 7.
- 8. This comparison is meaningful because the cost of accumulating polynomial commitments provides a lower bound on the cost of accumulating SNARKs that rely on these PC schemes.
References
Bünz, B., Bootle, J., Boneh, D., Poelstra, A., Wuille, P., Maxwell, G.: Bulletproofs: short proofs for confidential transactions and more. In: S & P 2018 (2018)
Bootle, J., Cerulli, A., Chaidos, P., Groth, J., Petit, C.: Efficient zero-knowledge arguments for arithmetic circuits in the discrete log setting. In: EUROCRYPT 2016 (2016)
Bitansky, N., Canetti, R., Chiesa, A., Tromer, E.: Recursive composition and bootstrapping for SNARKs and proof-carrying data. In: STOC 2013 (2013)
Bünz, B., Chiesa, A., Lin, W., Mishra, P., Spooner, N.: Proof-carrying data without succinct arguments. In: IACR Cryptol. ePrint Arch. (2020). https://eprint.iacr.org/2020/1618
Bünz, B., Chiesa, A., Mishra, P., Spooner, N.: Proof-carrying data from accumulation schemes. In: TCC 2020 (2020)
Ben-Sasson, E., Chiesa, A., Tromer, E., Virza, M.: Scalable zero knowledge via cycles of elliptic curves. In: CRYPTO 2014 (2014)
Boneh, D., Drake, J., Fisch, B., Gabizon, A.: Halo Infinite: Recursive zk-SNARKs from any Additive Polynomial Commitment Scheme. Cryptology ePrint Archive, Report 2020/1536
Bowe, S., Grigg, J., Hopwood, D.: Halo: Recursive Proof Composition without a Trusted Setup. Cryptology ePrint Archive, Report 2019/1021
Bünz, B., Maller, M., Mishra, P., Tyagi, N., Vesely, P.: Proofs for Inner Pairing Products and Applications. Cryptology ePrint Archive, Report 2019/1177
Bonneau, J., Meckler, I., Rao, V., Shapiro, E.: Coda: Decentralized Cryptocurrency at Scale. Cryptology ePrint Archive, Report 2020/352
Bellare, M., Neven, G.: Multi-signatures in the plain public-Key model and a general forking lemma. In: CCS 2006 (2006)
Chen, W., Chiesa, A., Dauterman, E., Ward, N.P.: Reducing Participation Costs via Incremental Verification for Ledger Systems. Cryptology ePrint Archive, Report 2020/1522
Chiesa, A., Hu, Y., Maller, M., Mishra, P., Vesely, N., Ward, N.: Marlin: Preprocessing zkSNARKs with Universal and Updatable SRS. In: EUROCRYPT 2020 (2020)
Chiesa, A., Ojha, D., Spooner, N.: Fractal: Post-Quantum and Transparent Recursive Proofs from Holography. In: EUROCRYPT 2020 (2020)
Chiesa, A., Tromer, E.: Proof-Carrying Data and Hearsay Arguments from Signature Cards. In: ICS 2010 (2010)
Chong, S., Tromer, E., Vaughan, J.A.: Enforcing Language Semantics Using Proof-Carrying Data. Cryptology ePrint Archive, Report 2013/513
Chiesa, A., Tromer, E., Virza, M.: Cluster Computing in Zero Knowledge. In: EUROCRYPT 2015 (2015)
Groth, J.: On the Size of Pairing-Based Non-interactive Arguments. In: EUROCRYPT 2016 (2016)
Ghoshal, A., Tessaro, S.: Tight State-Restoration Soundness in the Algebraic Group Model. Cryptology ePrint Archive, Report 2020/1351
Bowe, S., Grigg, J., Hopwood, D.: Halo2 (2020). https://github.com/zcash/halo2
Hopwood, D.: The Pasta Curves for Halo 2 and Beyond. https://electriccoin.co/blog/the-pasta-curves-for-halo-2-and-beyond/
Kattis, A., Bonneau, J.: Proof of Necessary Work: Succinct State Verification with Fairness Guarantees. Cryptology ePrint Archive, Report 2020/190
O(1) Labs: Mina Cryptocurrency. https://minaprotocol.com/
Naveh, A., Tromer, E.: PhotoProof: cryptographic image authentication for any set of permissible transformations. In: S & P 2016 (2016)
O(1) Labs: Pickles. https://github.com/o1-labs/marlin
Valiant, P.: Incrementally verifiable computation or proofs of knowledge imply time/space efficiency. In: TCC 2008 (2008)
Wahby, R.S., Tzialla, I., Shelat, A., Thaler, J., Walfish, M.: Doubly-efficient zkSNARKs without trusted setup. In: S & P 2018 (2018)
Acknowledgements
This research was supported in part by the Ethereum Foundation, NSF, DARPA, a grant from ONR, and the Simons Foundation. Nicholas Spooner was supported by DARPA under Agreement No. HR00112020023.
© 2021 International Association for Cryptologic Research
Bünz, B., Chiesa, A., Lin, W., Mishra, P., Spooner, N.: Proof-Carrying Data Without Succinct Arguments. In: Malkin, T., Peikert, C. (eds.) Advances in Cryptology – CRYPTO 2021. Lecture Notes in Computer Science, vol. 12825. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-84242-0_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-84241-3
Online ISBN: 978-3-030-84242-0