Add more requirements for ClientDataJSON serialisation. #1375
Conversation
ClientDataJSON is currently defined to be the JSON encoding of the CollectedClientData. This implies that validators require a full JSON parsing library to check needed entries in the ClientDataJSON such as the challenge, type, and origin. This is a problematic dependency in some cases. This change seeks to address that by being stricter about the encoding, while still generating JSON. Thus existing validators do not need to change but those willing to require recent WebAuthn-implementing browsers can avoid the full generality of JSON.
Overall this looks OK—the motivating "let's not have implementers have to have a full JSON parser" is worthwhile—though the manner in which the algs are presented ought to be consistent with how we present algs in other portions of the spec, like:
- explicitly declare variables, such as the "partial result", e.g., as |partialResult|
- use constructs from infra.spec.whatwg.org, e.g., for "append" if it's arguably appropriate
- use proper code point syntax
- perhaps not use the ‘ and ’ characters
- see also @emlun's comments
In the 2020-02-26 f2f meeting, the feedback was that this needs further Note:/comments targeted at implementers.
This change reflects w3c/webauthn#1375 in code. Since that spec change isn't breaking, it's fine to implement this first as the format still conforms to general JSON. This change will allow some validators to test against it, prove out that it works, and then we'll report back results to the W3C group — hopefully resulting in the WebAuthn spec change being accepted.

Change-Id: Icedf405bf7eb91b752493badccad892aa9822e9f
Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/2083394
Commit-Queue: Adam Langley <agl@chromium.org>
Auto-Submit: Adam Langley <agl@chromium.org>
Reviewed-by: Martin Kreichgauer <martinkr@google.com>
Cr-Commit-Position: refs/heads/master@{#746098}
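For readers without access to the Chromium change itself, the kind of serialization it implements can be sketched roughly as follows. This is illustrative only, not the actual Chromium code: the helper name and the exact escaping details are assumptions, while the fixed key order follows the proposal.

```python
import base64
import json

def serialize_collected_client_data(typ: str, challenge: bytes,
                                    origin: str, cross_origin: bool = False) -> bytes:
    """Emit CollectedClientData as JSON with a fixed key order and fixed
    punctuation, so the output is both valid JSON (existing validators keep
    working) and byte-reproducible (stricter validators can compare bytes)."""
    # Challenge is base64url without padding, which never needs JSON escaping.
    b64 = base64.urlsafe_b64encode(challenge).rstrip(b"=").decode()
    return ('{"type":%s,"challenge":"%s","origin":%s,"crossOrigin":%s}'
            % (json.dumps(typ), b64, json.dumps(origin),
               "true" if cross_origin else "false")).encode()
```

Because the output is still ordinary JSON, existing validators accept it unchanged, which is why the change could ship ahead of the spec.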
On the 2020-03-11 call, @agl reports that he has tugged the OpenSSH folks' sleeves and they are thinking about this; more news later.
On the 2020-03-18 call, @agl reports the SSH people are still chewing on it; still on hold.
On the 2020-04-08 call, @agl reports that the SSH folk think this is OK.
This is now good to review; we believe that it all works in practice.
Overall LGTM. There are a few very minor things I'll go ahead and change if no one objects; see below (@agl has passed 'assignee' of this PR on to me).
Other reviewers, please review so we can get this landed :)
As an outsider to this process (though one with some protocol design experience), this entire proposal/workaround seems deeply dangerous at worst, and an ugly wart at best. There should be no justification for not requiring a standard JSON serializer and parser; that is just asking for unknown and unforeseen trouble in a spec. JSON is used in all manner of security-sensitive and even constrained protocols. Asking implementors to do manual serialization of part of a JSON document is almost guaranteed to result in one or more vulnerabilities.

I'm disappointed that anyone from SSH is even asking for or expecting this kind of compromise just for their benefit. There are many formal and audited codebases that include JSON parsers, and even some parsers that will tokenize side-effect-free without any memory allocations/operations.

It took a lot for me to decide to say something here after this was pointed out to me, since I've not participated in any of these processes and don't have a full background, so I'll understand if I'm off track. What I've read here, though, was shocking enough that I felt compelled to speak, just in case I'm not the only observer who has these concerns, and I hope that the whole spec isn't hobbled by a single particularly stubborn use-case.
Do we also need to make comments around verification of extensions and attestations, given that an implementation without JSON is somewhat likely to also not have an implementation of CBOR?
Apply Jeff's suggestions

Co-Authored-By: =JeffH <jdhodges@google.com>
The differences stem from different sensitivities of different bits of code. Registering a new authenticator (during which attestation processing may be done) is usually less sensitive than pre-auth verification of a signature. Extension processing could be a problem, although the CBOR used is already significantly subsetted by CTAP2 and currently no authentication extensions are commonly used.
I roughly agree that JSON is not an unreasonably complex format, even in sensitive locations. I wrote what I think is a demonstrably memory-safe JSON parser as an example of how it could be done. However, it's not my decision, and I must admit that, compared to SSH's length-prefixed bytestrings, JSON is a lot more heavyweight and would significantly increase the amount of code in the highly sensitive preauth flow. (For those looking for a safe JSON parser in C, I suggest looking at the Wuffs framework, on which Chromium's future JSON parser may well be based.)
I disagree here. While things could get hairy in general, the input strings here are either static values from the browser, base64url-encoded values, or punycoded origin strings. Of these, only the last is not trivially safe for plain template insertion. This is not a general data structure that's being encoded; it has a few known keys and values. Lastly, there are few browser implementations, and they have to do much more complex and subtle things to ensure safety in WebAuthn than this. It's worth emphasizing that no site needs to worry about this, and sites are recommended not to worry about this. No site need touch anything other than a standard JSON parser.
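The claim that most of these inputs are "trivially safe for plain template insertion" can be stated as a small predicate. This helper is illustrative, not part of the spec or any browser's code; an implementation could use something like it to confirm at runtime that the origin string (the one input noted above as not trivially safe) may be inserted into a JSON template verbatim.

```python
def safe_for_json_template(s: str) -> bool:
    """True if s contains only printable ASCII with no '"' or backslash,
    i.e. characters that need no escaping inside a JSON string literal.
    Static type values and base64url output pass by construction; a
    punycoded origin must be checked rather than assumed."""
    return all(0x20 <= ord(c) < 0x7F and c not in '"\\' for c in s)
```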
LGTM!
What're your thoughts regarding the comment at
Line 2828 in 8d0060a
Note: The {{CollectedClientData}} may be extended in the future. Therefore it's critical when parsing to be tolerant of unknown keys and of any reordering of the keys.
I think that should also link to the serialization rules, since the tolerance now applies only to the reordering of the non-static elements?
Otherwise, I've worked through the algorithms several times by hand and assuming byte-matching for punycode domains, it works fine for me.
ClientDataJSON is currently defined to be the JSON encoding of the
CollectedClientData. This implies that validators require a full JSON
parsing library to check needed entries in the ClientDataJSON such as
the challenge, type, and origin.
This is a problematic dependency in some cases. This change seeks to
address that by being stricter about the encoding, while still
generating JSON. Thus existing validators do not need to change but
those Relying Parties' webapps that are willing to require the use of recent
WebAuthn-implementing browsers can avoid the full generality of JSON.