Spec new externtype encoding #10

bvisness · 2025-12-04T17:46:31Z

Per #7, we are adding another compact encoding that deduplicates externtypes as well as module names. This PR updates the formal spec.

The way I'm handling identifiers in the text format seems obviously wrong somehow, but the entire way that the compact encoding is handled in the import section is also wrong, so maybe it's ok for now and we can fix up the fiddly details before phase 4.

bvisness · 2025-12-04T20:41:21Z

What in the world do you have to do to pass CI these days? I ran make test and make testpromote repeatedly until make test succeeded, and yet make test is apparently failing.

rossberg · 2025-12-04T22:36:28Z

The dependencies for the test target are rather complicated and unfortunately incomplete somewhere. Before pushing to the repo, I recommend starting the test run from a make clean if you want to make sure it's not missing anything.

rossberg

Spec is technically fine as is modulo cross-reference nits. However, there are two things I find problematic about the new text format extension:

The relative order of externtype and item names is inconsistent with other forms.
The bound identifiers are not preceded by a corresponding keyword that declares their namespace. That is completely at odds with the rest of the text format.

For the former, I would suggest changing the order, at least in the text format. For the latter, I would simply remove the ability to bind identifiers in the new short hand. That would be consistent with existing forms like param and local, where you can't use the multi-form if you want to bind names (e.g., (local i32 i32) vs (local $x i32) (local $y i32)).
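
To make the param/local precedent concrete, here is a minimal sketch of how the existing multi-form expands. It assumes the declaration has already been parsed into a Python list of strings; the function name `expand_locals` and that list representation are hypothetical and come from neither the spec nor any tool.

```python
def expand_locals(form):
    """Expand (local t1 t2 ...) into one declaration per type.

    `form` is a pre-parsed list such as ["local", "i32", "i32"].
    This representation is an assumption made for illustration only.
    """
    head, *rest = form
    if head not in ("local", "param"):
        raise ValueError(f"unexpected keyword: {head!r}")
    if rest and rest[0].startswith("$"):
        # A bound identifier is only allowed on the single-type form.
        if len(rest) != 2:
            raise ValueError("a named declaration takes exactly one type")
        return [form]
    # The anonymous multi-form is shorthand for one declaration per type.
    return [[head, t] for t in rest]
```

Under this reading, `(local i32 i32)` becomes two anonymous locals, while naming them requires writing `(local $x i32) (local $y i32)` separately, which is the restriction suggested for the new import shorthand.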

rossberg · 2025-12-05T07:36:52Z

document/core/text/modules.rst

 Abbreviations
 .............

 Multiple imports with the same ${:nm_1} may be declared together:


Suggested change

-Multiple imports with the same ${:nm_1} may be declared together:
+Multiple imports with the same :ref:`name <syntax-name>` ${:nm_1} may be declared together:

rossberg · 2025-12-05T07:39:38Z

document/core/text/modules.rst

-$${grammar: Timports_/abbrev}
+$${grammar: Timports_/abbrev-compact1}
+
+Multiple imports with the same ${:nm_1} and ${:xt} may also be declared together, in which case identifiers may be placed on individual items instead of the ${:externtype}:


Suggested change

-Multiple imports with the same ${:nm_1} and ${:xt} may also be declared together, in which case identifiers may be placed on individual items instead of the ${:externtype}:
+Multiple imports with the same name ${:nm_1} and :ref:`external type <syntax-externtype>` ${:xt} may also be declared together, in which case identifiers may be placed on individual items instead of the external type:

bvisness · 2025-12-05T14:15:15Z

I'm not sure how to reasonably change the order of item name and externtype for the new text format extension. Like, I guess I could do (import "mod" (item "foo") (item "bar") externtype), but that seems quite strange to me given that the point is to declare many items with the same type and a parser would not have any context (until the very end) to distinguish it from (import "mod" (item "foo" externtype1) (item "bar" externtype2)). To me it makes perfect sense to declare the common type up front in some way.

Ideas for your consideration:

;; other existing forms of import for comparison
(import "mod" "foo" externtype)
(import "mod" (item "foo" externtype) (item "bar" externtype))

;; new import form deduplicating externtype
(import "mod" (type externtype (item "foo") (item "bar"))) ;; option 1, currently proposed
(import "mod" (type externtype "foo" "bar")) ;; option 2
(import "mod" (type externtype) "foo" "bar") ;; option 3
(import "mod" (type externtype) (item "foo") (item "bar")) ;; option 4

If dropping the ability to add identifiers, then I think I am partial to option 2.

I'm open to suggestions for other text encodings that would possibly swap the order of items and types as you say, but I just can't come up with one that makes sense.
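
As a rough illustration that the four options denote the same set of imports, the sketch below reads each form as an s-expression and expands it into flat (module, name, externtype) triples. Everything here is an assumption made for illustration: the helper names `read` and `expand`, the simplified tokenizer (no comments, no escapes, no identifiers), and the use of `(func (param i32))` as a stand-in for `externtype`. It is not the spec's desugaring.

```python
import re


def read(src):
    """Read one s-expression into nested lists; string tokens keep their quotes."""
    toks = re.findall(r'\(|\)|"[^"]*"|[^\s()"]+', src)

    def go(i):
        if toks[i] == "(":
            node, i = [], i + 1
            while toks[i] != ")":
                child, i = go(i)
                node.append(child)
            return node, i + 1
        return toks[i], i + 1

    return go(0)[0]


def is_name(x):
    return isinstance(x, str) and x.startswith('"')


def expand(form):
    """Expand an import form into a list of (module, name, externtype) triples."""
    head, module, *rest = form
    if head != "import" or not is_name(module):
        raise ValueError("not an import form")
    mod = module[1:-1]
    if len(rest) == 2 and is_name(rest[0]):
        return [(mod, rest[0][1:-1], rest[1])]  # (import "mod" "foo" xt)
    out, shared = [], None

    def emit(name, xt):
        if xt is None:
            raise ValueError("item without an externtype")
        out.append((mod, name[1:-1], xt))

    for part in rest:
        if is_name(part):            # option 3: bare name after (type xt)
            emit(part, shared)
        elif part[0] == "type":      # (type xt ...) sets the shared type
            _, shared, *inner = part
            for n in inner:          # option 1: (item "foo"); option 2: "foo"
                emit(n[1] if isinstance(n, list) else n, shared)
        elif part[0] == "item":      # own type, or option 4 with the shared type
            emit(part[1], part[2] if len(part) == 3 else shared)
        else:
            raise ValueError(f"unexpected form: {part!r}")
    return out
```

In every option the shared type is seen before any name, so each import can be emitted as soon as its name is read, with no buffering.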

rossberg · 2025-12-08T10:30:30Z

The variations

(import "mod" "foo" externtype)
(import "mod" (item "foo") (item "bar") externtype)
(import "mod" (item "foo" externtype) (item "bar" externtype))

look perfectly consistent to me. I don't think there is a parsing issue between the second and third: the grammar should be LALR(1) just fine, and it can easily be transformed to LL(1). The only minor thing is that, in an LL parser, the list of names has to be buffered before building the list of imports, but that is totally fine for a text parser (less so for the binary format), and there are much worse things of that sort going on elsewhere in the text format.
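
The buffering mentioned above can be sketched with a small recursive-descent parser that uses one token of lookahead. This is only an illustration under stated assumptions: the `Parser` class, its method names, and the simplified tokenizer (no comments, escapes, or identifiers) are hypothetical, and `externtype` is treated as an opaque parenthesized form such as `(func (param i32))`.

```python
import re

TOKEN = re.compile(r'\(|\)|"[^"]*"|[^\s()"]+')


class Parser:
    def __init__(self, src):
        self.toks = TOKEN.findall(src)
        self.pos = 0

    def peek(self):
        return self.toks[self.pos] if self.pos < len(self.toks) else None

    def next(self):
        tok = self.peek()
        if tok is None:
            raise SyntaxError("unexpected end of input")
        self.pos += 1
        return tok

    def expect(self, want):
        tok = self.next()
        if tok != want:
            raise SyntaxError(f"expected {want!r}, got {tok!r}")

    def string(self):
        tok = self.next()
        if not tok.startswith('"'):
            raise SyntaxError(f"expected string, got {tok!r}")
        return tok[1:-1]

    def sexpr_tail(self, head):
        """Parse the rest of a list whose "(" and head were already consumed."""
        if head in ("(", ")"):
            raise SyntaxError("expected a keyword")
        node = [head]
        while self.peek() != ")":
            node.append(self.sexpr())
        self.expect(")")
        return node

    def sexpr(self):
        tok = self.next()
        if tok == "(":
            return self.sexpr_tail(self.next())
        if tok == ")":
            raise SyntaxError("unexpected ')'")
        return tok

    def parse_import(self):
        self.expect("(")
        self.expect("import")
        module = self.string()
        if self.peek() != "(":
            # (import "mod" "foo" externtype)
            name = self.string()
            imports = [(module, name, self.sexpr())]
            self.expect(")")
            return imports
        imports, pending, shared = [], [], None
        while self.peek() == "(":
            self.expect("(")
            head = self.next()
            if head != "item":
                # Not an item, so this list is the trailing shared externtype.
                shared = self.sexpr_tail(head)
                break
            name = self.string()
            if self.peek() == ")":
                pending.append(name)  # buffer the name until the type is seen
            else:
                imports.append((module, name, self.sexpr()))
            self.expect(")")
        self.expect(")")
        if shared is None:
            if pending:
                raise SyntaxError("names given without an externtype")
            return imports
        if imports:
            raise SyntaxError("cannot mix typed items with a shared externtype")
        return [(module, name, shared) for name in pending]
```

The single token after each item's name decides whether the item carries its own type. Names without a type are held in `pending` until the trailing externtype arrives, which is the buffering cost described above.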

bvisness added 4 commits December 4, 2025 11:16

Update binary spec to deduplicate externtypes

3c9766e

Update text encoding for new abbreviation

3b50ef1

Update test files

46ef662

Actually update test files, question mark?

21708e5

For actual real this time

d9e68c5

rossberg reviewed Dec 5, 2025

bvisness added 2 commits December 8, 2025 10:43

Change text abbreviation for new compact encoding

ae94623

Update test files

28103b4

bvisness merged commit 340c725 into main Dec 8, 2025
12 checks passed

bvisness deleted the externtype-encoding branch December 8, 2025 17:07

bvisness mentioned this pull request Dec 8, 2025

Where to place the externtype in the text format? #11

Open
