Rhombus expansion and enforestation on shrubberies #162

mflatt · 2021-07-14T20:53:49Z

This proposal builds on #122, defining a macro-expansion layer suitable for shrubberies.

In other words, it still doesn't define a language like #lang rhombus, but it defines an expansion and enforestation layer that toward that goal. It's analogous to Racket's core expander, but also defined in terms of Racket's expander.

The implementation in the proposal is currently the same as the https://github.com/mflatt/shrubbery-rhombus-0 package that makes #lang shubbery run in Racket and DrRacket (with just a few operators and a definition form).

jeapostrophe · 2021-07-15T00:34:28Z

Big comments:

The four syntactic categories are not really defended. You basically say, "Racket has three, Rhombus adds one more". If someone has never heard of Racket, how would we explain these things?

What even is a "syntactic category"? I think something like, "Based on the surrounding context, this shrubbery blob is put into one of these categories and it is the choice of the context NOT of the blob". In a naive LISP, there's just two categories because e := atom | (e . e) and there's no extension for atoms; although Racket is not naive like this.

If that answer is correct, then I think this document should say something about how we know what syntactic category a particular blob is in. I think your API is like this, because it says that rhombus-top is the interface to specifying a sequence of declarations, definitions, and expressions... but it doesn't actually say how we know that a particular blob is any one of those. Am I meant to look at parse.rkt to see how that function decides? I think we need a "guide" explanation of how to know.

I think that explanation should also justify why it is this particular set of four things.... maybe:

declarations --- Things at the top of a module are special... Why? I think I know the answer is that they might be something that the Racket macro expander has to look at first to discover more macros. But, what is an explanation for these things being special independent of the Racket macro expander? Perhaps, "declarations are part of a module, which they can influence by introducing dependencies"? If something isn't explicitly a declaration, then most declaration-consumers will take a definition?
definitions --- A definition is part of a "scope" which it can influence by introducing bindings. If something isn't explicitly a expression, then most definition-consumers will take an expression?
expressions --- An expression cannot influence anything syntactically (except through procedural-level operations like syntax-local-lift-declaration) so it can only expand to a value expression.

These feel quite natural and general. However, patterns feel very specific:

patterns --- Most binding positions will use a matching algorithm that receives a value, checks if it is valid, then defines (syntax and value) bindings based on features of that value, such as the two components of a cons cell.

That feels very particular to one "language". In other words, all of these categories are specifying an "interface" --- what they receive and what they return --- where what they receive is syntax with a promise about where it occurs and what they return is the "influence" or "effect" they can have on their context. The categories are roughly defined by the effect they can have: declarations do module-effects, like imports and submodules; definitions do binding-effects; expressions have no effects. Your "patterns" have a constraint-effect (the matcher function) and a binding-effect, where the first effect is "outward" in that it communicates to the pattern match "Don't select me" and the second effect is "inward" in that it influences a "sibling" based on the particular syntax of the matcher. Perhaps these outward/inward effects could be expressed more generally:

bindings ---A binding position is a core concept that occurs in many declarations, definitions, and expressions and it can expand to a pair of syntaxes: one which is an outward expression and the second which is an inward definition.

A "match transformer" might be

(define-syntax cons
 (singleton-struct .... #:prop binding-transformer cons-bt)))
(define-syntax (cons-bt stx)
 (syntax-parse
  [(_ carb:binding cdrb:binding)
   (cons
    #'(lambda (x) (and (cons? x) (carb.out (car x)) (cdrb.out (cdr x))))
    #'(begin
         (splicing-syntax-parameterize ([current-match-value (car (current-match-value))]) (carb.in)
         (splicing-syntax-parameterize ([current-match-value (cdr (current-match-value))]) (cdrb.in)))]))

This, of course, "knows" that it is patch of match, which is why it knows that the out effect is expected to be a procedure and the in effect is expected to look at current-match-value

I am particularly concerned about how this idea of binding patterns could, for instance, be used for non-value work, like in type declarations; consider this Haskell:

myFunction :: forall a. Ord a => [a] -> [(a, a)]

Perhaps we could write

myFunction :: forall (a <: Ord) . [a] -> [(a, a)]

to use an bounded quantification style with a binding pattern. In this case the <: operator would need to do something like

(cond
 [(syntax-am-i-doing-type-expansion?)
  (cons #'(constraint) #'(expose-type-class-members-of constraint CALLER-FILL-ME-IN))]
 [(syntax-or-is-it-pattern-matching?)
  ....])

Small comments:

I believe that this sentence --- A potential advantage of non-transitive precedence avoiding an order among operands that have make no sense next to each other. --- has a typo, because I can't understand it.

If two operators both claim a precedence relationship to each other, the relationship must be consistent; --- What is the consequence of violation of this "must"? Enforestation is undefined? It's a compile-time error?

Big shed --- I feel like the (cons/c (or/c identifier? 'default) '(stronger same weaker)) interface is verbose and think (list/c (listof identifer?) x3 (or/c stronger same weaker)) where the sets are written out with one value for default is better. If you don't agree, you should write down the error rules for when something appears twice.

Along similar lines, the Rhombus expander supports a certain style of infix and prefix operators, but it does not directly support all possible kinds of operators. --- I think you should explicitly name some desirable operators you know you won't support.

I think that :: as a declaration operator is very desirable

mflatt · 2021-07-15T14:35:49Z

@jeapostrophe - thanks for the comments.

"Binding" is a better word than "pattern", so I've switched to using that word. Where "binding" was previously used for the define-syntax sense of mapping an operator name to an operator implementation, the proposal now uses the word "mapping".

You're right that the category of a shrubbery for expansion is determined by its context, and I've updated the description to say that. I've also updated to clarify that the four categories are just the ones directly supported by the expander, while a language built on the expander can have even more categories. The rationale now starts with a paragraph justifying the four categories (which is simple: experience with Racket).

I'm not sure I understand your type-declaration example. I would expect a typed language to have an additional syntactic category for types, and the rationale now notes that possibility. I would hope that the new category is supported through a new kind of compile-time value, and not a compile-time function that an expander calls to determine there category where it's being used.

I take your "If someone has never heard of Racket, how would we explain these things?" comment as being primarily about how to justify the four syntactic categories. The comment could also suggest that the proposal is gibberish to someone who has never heard of Racket, and I would agree. If and when a Rhombus language built on this concepts exists, then it will be possible to explain everything in those terms. Meanwhile, this proposal bootstraps by using Racket for general concepts and to make the API concrete.

jeapostrophe · 2021-07-15T18:29:05Z