ginger names
This commit is contained in:
parent
fdd02aff26
commit
bfcdeb6233
173
static/src/_posts/2022-01-21-ginger-names.md
Normal file
173
static/src/_posts/2022-01-21-ginger-names.md
Normal file
@ -0,0 +1,173 @@
|
|||||||
|
---
|
||||||
|
title: >-
|
||||||
|
Ginger Names
|
||||||
|
description: >-
|
||||||
|
Thoughts about a fundamental data type.
|
||||||
|
tags: tech
|
||||||
|
series: ginger
|
||||||
|
---
|
||||||
|
|
||||||
|
The ginger language has, so far, 2 core types implemented: numbers and names.
|
||||||
|
Obviously there will be more coming later, but at this stage of development
|
||||||
|
these are all that's really needed. Numbers are pretty self explanatory, but
|
||||||
|
it's worth talking about names a bit.
|
||||||
|
|
||||||
|
As they are currently defined, a name's only property is that it can either be
|
||||||
|
equal or not equal to another name. Syntactically they are encoded as being any
|
||||||
|
alphanumeric token starting with an alphabetic character. We might _think_ of
|
||||||
|
them as being strings, but names lack nearly all capabilities that strings have:
|
||||||
|
they cannot be iterated over, they cannot be concatenated, they cannot be split.
|
||||||
|
Names can only be compared for equality.
|
||||||
|
|
||||||
|
## Utility
|
||||||
|
|
||||||
|
The use-case for names is self-explanatory: they are words which identify
|
||||||
|
something from amongst a group.
|
||||||
|
|
||||||
|
Consider your own name. It _might_ have an ostensible meaning. Mine, Brian,
|
||||||
|
means "high" (as in... like a hill, which is the possible root word). But when
|
||||||
|
people yell "Brian" across the room I'm in, they don't mean a hill. They mean
|
||||||
|
me, because that word is used to identify me from amongst others. The etymology
|
||||||
|
is essentially background information which doesn't matter.
|
||||||
|
|
||||||
|
We use names all the time in programming, though we don't always call them that.
|
||||||
|
Variable names, package names, type names, function names, struct field names.
|
||||||
|
There's also keys which get used in hash maps, which are essentially names, as
|
||||||
|
well as enumerations. By defining name as a core type we can cover a lot of
|
||||||
|
ground.
|
||||||
|
|
||||||
|
## Precedence
|
||||||
|
|
||||||
|
This is not the first time a name has been used as a core type. Ruby has
|
||||||
|
symbols, which look like `:this`. Clojure has keywords, which also look like
|
||||||
|
`:this`, and it has symbols, which look like `this`. Erlang has atoms, which
|
||||||
|
don't have a prefix and so look like `this`. I can't imagine these are the only
|
||||||
|
examples. They are all called different things, but they're all essentially the
|
||||||
|
same thing: a runtime value which can only be compared for equality.
|
||||||
|
|
||||||
|
I can't speak much about ruby, but I _can_ speak about clojure and erlang.
|
||||||
|
|
||||||
|
Clojure is a LISP language, meaning the language itself is described using the
|
||||||
|
data types and structures built into the language. Ginger is also a LISP, though
|
||||||
|
it uses graphs instead of lists.
|
||||||
|
|
||||||
|
Clojure keywords are generally used as keys to hash maps, sentinel values, and
|
||||||
|
enumerations. Besides keywords, clojure also makes use of symbols, which are
|
||||||
|
used for variable and library names. There seems to be some kind of split
|
||||||
|
ability on symbols, as they are expected to be separated on their periods when
|
||||||
|
importing, as in `clojure.java.io`. There's also a quoting mechanism in clojure,
|
||||||
|
where prefixing a symbol, or other value, with a single quote, like `'this`,
|
||||||
|
prevents it from being evaluated as a variable or function call.
|
||||||
|
|
||||||
|
It's also possible to have something get quoted multiple layers deep, like
|
||||||
|
`'''this`. This can get confusing.
|
||||||
|
|
||||||
|
Erlang is not a LISP language, but it does have atoms. These values are used in
|
||||||
|
the same way that clojure keywords are used. There is no need for a
|
||||||
|
corresponding symbol type like clojure has, since erlang is not a LISP and has
|
||||||
|
no real macro system. Atoms are sort of used like symbols, in that functions and
|
||||||
|
packages are identified by an atom, and so one can "call" an atom, like
|
||||||
|
`this()`, in order to evaluate it.
|
||||||
|
|
||||||
|
## Just Names
|
||||||
|
|
||||||
|
I don't really see the need for clojure's separation between keywords and
|
||||||
|
symbols. Symbols still need to be quoted in order to prevent evaluation either
|
||||||
|
way, so you end up with three different entities to juggle (keywords, symbols,
|
||||||
|
and symbols which won't be evaluated). Erlang's solution is simpler, atoms are
|
||||||
|
just atoms, and since evaluation is explicit there's no need for quoting. Ginger
|
||||||
|
names are like erlang atoms in that they are the only tool at hand.
|
||||||
|
|
||||||
|
The approaches of erlang vs clojure could be reframed as explicit vs implicit
|
||||||
|
evaluation of operations calls.
|
||||||
|
|
||||||
|
In ginger evaluation is currently done implicitly, but in only two cases:
|
||||||
|
|
||||||
|
* A value on an edge is evaluated to the first value which is a graph (which
|
||||||
|
then gets interpreted as an operation).
|
||||||
|
|
||||||
|
* A leaf vertex with a name value is evaluated to the first value which is not a
|
||||||
|
name.
|
||||||
|
|
||||||
|
In all other cases, the value is left as-is. A graph does not need to be quoted,
|
||||||
|
since the need to evaluate a graph as an operation is already based on its
|
||||||
|
placement as an edge or not. So the only case left where quoting is needed (if
|
||||||
|
implicit evaluation continues to be used) is a name on a leaf vertex, as in the
|
||||||
|
example before.
|
||||||
|
|
||||||
|
As an example to explore explicit vs implicit quoting in ginger, if we want to
|
||||||
|
programatically call the `AddValueIn` method on a graph, which terminates an
|
||||||
|
open edge into a value, and that value is a name, it might look like this with
|
||||||
|
implicit evaluation (the clojure-like example):
|
||||||
|
|
||||||
|
```
|
||||||
|
out = addValueIn < (g (quote < someName;) someValue; );
|
||||||
|
|
||||||
|
* or, to borrow the clojure syntax, where single quote is a shortcut:
|
||||||
|
|
||||||
|
out = addValueIn < (g; 'someName; someValue; );
|
||||||
|
```
|
||||||
|
|
||||||
|
In an explicit evaluation language, which ginger so far has not been and so this
|
||||||
|
will look weird, we might end up with something like this:
|
||||||
|
|
||||||
|
```
|
||||||
|
out = addValueIn < (eval < g; someName; eval < someValue; );
|
||||||
|
|
||||||
|
* with $ as sugar for the `eval`, like ' is a shortcut for `quote` in clojure:`
|
||||||
|
|
||||||
|
out = addValueIn < ($g; someName; $someValue; );
|
||||||
|
```
|
||||||
|
|
||||||
|
I don't _like_ either pattern, and since it's such a specific case I feel like
|
||||||
|
something less obtrusive could come up. So no decisions here yet.
|
||||||
|
|
||||||
|
## Uniqueness
|
||||||
|
|
||||||
|
There's another idea I haven't really gotten to the bottom of yet. The idea is
|
||||||
|
that a name, _maybe_, shouldn't be considered equal to the same name unless they
|
||||||
|
belong to the same graph.
|
||||||
|
|
||||||
|
For example:
|
||||||
|
|
||||||
|
```
|
||||||
|
otherFoo = { out = 'foo } < ();
|
||||||
|
|
||||||
|
out = equal < ('foo; otherFoo; );
|
||||||
|
```
|
||||||
|
|
||||||
|
This would output false. `otherFoo`'s value is the name `foo`, and the value
|
||||||
|
it's being compared to is also a name `foo`, but they are from different graphs
|
||||||
|
and so are not equal. In essence, names are automatically namespaces.
|
||||||
|
|
||||||
|
This idea only really makes sense in the context of packages, where a user
|
||||||
|
(a developer) wants to import functionality from somewhere else and use it
|
||||||
|
in their program. The code package which is imported will likely use name
|
||||||
|
values internally to implement its functionality, but it shouldn't need to worry
|
||||||
|
about naming conflicts with values passed in by the user. While it's possible to
|
||||||
|
avoid conflicts if a package is designed conscientiously, it's also easy to mess
|
||||||
|
up if one isn't careful. This becomes especially true when combining
|
||||||
|
functionality of packages with overlapping functionality, where the data
|
||||||
|
returned from one might looks _similar_ to that used by the other, but it's not
|
||||||
|
necessarily true.
|
||||||
|
|
||||||
|
On the other hand, this could create some real headaches for the developer, as
|
||||||
|
they chase down errors which are caused because one `foo` isn't actually the
|
||||||
|
same as another `foo`.
|
||||||
|
|
||||||
|
What it really comes down to is the mechanism which packages use to function as
|
||||||
|
packages. Forced namespaces will require packages to export all names which they
|
||||||
|
expect the user to need to work with the package. So the ergonomics of that
|
||||||
|
exporting, both on the user's and package's side, are really important in order
|
||||||
|
to make this bearable.
|
||||||
|
|
||||||
|
So it's hard to make any progress on determining if this idea is gonna work
|
||||||
|
until the details of packaging are worked out. But for this idea to work the
|
||||||
|
packaging is going to need to be designed with it in mind. It's a bit of a
|
||||||
|
puzzle, and one that I'm going to marinate on longer, in addition to the quoting
|
||||||
|
of names.
|
||||||
|
|
||||||
|
And that's names, their current behavior and possible future behavior. Keep an
|
||||||
|
eye out for more ginger posts in.... many months, because I'm going to go work
|
||||||
|
on other things for a while (I say, with a post from a month ago having ended
|
||||||
|
with the same sentiment).
|
Loading…
Reference in New Issue
Block a user