ginger names

2022-01-21 22:28:42 -07:00 · 2022-01-21 22:28:42 -07:00 · bfcdeb6233
commit bfcdeb6233
parent fdd02aff26
1 changed files with 173 additions and 0 deletions
--- a/static/src/_posts/2022-01-21-ginger-names.md
+++ b/static/src/_posts/2022-01-21-ginger-names.md
@@ -0,0 +1,173 @@
+---
+title: >-
+    Ginger Names
+description: >-
+    Thoughts about a fundamental data type.
+tags: tech
+series: ginger
+---
+
+The ginger language has, so far, 2 core types implemented: numbers and names.
+Obviously there will be more coming later, but at this stage of development
+these are all that's really needed. Numbers are pretty self-explanatory, but
+it's worth talking about names a bit.
+
+As they are currently defined, a name's only property is that it can either be
+equal or not equal to another name. Syntactically they are encoded as being any
+alphanumeric token starting with an alphabetic character. We might _think_ of
+them as being strings, but names lack nearly all capabilities that strings have:
+they cannot be iterated over, they cannot be concatenated, they cannot be split.
+Names can only be compared for equality.
+
+## Utility
+
+The use-case for names is self-explanatory: they are words which identify
+something from amongst a group.
+
+Consider your own name. It _might_ have an ostensible meaning. Mine, Brian,
+means "high" (as in... like a hill, which is the possible root word). But when
+people yell "Brian" across the room I'm in, they don't mean a hill. They mean
+me, because that word is used to identify me from amongst others. The etymology
+is essentially background information which doesn't matter.
+
+We use names all the time in programming, though we don't always call them that.
+Variable names, package names, type names, function names, struct field names.
+There are also keys which get used in hash maps, which are essentially names, as
+well as enumerations. By defining name as a core type we can cover a lot of
+ground.
+
+## Precedent
+
+This is not the first time a name has been used as a core type. Ruby has
+symbols, which look like `:this`. Clojure has keywords, which also look like
+`:this`, and it has symbols, which look like `this`. Erlang has atoms, which
+don't have a prefix and so look like `this`. I can't imagine these are the only
+examples. They are all called different things, but they're all essentially the
+same thing: a runtime value which can only be compared for equality.
+
+I can't speak much about ruby, but I _can_ speak about clojure and erlang.
+
+Clojure is a LISP language, meaning the language itself is described using the
+data types and structures built into the language. Ginger is also a LISP, though
+it uses graphs instead of lists.
+
+Clojure keywords are generally used as keys to hash maps, sentinel values, and
+enumerations. Besides keywords, clojure also makes use of symbols, which are
+used for variable and library names. There seems to be some kind of split
+ability on symbols, as they are expected to be separated on their periods when
+importing, as in `clojure.java.io`. There's also a quoting mechanism in clojure,
+where prefixing a symbol, or other value, with a single quote, like `'this`,
+prevents it from being evaluated as a variable or function call.
+
+It's also possible to have something get quoted multiple layers deep, like
+`'''this`. This can get confusing.
+
+Erlang is not a LISP language, but it does have atoms. These values are used in
+the same way that clojure keywords are used. There is no need for a
+corresponding symbol type like clojure has, since erlang is not a LISP and has
+no real macro system. Atoms are sort of used like symbols, in that functions and
+packages are identified by an atom, and so one can "call" an atom, like
+`this()`, in order to evaluate it.
+
+## Just Names
+
+I don't really see the need for clojure's separation between keywords and
+symbols. Symbols still need to be quoted in order to prevent evaluation either
+way, so you end up with three different entities to juggle (keywords, symbols,
+and symbols which won't be evaluated). Erlang's solution is simpler: atoms are
+just atoms, and since evaluation is explicit there's no need for quoting. Ginger
+names are like erlang atoms in that they are the only tool at hand.
+
+The approaches of erlang vs clojure could be reframed as explicit vs implicit
+evaluation of operation calls.
+
+In ginger, evaluation is currently done implicitly, but in only two cases:
+
+* A value on an edge is evaluated to the first value which is a graph (which
+  then gets interpreted as an operation).
+
+* A leaf vertex with a name value is evaluated to the first value which is not a
+  name.
+
+In all other cases, the value is left as-is. A graph does not need to be quoted,
+since the need to evaluate a graph as an operation is already based on its
+placement as an edge or not. So the only case left where quoting is needed (if
+implicit evaluation continues to be used) is a name on a leaf vertex, as in the
+second case above.
+
+As an example to explore explicit vs implicit evaluation in ginger, if we want
+to programmatically call the `AddValueIn` method on a graph, which terminates
+an open edge into a value, and that value is a name, it might look like this
+with implicit evaluation (the clojure-like example):
+
+```
+out = addValueIn < (g; quote < someName; someValue; );
+
+* or, to borrow the clojure syntax, where single quote is a shortcut:
+
+out = addValueIn < (g; 'someName; someValue; );
+```
+
+In an explicit evaluation language, which ginger so far has not been and so this
+will look weird, we might end up with something like this:
+
+```
+out = addValueIn < (eval < g; someName; eval < someValue; );
+
+* with $ as sugar for the `eval`, like ' is a shortcut for `quote` in clojure:
+
+out = addValueIn < ($g; someName; $someValue; );
+```
+
+I don't _like_ either pattern, and since it's such a specific case I feel like
+something less obtrusive could come up. So no decisions here yet.
+
+## Uniqueness
+
+There's another idea I haven't really gotten to the bottom of yet. The idea is
+that a name, _maybe_, shouldn't be considered equal to the same name unless they
+belong to the same graph.
+
+For example:
+
+```
+otherFoo = { out = 'foo } < ();
+
+out = equal < ('foo;  otherFoo; );
+```
+
+This would output false. `otherFoo`'s value is the name `foo`, and the value
+it's being compared to is also a name `foo`, but they are from different graphs
+and so are not equal. In essence, names are automatically namespaced.
+
+This idea only really makes sense in the context of packages, where a user
+(a developer) wants to import functionality from somewhere else and use it
+in their program. The code package which is imported will likely use name
+values internally to implement its functionality, but it shouldn't need to worry
+about naming conflicts with values passed in by the user. While it's possible to
+avoid conflicts if a package is designed conscientiously, it's also easy to mess
+up if one isn't careful. This becomes especially true when combining
+packages with overlapping functionality, where the data returned from one
+might look _similar_ to that used by the other, without necessarily being
+the same.
+
+On the other hand, this could create some real headaches for the developer, as
+they chase down errors which are caused because one `foo` isn't actually the
+same as another `foo`.
+
+What it really comes down to is the mechanism which packages use to function as
+packages. Forced namespaces will require packages to export all names which they
+expect the user to need to work with the package. So the ergonomics of that
+exporting, both on the user's and package's side, are really important in order
+to make this bearable.
+
+So it's hard to make any progress on determining if this idea is gonna work
+until the details of packaging are worked out. But for this idea to work the
+packaging is going to need to be designed with it in mind. It's a bit of a
+puzzle, and one that I'm going to marinate on longer, in addition to the quoting
+of names.
+
+And that's names, their current behavior and possible future behavior. Keep an
+eye out for more ginger posts in.... many months, because I'm going to go work
+on other things for a while (I say, with a post from a month ago having ended
+with the same sentiment).