1. 24 Jan, 2021 12 commits
    • Eric Myhre's avatar
      schema compiler: enum validation rules. · 6b5a471f
      Eric Myhre authored
      And I think this now ports all the rules we'd previously written
      against the attempted schema2 (the wrapper style) API.
      So I can now remove the rest of that in the next commit.
      
      It will soon be time to start updating all the gengo stuff to
      use this, including all of its tests.
      6b5a471f
    • Eric Myhre's avatar
      schema compiler: union validation rules. · ea79ea43
      Eric Myhre authored
      Also, shift some error message text towards a more consistent phrasing.
      ea79ea43
    • Eric Myhre's avatar
      schema compiler: union rules. · 6897bb3c
      Eric Myhre authored
      Commit to the strategy of having the first flunked rule for a type
      result in short-circuit skipping of subsequent rules.  It's simple,
      and it's sufficient.
      6897bb3c
    • Eric Myhre's avatar
      schema: connect validation, Compile() now works. · 1b282cfa
      Eric Myhre authored
      More rules are still to come.
      1b282cfa
    • Eric Myhre's avatar
      schema: clear up some things about TypeName vs TypeReference. · b6009032
      Eric Myhre authored
      Both are now accessible.  Name is not always present.
      
      Get rid of casts that are unnecessary.
      
      Constructors for anonymous types are still upcoming;
      all the current constructors dupe the name into the reference field.
      Planning to add distinct methods on the Compiler for anon types.
      b6009032
    • Eric Myhre's avatar
    • Eric Myhre's avatar
      schema: architecture design records for compiler. · 8777aae1
      Eric Myhre authored
      It details a variety of considered approaches.
      
      Spoiler: I'm not actually super pleased with the one I'm currently
      pursuing.  The amount of boilerplate I'm grinding out for this is
      really, really no fun at all.  It's possibly that the reasoning leading
      here is still sound.  It's just unpleasant.
      8777aae1
    • Eric Myhre's avatar
      schema: beginning (re)implementation of validation rules. · fe935c9c
      Eric Myhre authored
      Carving out hunks of the schema2 implementation of them (which still
      hoped to use the dmt more directly) as I port them.
      
      As comments in the diff state: I had a hope here that I could make
      this relatively table-driven, which would increase legibility and
      make it easier to port these checks to other implementations as well.
      We'll... see how that goes; it's not easy to flatten.
      fe935c9c
    • Eric Myhre's avatar
    • Eric Myhre's avatar
      schema: so much boilerplate for feeding information to the Compiler that I... · d74ecb3e
      Eric Myhre authored
      schema: so much boilerplate for feeding information to the Compiler that I wrote another supplementary code generator.
      
      (I'm getting very weary of golang.)
      
      This new bit of codegen makes the compiler.go file fairly readable
      again, though, so I'm satisfied with it.
      
      The Compiler API is now complete enough that I can start repairing
      other things to use it properly.  The schemadmt.Schema.Compile()
      function and all of its helpers compile again now.  So does *most*
      of the whole codegen system... with the notable exception of all
      the hardcoded typesystem spawning which used the old placeholder
      methods which have now been stricken.
      
      TypeSystem now maintains order.  This allowed me to remove some
      sort operations from the code generator.  This also means the next
      time any existing codegen is re-run, the output file will shift
      significantly.  However, it shouldn't do so again in the future.
      d74ecb3e
    • Eric Myhre's avatar
      schema/compiler: move into schema package. · de9e49b0
      Eric Myhre authored
      As with parent commit: this is a checkpoint.  CI will not be passing.
      de9e49b0
    • Eric Myhre's avatar
      schema: working to unify interfaces and dmt. Intermediate checkpoint commit. · f1859e77
      Eric Myhre authored
      This commit does not pass CI or even fully compile, and while I usually
      try to avoid those, A) I need a checkpoint!, and B) I think this one is
      interestingly illustrative, and I'll probably want to refer to this
      diff and the one that will follow it in the future as part of
      architecture design records (or even possibly experience reports about
      golang syntax).
      
      In this commit: we have three packages:
      
      - schema: full of interfaces (and only interfaces)
      - schema/compiler: creates values matching schema interfaces
      - schema/dmt: contains codegen'd types that parse schema documents.
      
      The dmt package feeds data to the compiler package, and the compiler
      package emits values matching the schema interface.
      This all works very nicely and avoids import cycles.
      
      (Avoiding import cycles has been nontrivial, here, unfortunately.
      The schema/schema2 package (which is still present in this commit,
      but will be removed shortly -- I've scraped most of it over into
      this new 'compiler' package already, just not a bunch of the validation
      rules stuff, yet) was a dream of making this all work by just having
      thin wrapper types around the dmt types.  This didn't fly...
      because codegen'd nodes comply with `schema.TypedNode`, and complying
      with `schema.TypedNode` means they have a function which references
      `schema.Type`... and that means we really must depend on that
      interface and the package it's in.  Ooof.)
      
      The big downer with this state, and why things are currently
      non-compiling at this checkpoint I've made here, is that we have to
      replicate a *lot* of methods into single-use interfaces in the schema
      package for this to work.  This belies the meaning of "interface".
      The reason we'd do this -- the reason to split 'compiler' into its own
      package -- is most because I wanted to keep all the constructor
      mechanisms for schema values out of the direct path of the user's eye,
      because most users shouldn't be using the compiler directly at all.
      
      But... I'm shifting to thinking this attempt to segregate the compiler
      details isn't worth it.  A whole separate package costs too much.
      Most concretely, it would make it impossible to make the `schema.Type`
      interface "closed" (e.g. by having an unexported method), and I think
      at that point we would be straying quite far from desired semantics.
      f1859e77
  2. 21 Jan, 2021 1 commit
    • Daniel Martí's avatar
      schema/gen/go: cache genned code in os.TempDir · 0b3adb9d
      Daniel Martí authored
      This means we no longer clutter the repository with lots of files, even
      if they are git-ignored. It's always a bit of a red flag when you run
      "go test ./..." and the result is a bunch of leftover files.
      
      We still want to keep the files around, for the sake of Go's build
      cache. And we still want their paths to be static between "go test"
      runs. So put them in a static dir under os.TempDir.
      
      This does mean that concurrent runs of these tests will likely not work
      well. I don't imagine that's going to be a problem anytime soon, though.
      If it really becomes a problem in the future, we could figure something
      out like grabbing a file lock for the directory.
      
      The idea behind using os.TempDir is that it will likely remain in place
      between a number of "go test" runs within a hacking session, but it will
      be eventually cleaned up by the system, such as when rebooting.
      
      Note that we need to use globbing since one can't build "proper
      packages" located outside a module. The only exception is building an
      ad-hoc set of explicit Go files. While at it, use filepath.Join, to be
      nice.
      0b3adb9d
  3. 18 Jan, 2021 1 commit
  4. 10 Jan, 2021 2 commits
    • Daniel Martí's avatar
      schema/gen/go: remove two common subtest levels · d02c3602
      Daniel Martí authored
      Practically every subtest ends up at 7 or so levels of names, like:
      
      	TestMapsContainingMaybe/maybe-using-ptr/generate/compile/bhvtest/non-nullable/typed-create
      
      However, note that the "generate" and "compile" levels are always there,
      so their presence just adds verbosity in the output and makes the
      developer's life more difficult.
      
      Extremely nested sub-tests are already rare, so at least we can just
      keep the components that add useful information in the output.
      
      "bhvtest" is also pretty redundant, but that one actually matters - its
      subtest can be skipped depending on build tags.
      d02c3602
    • Daniel Martí's avatar
      schema/gen/go: please vet a bit more · 6796504d
      Daniel Martí authored
      In particular, this removes ~50 out of the 2.7k warnings in 'go vet
      ./...' in this repository. Mainly, the "unreachable code" ones.
      
      This was caused by edge cases in some of the generated code which caused
      an unconditional return or panic statement to be followed by other code.
      Fix all of them with a bit more template logic.
      
      Some of the Next methods go a bit further. If they serve no purpose as
      the switch has no cases to be matched, just unconditionally return an
      error. In the future we can perhaps reuse a single function for that.
      
      Finally, I was having a hard time actually following the logic in
      kindedUnionNodeAssemblerMethodTemplateMunge, so I've indented the code a
      bit to follow the template logic and scoping.
      
      These changes move us towards pleasing vet, which is nice, but also make
      the code waste a bit less space.
      6796504d
  5. 07 Jan, 2021 1 commit
  6. 03 Jan, 2021 4 commits
  7. 31 Dec, 2020 1 commit
  8. 27 Dec, 2020 1 commit
  9. 25 Dec, 2020 1 commit
    • Daniel Martí's avatar
      all: rename schema.Kind to TypeKind, ipld.ReprKind to Kind · 2d7d25c4
      Daniel Martí authored
      As discussed on the issue thread, ipld.Kind and schema.TypeKind are more
      intuitive, closer to the spec wording, and just generally better in the
      long run.
      
      The changes are almost entirely automated via the commands below. Very
      minor changes were needed in some of the generators, and then gofmt.
      
      	sed -ri 's/\<Kind\(\)/TypeKind()/g' **/*.go
      	git checkout fluent # since it uses reflect.Value.Kind
      
      	sed -ri 's/\<Kind_/TypeKind_/g' **/*.go
      	sed -i 's/\<Kind\>/TypeKind/g' **/*.go
      	sed -i 's/ReprKind/Kind/g' **/*.go
      
      Plus manually undoing a few renames, as per Eric's review.
      
      Fixes #94.
      2d7d25c4
  10. 17 Dec, 2020 1 commit
    • Daniel Martí's avatar
      all: rename AssignNode to ConvertFrom · 6e6625bd
      Daniel Martí authored
      This should be more intuitive to Go programmers, since assignments are
      generally trivial operations, but conversions imply that extra work
      might be needed to adapt the value to fit in the recipient.
      
      The entire change is just:
      
      	sed -ri 's/AssignNode/ConvertFrom/g' **/*.go
      
      Downstream users can very likely use the same line to fix their function
      declarations and calls.
      
      Fixes #95.
      6e6625bd
  11. 16 Dec, 2020 1 commit
    • Daniel Martí's avatar
      all: rewrite interfaces and APIs to support int64 · f6e9a891
      Daniel Martí authored
      We only supported representing Int nodes as Go's "int" builtin type.
      This is fine on 64-bit, but on 32-bit, it limited those node values to
      just 32 bits. This is a problem in practice, because it's reasonable to
      want more than 32 bits for integers.
      
      Moreover, this meant that IPLD would change behavior if built for a
      32-bit platform; it would not be able to decode large integers, for
      example, when in fact that was just a software limitation that 64-bit
      builds did not have.
      
      To fix this problem, consistently use int64 for AsInt and AssignInt.
      
      A lot more functions are part of this rewrite as well; mainly, those
      revolving around collections and iterating. Some might never need more
      than 32 bits in practice, but consistency and portability is preferred.
      Moreover, many are interfaces, and we want IPLD interfaces to be
      flexible, which will be important for ADLs.
      
      Below are some GNU sed lines which can be used to quickly update
      function signatures to use int64:
      
      	sed -ri 's/(func.* AsInt.*)\<int\>/\1int64/g' **/*.go
      	sed -ri 's/(func.* AssignInt.*)\<int\>/\1int64/g' **/*.go
      	sed -ri 's/(func.* Length.*)\<int\>/\1int64/g' **/*.go
      	sed -ri 's/(func.* LookupByIndex.*)\<int\>/\1int64/g' **/*.go
      	sed -ri 's/(func.* Next.*)\<int\>/\1int64/g' **/*.go
      	sed -ri 's/(func.* ValuePrototype.*)\<int\>/\1int64/g' **/*.go
      
      Note that the function bodies, as well as the code that calls said
      functions, may need to be manually updated with the integer type change.
      That cannot be automated, because it's possible that an automated fix
      would silently introduce potential overflows not being handled.
      
      Some TODOs and FIXMEs for overflow checks are removed, since we remove
      some now unnecessary int64->int conversions. On the other hand, the
      older codecs based on refmt need to gain some overflow check TODOs,
      since refmt uses ints. That is okay for now, since we'll phase out refmt
      pretty soon.
      
      While at it, update codectools to use int64 for token Length fields, so
      that it properly supports full IPLD integers without machine-dependent
      behavior and overflow checks. The budget integer is also updated to be
      int64, since the lengths it uses are now int64.
      
      Note that this refactor needed changes to the Go code generator as well
      as some of the tests, for the purpose of updating all the code.
      
      Finally, note that the code-generated iterator structs do not use int64
      fields internally, even though they must return int64 numbers to
      implement the interface. This is because they use the numeric fields to
      count up to a small finite amount (such as the number of fields in a Go
      struct), or up to the length of a map/slice. Neither of them can ever
      outgrow "int".
      
      Fixes #124.
      f6e9a891
  12. 13 Dec, 2020 5 commits
  13. 04 Dec, 2020 3 commits
    • Eric Myhre's avatar
      draft of schema types using codegen for data model, with a package for the... · 2e68fd35
      Eric Myhre authored
      draft of schema types using codegen for data model, with a package for the fully validated data which is implemented by retaining and accessing into the raw data.
      2e68fd35
    • Eric Myhre's avatar
      codegen: assembler for struct with map representation now validates all... · f8d654da
      Eric Myhre authored
      codegen: assembler for struct with map representation now validates all non-optional fields are present.
      
      This continues what https://github.com/ipld/go-ipld-prime/pull/111/ did
      and adds the same logic to the map representation.  The actual state
      tracking works the same way (and was mostly already there).
      
      Rearranged the tests slightly.
      
      Made error messages include both field name and serial key when they
      differ due to a rename directive.  (It's possible this error would get
      nicer if it used a list of StructField instead of just strings, but it
      would also get more complicated.  Maybe revisit later.)
      f8d654da
    • Daniel Martí's avatar
      all: fix a lot of "unkeyed literal" vet warnings · 354f194f
      Daniel Martí authored
      Reduces the output of 'go vet ./...' from 374 lines to 96. Many warnings
      remain, but I have lost my patience for today.
      
      Most of the changes below were automated, especially the single-line
      mixins expressions. Unfortunately, many of the Traits structs required
      manual copy-pasting.
      354f194f
  14. 30 Nov, 2020 1 commit
    • Will's avatar
      Allow overriden types (#116) · fe47b7f0
      Will authored
      This change will look at the destination package that codegen is being built into, and will skip generation of types that are already declared by files not prefixed with `ipldsch_`.
      
      This isn't the cleanest escape-hatch, but it's a start.
      fe47b7f0
  15. 17 Nov, 2020 4 commits
    • Will Scott's avatar
      add import to ipld in ipldsch_types.go · 9656675b
      Will Scott authored
      cleanup from #105
      9656675b
    • Eric Myhre's avatar
      codegen: rename files. · 32e66f20
      Eric Myhre authored
      An underscore; and less "gen", because reviewers indicated it felt redundant.
      32e66f20
    • Eric Myhre's avatar
      codegen: deterministic order for types in output. · 4f333954
      Eric Myhre authored
      I'd still probably prefer to replace this with simply having a stable
      order that is carried through consistently, but that remains blocked
      behind getting self-hosted types, and while it so happens I also got
      about 80% of the way there on those today, the second 80% may take
      another day.  Better make this stable rather than wait.
      4f333954
    • Eric Myhre's avatar
      codegen: rearrange output into finite number of files. · 76193e5d
      Eric Myhre authored
      Also, emit some comments around the type definitions.
      
      The old file layout is still available, but renamed to GenerateSplayed.
      It will probably be removed in the future.
      
      The new format does not currently have stable output order.
      I'd like to preserve the original order given by the schema,
      but our current placeholder types for schema data don't have this.
      More work needed on this.
      76193e5d
  16. 14 Nov, 2020 1 commit