Introducing Gallia: a library for data manipulation

Another update:

  1. Switched license to BSL:
    a. See the FAQ explaining the license in non-legal terms; in a nutshell, it’s free if what you do is essential or you are a small entity
    b. See the license itself, especially the Additional Use Grant part

  2. Added a full-blown example of leveraging Spark RDDs with Gallia, in combination with EMR: see repo
    a. Try it out with the test-run script
    b. See driver: GeneManiaSparkDriver.scala
    c. See transformations in the Spark-unaware GeneMania.scala (in parent repo)

  3. Added macros in their own repo, which basically help with going to and from case classes/Gallia instances; see them in action here.

  4. Next steps will be:
    a. Publishing binaries: requires some legal scrutiny due to Apache 2 dependencies
    b. Trying out Scala 3/Dotty and starting to adapt the code wherever it’s not too disruptive
    c. Trying to present Gallia at conferences/meetups (I’ll come wherever there’s free pizza)

Stay tuned!