As far as practicality goes it has 3 problems that I’m working on with various degree of success.
1st is speeeed, most implementations I’ve seen a horribly slow and the algorithm itself is prone to exponental explosion. My idea here is using a clustered approach where dataset is split by hamming distance and each cluster is processed in parallel.
2nd general obscurity, despite the fact that my institute used it for decades featuring numerous research projects, there is 0 reusable libraries even internally. jsm4s aims to fill that void.
3rd is adapting datasets to the strict boolean logic of JSM. I plan a short blog post that will touch on this topic, in short it’s rather involved but quite doable.