Another "innovation" that somehow requires 26 million parameters to recall 3 decent responses from the most basic conversation I've had with a friend. https://github.com/cactus-compute/needle