Inference Engine for Retrieval Augmented Systems

Inspired from google deepmind's RETRO project, Piramid is meant to convert traditional RAG applications involving separate LLM and Database connections into one single hosted binary to serve and fuse transformer's attention with database queries.

Read the blogView on GitHubcargo install piramid