Actually, the name definitely came after noticing RAM prices. Though the idea where the graph-in-memory only for ephemeral RAG sessions came first, we won't pretend the naming wasn't influenced by RAM being in the spotlight.
The separate graph and vector storage can indeed add overhead for short-lived tasks. I've found that using a dual-memory architecture, where episodic and semantic memories coexist, can streamline this process and reduce complexity. If you're interested in seeing how this could work, I put together some tutorials on similar setups: https://github.com/NirDiamant/agents-towards-production
VoidWhisperer | a month ago
mirekrusin | a month ago
[OP] ktyptorio | a month ago
zwaps | a month ago
Where might one see more about what type of indexing you do to get the graph?
threecheese | a month ago
[OP] ktyptorio | a month ago
ekianjo | a month ago
[OP] ktyptorio | a month ago
Vector search (HNSW): Find top-k similar entities/text units from the query embedding
Graph traversal (BFS): From those seed entities, traverse relationships (up to 2 hops by default) to find connected entities
This catches both semantically similar entities AND structurally related ones that might not match the query text.
Implementation: https://github.com/gibram-io/gibram/blob/main/pkg/engine/eng...
kordlessagain | a month ago
nirdiamant | a month ago