What's holding me back from AI repos and agents isn't running them locally, though. It's the lack of granular control. I'm not even sure what I want. I certainly don't want to approve every request, but the idea of large amounts of personal data being accessible, unchecked, to an AI is concerning.
Perhaps what's needed is an agent that focuses just on security and learns your personal preferences.
I'm not 100% sure which ingestion methods are available. The browser extension clipper and RSS are two. I guess I can manually create a node/atom? Can it scan a local folder for markdown notes? Or OCR some PDFs -> markdown/frontmatter sidecar files -> atomic nodes? That would be the dream.
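To make the folder-scan idea concrete, here is roughly what I'm picturing. This is only a sketch of the wished-for behaviour, not how the app works: `split_frontmatter`, `scan_notes`, and the dict fields are names I made up, and the parser only handles flat `key: value` frontmatter rather than full YAML.

```python
from pathlib import Path


def split_frontmatter(text):
    """Split a note into (metadata dict, body).

    Assumption: frontmatter is a block of flat `key: value` lines
    between two `---` lines at the top of the file.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            meta = {}
            for line in lines[1:i]:
                key, sep, value = line.partition(":")
                if sep:
                    meta[key.strip()] = value.strip()
            body = "\n".join(lines[i + 1:]).lstrip("\n")
            return meta, body
    # No closing delimiter found: treat the whole file as body.
    return {}, text


def scan_notes(folder):
    """Yield one hypothetical 'atom' dict per markdown file under folder."""
    for path in sorted(Path(folder).rglob("*.md")):
        meta, body = split_frontmatter(path.read_text(encoding="utf-8"))
        yield {
            "source": str(path),
            "title": meta.get("title", path.stem),  # fall back to file name
            "meta": meta,
            "body": body,
        }
```

An OCR step would slot in before this: write each PDF's text out as a markdown sidecar file with frontmatter, then let the same scan pick it up.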
I saw sqlite-vec for semantic search, so I assume notes are stored in SQLite.
- What considerations did you have for the storage layer?
- Also, does storage on disk increase linearly as the number of notes/atoms grows?
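For context on the second question, this is the back-of-envelope model I have in my head. It assumes one fixed-size float32 embedding per atom (4 bytes per dimension) plus the raw text, and ignores index and page overhead entirely. The 384 dimensions and the 2 KB average note size are placeholder numbers of mine, not anything I know about the project.

```python
def estimate_bytes(num_atoms, embedding_dim=384, avg_text_bytes=2048):
    """Rough on-disk size under the assumptions above (no overhead counted)."""
    vector_bytes = num_atoms * embedding_dim * 4  # float32 = 4 bytes per value
    text_bytes = num_atoms * avg_text_bytes
    return vector_bytes + text_bytes


# Under this model, size is strictly proportional to the atom count.
print(estimate_bytes(10_000))  # 35840000 bytes, about 36 MB
```

If the real growth is worse than this (extra indexes, chunking into several embeddings per note), that's the part I'm curious about.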