This project makes it easy to integrate LLMs into your Java projects. It is functional but still needs some improvements.
Current Features:
Connects to llama.cpp
Sends input and retrieves the bot's response in real time
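
As a rough illustration of the two features above, here is a minimal sketch that assumes llama.cpp is running in its HTTP server mode on localhost:8080 and that the response is streamed from its /completion endpoint. This is not necessarily how this project connects to llama.cpp. The class name LlamaStreamSketch, the method names, and the URL are hypothetical and chosen only for the example.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.stream.Stream;

// Hypothetical sketch, not this project's actual API.
class LlamaStreamSketch {

    /** Escapes a string so it can be embedded in a JSON string literal. */
    static String jsonEscape(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.toString();
    }

    /** Builds the JSON body for a streaming completion request. */
    static String requestBody(String prompt, int nPredict) {
        return "{\"prompt\":\"" + jsonEscape(prompt) + "\",\"n_predict\":" + nPredict
                + ",\"stream\":true}";
    }

    /** Sends the prompt and prints each streamed chunk as soon as it arrives. */
    static void streamCompletion(String baseUrl, String prompt) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(URI.create(baseUrl + "/completion"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(requestBody(prompt, 64)))
                .build();
        // ofLines() exposes the body as a lazy stream of lines, so output appears in real time.
        HttpResponse<Stream<String>> response =
                client.send(request, HttpResponse.BodyHandlers.ofLines());
        response.body()
                .filter(line -> line.startsWith("data: "))   // server-sent event payload lines
                .forEach(line -> System.out.println(line.substring(6)));
    }
}
```

Calling LlamaStreamSketch.streamCompletion("http://localhost:8080", "Hello") would print the raw JSON chunks as they arrive. Extracting the generated text from each chunk is left out to keep the sketch free of a JSON library.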
Issues:
Long inputs cause lower-end devices to slow down
No memory of earlier turns, so the bot cannot reference previous responses
Future Plans:
Allocate some memory on the device's GPUs
Fix the memory issue (current ideas: create a local database for referencing previous conversations, or use smart prompting)
Create an alternative to AutoGPT-style agents by using different models, each handling tasks for a specific purpose
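
The "smart prompting" idea for the memory issue could look something like the sketch below: keep the most recent exchanges in memory and prepend them to each new prompt, dropping the oldest ones once a size budget is exceeded so that prompts stay short on lower-end devices. The class ConversationMemory, its methods, and the "User:"/"Bot:" prompt format are all hypothetical, and the character budget is only a crude stand-in for a real token count.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical helper: remembers recent turns and prepends them to each new prompt.
class ConversationMemory {
    private final Deque<String> turns = new ArrayDeque<>();
    private final int maxChars;   // rough budget; characters stand in for tokens
    private int usedChars = 0;

    ConversationMemory(int maxChars) {
        this.maxChars = maxChars;
    }

    /** Records one exchange and evicts the oldest exchanges once over budget. */
    void addTurn(String userInput, String botResponse) {
        String turn = "User: " + userInput + "\nBot: " + botResponse + "\n";
        turns.addLast(turn);
        usedChars += turn.length();
        // Always keep the newest turn, even if it alone exceeds the budget.
        while (usedChars > maxChars && turns.size() > 1) {
            usedChars -= turns.removeFirst().length();
        }
    }

    /** Builds the text to send to the model: remembered turns, then the new input. */
    String buildPrompt(String userInput) {
        StringBuilder sb = new StringBuilder();
        for (String turn : turns) {
            sb.append(turn);
        }
        return sb.append("User: ").append(userInput).append("\nBot:").toString();
    }

    public static void main(String[] args) {
        ConversationMemory memory = new ConversationMemory(200);
        memory.addTurn("My name is Sam.", "Nice to meet you, Sam!");
        System.out.println(memory.buildPrompt("What is my name?"));
    }
}
```

The local database idea could sit behind the same two methods, with addTurn writing to storage and buildPrompt reading back only the turns relevant to the new input.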