this post was submitted on 24 Jul 2024
53 points (90.8% liked)

Selfhosted

52479 readers
2193 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

Im using Ollama on my server with the WebUI. It has no GPU so its not quick to reply but not too slow either.

Im thinking about removing the VM as i just dont use it, are there any good uses or integrations into other apps that might convince me to keep it?

you are viewing a single comment's thread
view the rest of the comments
[–] RandomLegend@lemmy.dbzer0.com 8 points 1 year ago (2 children)

Any model recommendation for that?

The ones i tried get stuck in a loop at some point due to the small context windows.

[–] 1rre@discuss.tchncs.de 3 points 1 year ago (2 children)

Yeah even gpt4o couldn't keep track of encounters, run battles etc. in my case...

I think if you wanted to do it mechanically consistently you'd probably need to integrate it into a vtt where you give it context and potentially fine-tune it to give quest related summaries & gming rather than just "stuff"

[–] Bluesheep@lemmy.world 2 points 1 year ago

I don’t know how tech savvy you are, but I’m assuming since your on lemmy it’s pretty good :)

The way we’ve solved this sort of problem in the office is by using the LLM’s JSON response, and a prompt that essentially keeps a set of JSON objects alongside the actual chat response.

In the DND example, this would be a set character sheets that get returned every response but only changed when the narrative changes them. More expensive, and needing a larger context window, but reasonably effective.

[–] RandomLegend@lemmy.dbzer0.com 2 points 1 year ago

VTT integration would be one hell of a job to do.

[–] WeLoveCastingSpellz@lemmy.dbzer0.com 1 points 1 year ago* (last edited 1 year ago) (1 children)

the answer is very spesific to ur pc and amount of vram you have availşble to you. But anything lama 3 even 8b models finetuned to DM or write stories should theoritically work. The other reply that reccomends connecting to another program to make sure rules are consistent sounds like a great idea whşch I have not tried. I use silly tavern as the ui whşch has lots of options and shit to mske thşngs wkrk well. I would reccomend goşng şnto the "KoboldAI" discord and askşng şn the support sectşon folk there are very helpfull sorry for not beşng able to gşve a strsight answer Also boost the context size way up that shit makes dşfference I habe like 16k or sumthin. good luck!

[–] RandomLegend@lemmy.dbzer0.com 3 points 1 year ago (1 children)

What on earth is going on with your keyboad?!

Besides that, i have 20GB of VRAM and 64GB or RAM. I can run the mixtral 8x7b model relatively usable. Currently i use oobabooga the most.

[–] WeLoveCastingSpellz@lemmy.dbzer0.com 0 points 1 year ago* (last edited 1 year ago)

I type very poorly on my phone. with that much vram ypu csn get somethşng lşke a 70b model defineyly ask around in the koboldai community that shşt's crszy