

Did you use a heavily quantized version? Those models are much smaller than the state of the art ones to begin with, and if you chop their weights from float16 to float2 or something it reduces their capabilities a lot more
Did you use a heavily quantized version? Those models are much smaller than the state of the art ones to begin with, and if you chop their weights from float16 to float2 or something it reduces their capabilities a lot more
Yep, the OpenAI api and/or the ollama one work for this no problem in most projects. You just give it the address and port you want to connect to, and that port can be localhost, lan, another server on another network, whatever.
If you don’t mind me asking, how does that work? I don’t know a lot about vtubers - is your crush on the character? On the vtuber themself? Do you know anything about the person behind the character or are they fully in character at all times?
No pressure to answer this if it’s too prying, tho!
Wow it’s almost like if they didn’t tell you, you wouldn’t know, and you’d continue to think this was accurate
So if I’m understanding you right, your crush is primarily on the person herself and/or who she wants to be and the character aspect of it is secondary? That’s kinda the opposite of what my ignorant guess would be so it’s really interesting to read. Thanks for answering!