Faster Ollama alternative

RandomlyRight@sh.itjust.works · 3 months ago

I’ve read about this method in the GitHub issues, but to me it seemed impractical to have different models just to change the context size, and that was the point I started looking for alternatives

RandomlyRight@sh.itjust.works · 3 months ago

It was multiple models, mainly 32-70B

RandomlyRight@sh.itjust.works · 3 months ago

There are many projects out there optimizing the speed significantly. Ollama is unbeaten in the convenience though

RandomlyRight@sh.itjust.works · 3 months ago

Yeah, but there are many open issues on GitHub related to these settings not working right. I’m using the API, and just couldn’t get it to work. I used a request to generate a json file, and it never generated one longer than about 500 lines. With the same model on vllm, it worked instantly and generated about 2000 lines

RandomlyRight@sh.itjust.works · 3 months ago

Faster Ollama alternative

RandomlyRight@sh.itjust.works · 4 months ago

Take a look at NVIDIA Project Digits. It’s supposed to release in May for 3k usd and will be kind of the only sensible way to host LLMs then:

https://www.nvidia.com/en-us/project-digits/

RandomlyRight@sh.itjust.works · 8 months ago

How is Apple pretty bad?

RandomlyRight@sh.itjust.works · 9 months ago

I’ve discovered it just a few days ago and now use it on all my machines

RandomlyRight@sh.itjust.works · edit-2 9 months ago

For anyone trying this, make sure you do not have “- TS_USERSPACE=false” in your yaml from previous experimentation. After removing this, it works for me too.

In the documentation they say to add sysctl entries, it is possible in docker compose like so:

tailscale:
    sysctls:
      - net.ipv4.ip_forward=1
      - net.ipv6.conf.all.forwarding=1

But it does not seem to make a difference for me. Does anyone know why these would not be required in this specific setup?

RandomlyRight@sh.itjust.works · 9 months ago

Thank you, really appreciate it!

RandomlyRight@sh.itjust.works · 9 months ago

Do you have any links/sources about this? I’m not saying you’re wrong, I’m just interested

RandomlyRight@sh.itjust.works · 1 year ago

Do you have an example? I’m genuinely curious, I’ve heard a lot about this theory but can’t really imagine how you would differentiate bots from mindless redditors farming for karma by saying „This.“

RandomlyRight@sh.itjust.works · 1 year ago

You can only resign from being part of the church, which many young people do once they see this on their first paycheck.

RandomlyRight@sh.itjust.works · 2 years ago

This is pretty cool! Just installed it. But the css doesn’t seem to load properly, following the tutorial. Do you also have weird stuff going on with the top bar still showing sometimes?