What are the steps required to get this running in VS Code?
If they had linked to the instructions in their post (or better yet a link to a one click install of a VS Code Extension), it would help a lot with adoption.
(BTW, I consider it malpractice that they are at the top of Hacker News with a model that is of great interest to a large portion of the users here, and they do not have a monetizable call to action featured on the page.)
If you can run this using ollama, then you should be able to use https://www.continue.dev/ with both IntelliJ and VSCode. Haven’t tried this model yet - but overall this plugin works well.
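For what it's worth, continue.dev's Ollama provider just talks to the local Ollama HTTP server on port 11434, so you can sanity-check the setup outside the editor first. A minimal Python sketch of that (the codestral-mamba tag is hypothetical until Ollama/llama.cpp actually support the model, as noted in the reply below):

    # Minimal sketch: query the local Ollama HTTP API directly.
    # continue.dev's "ollama" provider talks to this same endpoint.
    # The "codestral-mamba" tag is hypothetical until support lands.
    import json
    import urllib.request

    payload = json.dumps({
        "model": "codestral-mamba",  # hypothetical tag
        "prompt": "Write a Python function that reverses a string.",
        "stream": False,
    }).encode()

    req = urllib.request.Request(
        "http://localhost:11434/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        print(json.loads(resp.read())["response"])

If that responds, pointing the plugin at the same model should work too.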
Correct. The only back-end that Ollama uses is llama.cpp, and llama.cpp does not yet have Mamba2 support. The issues to track Mamba2 and Codestral Mamba support are here:
Unrelated, all my devices freeze when accessing this page, desktop Firefox and Chrome, mobile Firefox and Brave.
Is this the best alternative for accessing AI code helpers in VS Code besides GitHub Copilot and Google Gemini?
I've been using it for a few months (with Starcoder 2 for code, and GPT-4o for chat). I find the code completion actually better than GitHub Copilot.
My main complaint is that the chat sometimes fails to correctly render some GPT-4o output (e.g. LaTeX expressions), but it's mostly fixed with a custom system prompt. It also significantly reduces the battery life of my MacBook M1, but that's expected.
"All you need is users" doesn't seem optimal IMHO, Stability.ai providing an object lesson in that.
They just released weights, and being a for-profit, they need to optimize for making money, not eyeballs. It seems wise to guide people to the API offering.
On top of Hacker News (the target demographic for coders) without an effective monetizable call to action? What a missed opportunity.
GitHub Copilot makes $100M+/year, if not way, way more.
Having a VS Code extension for Mistral would be a revenue stream if it were one-click and better or cheaper than GitHub Copilot. In my mind it is malpractice not to be doing this if you are investing in creating coding models.
I see, that makes sense: make an extension and charge for it.
I assumed they meant free and local. It doesn't seem rational to make this one paid: it's significantly smaller than their better model, and even more so than Copilot's.
But they also signal competence in the space, which means M&A. Or big nation-states could hire them in the future to produce country models once the space matures, as was Emad's vision.
If you believe LLMs are going to end up built into everything and doing everything, from moderating social media to writing novels and history books, making such a model will be the most political thing that has ever happened.
If your country believes guns=bad, nipples=good, war=hell, but you get your novels and history books written by an LLM trained by people who believe guns=good, nipples=bad, war=heroic, it would be naive to expect the output to reflect your values and not theirs.
Even close allies of the US would be nervous to have such power in the hands of American multinational corporations alone - so the French state could be very eager for Mistral to produce a competitive product.
More or less; it was about as serious as your median Elon product tweet the last decade, or median coin nonsense.
It was a half-baked idea that the models would obviously need to be tuned for different languages / specific knowledge, and that countries would therefore pay to have that done.
There were many ideas like that, none of them panned out, hence the defenestration. All love for the guy, he did a very, very good thing. It's just meaningless to invoke it here: not only is it completely off-topic (if anything, that's already the play as the EU champion), but the Stability gentleman was just thinking out loud, nothing more.
I feel like local models could be an amazing coding experience because you could disconnect from the internet. Usually I need to open ChatGPT or Google every so often to solve some issue or generate some function, but this also introduces so many distractions. Imagine being able to turn off the internet completely and only have a chat assistant that runs locally. I fear, though, that it is just going to be a bit too slow at generating tokens on CPU to not be annoying.
I don't have a gut feel for how much difference the Mamba arch makes to inference speed, nor how much quantisation is likely to ruin things, but as a rough comparison Mistral-7B at 4 bits per param is very usable on CPU.
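For a concrete idea of what that looks like, here is a rough sketch of CPU-only inference on a 4-bit GGUF quant with llama-cpp-python. The model path is a placeholder, and Codestral Mamba itself won't work this way until llama.cpp grows Mamba2 support, so this is really the Mistral-7B comparison:

    # Rough sketch: 4-bit quantised Mistral-7B on CPU via llama-cpp-python
    # (pip install llama-cpp-python). The model path is a placeholder.
    from llama_cpp import Llama

    llm = Llama(
        model_path="./mistral-7b-instruct.Q4_K_M.gguf",  # placeholder path
        n_ctx=4096,    # context window
        n_threads=8,   # CPU threads; tune for your machine
    )

    out = llm.create_chat_completion(
        messages=[{"role": "user", "content": "Write a binary search in Python."}],
        max_tokens=256,
    )
    print(out["choices"][0]["message"]["content"])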
The issue with using any local model for code generation comes up when doing so in a professional context: you lose whatever infrastructure the provider might have for avoiding regurgitation of copyrighted code, so there's a legal risk there. That might not be a barrier in your context, but in my day-to-day it certainly is.
I signed up when Codestral was first available and put my payment details in. I've been using it daily since then with continue.dev, but my usage dashboard shows 0 tokens, and so far I have not been billed for anything... It's definitely not clear anywhere, but it seems to be free for now? Or there's some sort of free limit that I am not hitting.
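In case anyone wants to reproduce this: as I understand it, Codestral is served from a dedicated endpoint (codestral.mistral.ai) with its own API keys, separate from the main platform, which may be why usage isn't showing up in the usual dashboard. A rough sketch of calling it directly (endpoint, model name, and env var are my assumptions, not gospel):

    # Rough sketch: call the dedicated Codestral endpoint directly.
    # Endpoint and model name are my understanding, not confirmed;
    # the CODESTRAL_API_KEY env var is just a placeholder convention.
    import json
    import os
    import urllib.request

    body = json.dumps({
        "model": "codestral-latest",
        "messages": [{"role": "user", "content": "Explain this regex: ^[a-z]+$"}],
    }).encode()

    req = urllib.request.Request(
        "https://codestral.mistral.ai/v1/chat/completions",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer " + os.environ["CODESTRAL_API_KEY"],
        },
    )
    with urllib.request.urlopen(req) as resp:
        print(json.loads(resp.read())["choices"][0]["message"]["content"])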
The website codegpt.co also has a plugin for both VS Code and IntelliJ. When the model becomes available in Ollama, you can connect the plugin in VS Code to a local Ollama instance.