9 comments

  • viraptor 1 hour ago
    I've played with this a bit and it's ok. I'd place it somewhere around sonnet 4.5 level, probably below. But with this aggressive pricing you can just run 3 copies to do the same thing, choose the one that succeeded and still come out way ahead with the cost. Not as great as following instructions as Claude models and can get lost, but still "good enough".

    I'm very happy with using it to just "do things". When doing in depth debugging or a massive plan is needed, I'd go with something better, but later going through the motions? It works.

  • jondwillis 2 hours ago
    > MiniMax has been continuously transforming itself in a more AI-native way. The core driving forces of this process are models, Agent scaffolding, and organization. Throughout the exploration process, we have gained increasingly deeper understanding of these three aspects. Today we are releasing updates to the model component, namely MiniMax M2.1, hoping to help more enterprises and individuals find more AI-native ways of working (and living) sooner.

    This compresses to: “We are updating our model, MiniMax, to 2.1. Agent harnesses exist and Agents are getting more capable.”

    A good model and agent harness, pointed at the task of writing this post, might suggest less verbosity and complexity— it comes off as fake and hype-chasing to me, even if your model is actually good. I disengage there.

    I saw yall give a lightning talk recently and it was similarly hype-y. Perhaps this is a translation or cultural thing.

    • tw1984 1 hour ago
      so when MiniMax released a pretty capable model, you choose to ignore the model itself and just focus a single sentence they wrote in the release note and started bad mouthing it.

      is it a cultural thing?

      • simlevesque 24 minutes ago
        If I use a software I need to trust it.
    • zaptrem 1 hour ago
      Not sure it’s a cultural thing since most of the copy coming out of DeepSeek has been pretty straightforward.
  • esafak 1 hour ago
    > It exhibits consistent and stable results in tools such as Claude Code, Droid (Factory AI), Cline, Kilo Code, Roo Code, and BlackBox, while providing reliable support for Context Management mechanisms including Skill.md, Claude.md/agent.md/cursorrule, and Slash Commands.

    One of the demos shows them using Claude Code, which is interesting. And the next sections are titled 'Digital Employee' and 'End-to-End Office Automation'. Their ambitions obviously go beyond coding. A sign of things to come...

    • jimmydoe 0 minutes ago
      they are going IPO in HKEX in a few weeks. some hype up are necessary, not too far fetched imo, pretty much same as anthropic playbook.
  • tomcam 1 hour ago
    I still can’t figure out what it does
    • yinuoli 45 minutes ago
      It's a neural network model, and it could generate text following a given text.
    • esafak 1 hour ago
      It's an LLM for coding.
    • prmph 1 hour ago
      You are not alone
  • Invictus0 18 minutes ago
    How is everyone monitoring the skill/utility of all these different models? I am overwhelmed by how many they are, and the challenge of monitoring their capability across so many different modalities.
  • mr_o47 1 hour ago
    I won't say it's same on the level of claude models but it's definitely good at coming up with frontend designs
  • jdright 2 hours ago
  • p-e-w 2 hours ago
    One of the cited reviews goes:

    “We're excited for powerful open-source models like M2.1 […]”

    Yet as far as I can tell, this model isn’t open at all. Not even open weights, nevermind open source.

    • viraptor 1 hour ago
      It's scheduled for release. They jumped the gun with the news. But at far as we know, it's still coming out, just like M2.
      • p-e-w 1 hour ago
        I don’t get it. What’s the holdup? Uploading a model to Hugging Face isn’t exactly difficult.
    • bearjaws 1 hour ago
      Yeah I don't see anyway to download this, ollama has it as cloud only.
  • monster_truck 2 hours ago
    That they are still training models against Objective-C is all the proof you need that it will outlive Swift.

    When is someone going to vibe code Objective-C 3.0? Borrowing all of the actual good things that have happened since 2.0 is closer than you'd think thanks to LLVM and friends.

    • viraptor 1 hour ago
      Why would they not? Existing objective-c apps will still need updates and various work. Models are still trained on assembler for architectures that don't meaningfully exist today as well.