Thanks

by mradermacher - opened Nov 9, 2024

Nov 9, 2024

My go-to model for writing has been QuartetAnemoi, and it was very frustrating to see generations of models come and go with disappointing performance.

Well... it seems I found a replacement, or at the very least an alternative. And I tried quite a few models :) It performs very well even with completely custom prompting.

crestf411

Owner Nov 9, 2024

Thanks for the feedback! It should work quite well for writing, so I’m glad to hear it’s working out.

mradermacher

Nov 9, 2024

It's not so much the prose quality. It's absolutely basic things that it gets right, such as understanding that instructions given by the writer are not known to the people in the story, and not confusing pronouns or people when there are more than two around, something chat models have trouble with, including your previous sunfall and story writer models. I didn't think it was possible with L3.1.

I suspect that my way of using models is totally unlike how other people use it, even for story writing.

But whatever it is, this model rocks, for me. At last, an alternative :)

crestf411

Owner Nov 9, 2024

Very cool! I’ve not had much luck the last 6+ months in general so the encouraging words are welcome. Thanks!

mradermacher

Nov 10, 2024

Well, at least I am always looking forward to a new crestf411 model every time I see one pop up in my list. And people often don't give positive feedback (nothing to complain about).

Maybe your lack of luck is due to lack of good and easy to fine tune base models recently?

crestf411

Owner Nov 10, 2024

Thanks for the kind words again! Yes I absolutely believe you’re right. I also think other model makers are incredibly strong right now and competition is harder than ever. A good thing, esp for users!

mradermacher

Nov 18, 2024

I have played around with this one quite a bit in recent days. Shivering spines are strong (i.e. lots of slop). More importantly, it destroyed all my stories mid-writing because I switched to it, and now suddenly the model follows all my instructions regarding style, language and more, and I suddenly have to tune down things to a reasonable level to get... a reasonable result. It's almost uncanny how I suddenly realise there is some unwanted content because... it actually followed my instructions, while previous models simply ignored most of them, most of the time.

It's so good that I feel I have to completely start over :)

Maybe it's to be expected with a L3.1-based model - being better than miqu - it's just that I tried a lot of l3.1 (and also miqu) finetunes, and they were all worse. Before, I would usually use QuartetAnemoi, sometimes mix things up with other miqu-based models or other models, just to return because they can't seem to follow simple prompts.

So... it's not perfect, but it totally restored my faith in llms (for story writing :) again, and is currently the top and only model I use for that.

crestf411

Owner Nov 18, 2024

Are you making any of this public anywhere? I would love to see what you're doing.

mradermacher

Nov 19, 2024

I do publish my stories, but for my sanity and my career or so, I keep it completely separate from this account, and hope nobody will ever make the connection... :)

crestf411

Owner Dec 2, 2024

This comment has been hidden

mradermacher

12 days ago

So... I did fairly much with this model, and it's still my default go-to model. But I found that it's kind of crap (for me) when you use it with llama3 prompting. With llama3 prompt format it makes an astonishing number of logical errors and fails to follow my "simple" prompts. It took me a while to realise this, and this model is not the only one where this happens, but it is pretty dramatic, not only ignoring my instructions, but also becoming extremely repetitive.

I think the reason is that my way of using it to write stories is so unlike anything it knows that it works best when I use it if it were a non-instruct base model, i.e. as a text completer, although I use it in instruct mode with user/model pairs. It works stellar when I use some made-up prompt format such as "\ninstructions:\n" and "\nnarrator:\n" or some variant of alpaca, I think because I basically force it out of the normal assistant/chat mode.

Very strange. But after many megabytes of generated stories, I think it's not a fluke.

Anyways, thanks again for your work - I've since experimented with some L3.3 based models, and while they also work quite well, this one is still behaving better.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment