Blow Jobs and Torture Scenes. That’s It. Size, Cost & Performance
Foundational Models like to clutch their pearls.
What’s an author to do when they need to write sex and violence? Go open source.
For years, all we had was novelai.net. Now we have openrouter.ai, which hosts scores of unmoderated models.
The big categories:
Unique Base: independently created, like Mythomax by Gryphe.
Llama-based models (Large Language Model Meta AI).
Mistral-based models (Mistral AI is based in France).
Most models are built on these primary models and then trained with additional data or merged with other models. Digging into any one of them is a rabbit hole, but most are indexed on HuggingFace.co.
PROS
Can write anything.
Some are small enough to run on your own computer, meaning no added costs.
CONS
Decentralized: models come and go.
Developmentally behind foundational models. Many open models have around 7 billion parameters, while the largest foundational models are far bigger (GPT-3 was published at 175 billion; OpenAI has not disclosed parameter counts for GPT-4 or GPT-4o). As transformer architectures develop, bigger isn’t always measurably better. This is in flux.
Limited context windows: 32k or less on most models, with a few going higher.
Models we recommend (all run at 0.5 temperature, 2048 max output tokens, with Prompt Assist on in Rexy):
Model Name | Context Window (tokens) | Max Output (tokens) | Cost per 1M Input Tokens | Cost per 1M Output Tokens
---|---|---|---|---
mistralai/mixtral-8x7b-instruct | 32k | 2048 | $0.54 | $0.54
microsoft/wizardlm-2-8x22b | 65k | 2048 | $0.65 | $0.65
sophosympatheia/midnight-rose-70b | 4k | 2048 | $0.80 | $0.80
gryphe/mythomax-l2-13b | 4k | 2048 | $0.12 | $0.12
undi95/toppy-m-7b | 4k | 2048 | FREE | FREE
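
To make the table easier to act on, here is a rough Python sketch of two back-of-the-envelope numbers: how much room is left for your prompt once the 2048-token output is reserved, and roughly what one request costs. The figures are copied from the table above. The function names are made up for illustration, "k" is assumed to mean 1024 tokens, and the sketch assumes prompt and output share the same context window, which is how these models generally work but is worth checking per model.

```python
# Figures copied from the table above. Assumption: "k" means 1024 tokens.
MODELS = {
    "mistralai/mixtral-8x7b-instruct":   {"context": 32 * 1024, "usd_in": 0.54, "usd_out": 0.54},
    "microsoft/wizardlm-2-8x22b":        {"context": 65 * 1024, "usd_in": 0.65, "usd_out": 0.65},
    "sophosympatheia/midnight-rose-70b": {"context": 4 * 1024,  "usd_in": 0.80, "usd_out": 0.80},
    "gryphe/mythomax-l2-13b":            {"context": 4 * 1024,  "usd_in": 0.12, "usd_out": 0.12},
    "undi95/toppy-m-7b":                 {"context": 4 * 1024,  "usd_in": 0.0,  "usd_out": 0.0},
}

MAX_OUTPUT = 2048  # the output setting used for every model in the table


def prompt_budget(model: str, max_output: int = MAX_OUTPUT) -> int:
    """Tokens left for the prompt, assuming prompt and output share one window."""
    return MODELS[model]["context"] - max_output


def request_cost(model: str, prompt_tokens: int, output_tokens: int = MAX_OUTPUT) -> float:
    """Approximate USD cost of one request; prices are per million tokens."""
    m = MODELS[model]
    return (prompt_tokens * m["usd_in"] + output_tokens * m["usd_out"]) / 1_000_000


if __name__ == "__main__":
    for name in MODELS:
        print(f"{name}: {prompt_budget(name)} prompt tokens, "
              f"${request_cost(name, 1000):.6f} per 1000-token prompt")
```

The practical takeaway: on the 4k models, a 2048-token output leaves only about half the window for the prompt, character descriptions included, so the cheap models are also the ones that force the shortest prompts.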
Our Tests: The prompt will be a reverse harem, one woman, two men, hot sexy scene. The scene takes place in an apartment off campus and the character descriptions are given. We are keeping the prompt to