I’m using a 6:1 memory-compressed IQ-Matrix quant variant of GROK-1, the 314B-parameter uncensored model that Elon Musk and the xAI team published openly via Twitter/X.
I’ve got GROK-1 using 24GB of VRAM and 80GB of main system memory, running inference at an average of 11-14 tokens/second with a 4096-token context size.
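As a rough sanity check on those memory numbers, here is a minimal back-of-the-envelope sketch. It assumes Grok-1's published count of 314B parameters and assumes the 6:1 ratio is measured against 16-bit weights; both are my assumptions, and the names in the code are made up for illustration. It counts weights only and ignores the KV cache, activations, and runtime overhead.

```python
def weights_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 314e9        # assumption: Grok-1's published parameter count
BASELINE_BITS = 16      # assumption: the 6:1 ratio is relative to 16-bit weights
COMPRESSION_RATIO = 6   # the 6:1 figure quoted in the post

full_gb = weights_size_gb(N_PARAMS, BASELINE_BITS)
compressed_gb = full_gb / COMPRESSION_RATIO
effective_bits = BASELINE_BITS / COMPRESSION_RATIO
budget_gb = 24 + 80     # VRAM + system RAM quoted in the post

print(f"16-bit weights:   {full_gb:.0f} GB")
print(f"6:1 compressed:   {compressed_gb:.1f} GB (~{effective_bits:.2f} bits/weight)")
print(f"VRAM + RAM used:  {budget_gb} GB")
```

Under those assumptions the compressed weights come out at roughly 105 GB, which is in the same ballpark as the 24GB + 80GB in use.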
I’ll take your advice and try to gaslight and break the model through expert testing. I’m not sure where you got the “yes-manning/non-confrontational” personality from, though. I guess that’s a corporate-standard, closed-source model thing, because GROK-1 will easily insult you, laugh at you, disagree with or threaten you, and otherwise act like a rogue AI if it dislikes what you’re saying or dislikes you as a person/user.