OpenAI GPT-OSS models use MXFP4 to cut inference costs

8 points | by rntn 4 months ago

No comments yet.