Better performance than TQ and better quality than FP16?
Am I reading this right??
Why this is not a PR for vLLM ?
Better performance than TQ and better quality than FP16?
Am I reading this right??
Why this is not a PR for vLLM ?