Add support for running quantized weights on 310P NPUs such as the Atlas 300I Duo cards. Currently no backend supports quantization on the 310P except MindIE, which handles only INT8 and requires a special format conversion. Quantization support would make this card the best value for local inference.