A new technical paper titled “QMC: Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design” was published by researchers at University of California San Diego and ...
Abstract: Vector-Quantization (VQ) based discrete generative models are widely used to learn powerful high-quality (HQ) priors for blind image restoration (BIR). In this paper, we diagnose the ...
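The excerpt above concerns Vector-Quantization (VQ) based generative priors. As background, a minimal sketch of the core VQ step — mapping each continuous vector to its nearest codeword in a learned codebook — is shown below; the codebook values and function name are illustrative, not taken from the paper.

```python
def vq_assign(vectors, codebook):
    """Map each vector to the index of its nearest codeword (squared L2),
    returning the discrete codes and their reconstructions."""
    codes, recons = [], []
    for v in vectors:
        # Squared Euclidean distance from v to every codeword.
        dists = [sum((a - b) ** 2 for a, b in zip(v, c)) for c in codebook]
        k = dists.index(min(dists))   # index of the nearest codeword
        codes.append(k)
        recons.append(codebook[k])
    return codes, recons

# Toy 2-entry codebook; real models learn codebooks with thousands of entries.
codebook = [[0.0, 0.0], [1.0, 1.0]]
x = [[0.1, -0.1], [0.9, 1.2]]
codes, xq = vq_assign(x, codebook)
print(codes)  # [0, 1]
```

In VQ-based generative models, the discrete codes (not the raw vectors) are what the prior is trained over, which is what makes the learned prior "discrete".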
China-linked APT24 hackers have been using previously undocumented malware called BadAudio in a three-year espionage campaign that has recently shifted to more sophisticated attack methods. Since 2022, ...
Huawei’s Zurich Computing Systems Laboratory has released SINQ (Sinkhorn Normalization Quantization), an open-source quantization method that reduces the memory requirements of large language models (LLMs) without sacrificing output quality ...
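To make the memory-reduction claim concrete, below is a sketch of generic round-to-nearest (RTN) weight quantization — an assumed illustration of how quantizing weights shrinks model memory, not the actual SINQ algorithm, whose Sinkhorn-normalization step is not described in the excerpt.

```python
def quantize_row(row, bits=4):
    """Symmetric per-row RTN quantization: store small ints plus one float scale.
    Illustrative only; SINQ's actual method differs."""
    qmax = 2 ** (bits - 1) - 1                      # e.g. 7 for 4-bit
    scale = max(abs(x) for x in row) / qmax or 1.0  # avoid zero scale
    q = [max(-qmax, min(qmax, round(x / scale))) for x in row]
    return q, scale

def dequantize_row(q, scale):
    """Recover approximate weights from ints and the shared scale."""
    return [v * scale for v in q]

row = [0.1, -0.7, 0.3]
q, s = quantize_row(row)
print(q)  # [1, -7, 3]
# Storage drops from 16 bits per weight to 4 bits plus one scale per row.
```

The memory saving comes from replacing each 16-bit float weight with a 4-bit integer and amortizing a single floating-point scale over the whole row.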
I would love to see svdquant quantization integrated into diffusers as a quantization backend, like the others found here: https://huggingface.co/docs/diffusers/api ...
SAN FRANCISCO--(BUSINESS WIRE)--Elastic (NYSE: ESTC), the Search AI Company, announced new performance and cost-efficiency breakthroughs with two significant enhancements to its vector search. Users ...
With the rapid development of machine learning, Deep Neural Networks (DNNs) exhibit superior performance on complex problems such as computer vision and natural language processing compared with ...