Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/48256| Title: | DPOQ: Dynamic Precision Onion Quantization | Authors: | LI, Bowen Huang, Kai CHEN, Siang Xiong, Dongliang CLAESEN, Luc |
Issue Date: | 2021 | Source: | Balasubramanian, Vineeth N.; Tsang, Ivor (Ed.). Proceedings of Machine Learning Research PMLR, p. 502 -517 | Series/Report no.: | 157 | Abstract: | With the development of deployment platforms and application scenarios for deep neural networks, traditional fixed network architectures cannot meet the requirements. Meanwhile the dynamic network inference becomes a new research trend. Many slimmable and scalable networks have been proposed to satisfy different resource constraints (e.g., storage, latency and energy). And a single network may support versatile architectural configurations including: depth, width, kernel size, and resolution. In this paper, we propose a novel network architecture reuse strategy enabling dynamic precision in parameters. Since our low-precision networks are wrapped in the high-precision networks like an onion, we name it dynamic precision onion quantization (DPOQ). We train the network by using the joint loss with scaled gradients. To further improve the performance and make different precision network compatible with each other, we propose the precision shift batch normalization (PSBN). And we also propose a scalable input-specific inference mechanism based on this architecture and make the network more adaptable. Experiments on the CIFAR and ImageNet dataset have shown that our DPOQ achieves not only better flexibility but also higher accuracy than the individual quantization | Keywords: | Dynamic Quantization;Neural Network | Document URI: | http://hdl.handle.net/1942/48256 | Category: | C1 | Type: | Proceedings Paper |
| Appears in Collections: | Research publications |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| DPQQ - Dynamic Precision Onion Quantization.pdf | Published version | 419.97 kB | Adobe PDF | View/Open |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.