This is the official repository for our WACV 2022 paper, "Supervised Compression for Resource-Constrained Edge Computing Systems".
In resource-constrained edge computing systems, we often have mobile devices with weak computing resource, battery constraint, and limited communication capacity (e.g., low data rate). One of the solutions is split computing, that literally splits a trained neural network into head and tail models deployed on mobile device (weak) and edge server (stronger). Head model should be lightweight encoders and compress the input data (i.e., save both computational load on mobile device and communication cost from mobile device to edge server), and tail models decode the compressed data and complete the inference. Note that the training process is done offline (i.e., on single machines).
Our new work is built on this project, and
we significantly extended this work and defined supervised compression for split computing (SC2).
We introduced new benchmark metrics and published sc2bench
, a pip-installable Python package for SC2 problems.
We also conducted even larger-scale experiments.
The more advanced project with code and preprint are available here.
Different from a combination of input compression and supervised model (top), supervised compression (bottom) does not reconstruct the original input from the compressed representation, that usually contains information unrelated to supervised downstream tasks. In split computing, we need to relax computational load on mobile device (e.g., by designing a lightweight encoder), compress data to be transferred from weak mobile device to stronger edge server, and save the end-to-end input-to-prediction latency while preserving the original model accuracy.
We leverage a concept of neural image compression and introduced encoder and decoder to a teacher model in place of all the layers before its 2nd residual block.
Our supervised compression method consists of 1) learning to mimic a teacher's intermediate features and a prior for entropy coding to compress data to be transferred and 2) fine-tuning the decoder and remaining layers for a target downstream task so that a single encoder can serve multiple downstream tasks e.g., one-time encoding on mobile device while edge server can use the encoded data for multiple tasks with multiple tail models. In the above figure, we use an image classification with knowledge distillation as an example. The trained classifier can be used as backbone for other downstream tasks such as object detection and semantic segmentation.
@inproceedings{matsubara2022supervised,
title={{Supervised Compression for Resource-Constrained Edge Computing Systems}},
author={Matsubara, Yoshitomo and Yang, Ruihan and Levorato, Marco and Mandt, Stephan},
booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
pages={2685-2695},
month={January},
year={2022}
}
- Python >= 3.6.9
- pipenv
To replicate the supervised performance reported in the paper, download the checkpoints of our trained models from
either GitHub Release or
Baidu Wangpan (PIN: lk2f).
Unzip the downloaded file at the root directory of this repository i.e.,
- supervised-compression/
- resource/
It is highly recommended that you use a virtual environment (e.g., pipenv, anaconda). We use pipenv in this repository. Refer to Pipfile for the required packages.
# For Python 3.6 users
pipenv install --python 3.6
# For Python 3.7 users
pipenv install --python 3.7
# For Python 3.8 users
pipenv install --python 3.8
As the terms of use do not allow to distribute the URLs,
you will have to create an account here to get the URLs,
and replace ${TRAIN_DATASET_URL}
and ${VAL_DATASET_URL}
with them.
wget ${TRAIN_DATASET_URL} ./
wget ${VAL_DATASET_URL} ./
Untar and extract files
mkdir ~/dataset/ilsvrc2012/{train,val} -p
mv ILSVRC2012_img_train.tar ~/dataset/ilsvrc2012/train/
cd ~/dataset/ilsvrc2012/train/
tar -xvf ILSVRC2012_img_train.tar
mv ILSVRC2012_img_train.tar ../
for f in *.tar; do
d=`basename $f .tar`
mkdir $d
(cd $d && tar xf ../$f)
done
rm -r *.tar
mv ILSVRC2012_img_val.tar ~/dataset/ilsvrc2012/val/
wget https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh
mv valprep.sh ~/dataset/ilsvrc2012/val/
cd ~/dataset/ilsvrc2012/val/
tar -xvf ILSVRC2012_img_val.tar
mv ILSVRC2012_img_val.tar ../
sh valprep.sh
mv valprep.sh ../
Download and unzip the datasets
mkdir ~/dataset/coco2017/ -p
cd ~/dataset/coco2017/
wget http://images.cocodataset.org/zips/train2017.zip ./
wget http://images.cocodataset.org/zips/val2017.zip ./
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip ./
unzip train2017.zip
unzip val2017.zip
unzip annotations_trainval2017.zip
# Image classification
echo 'jpeg quality=100'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/jpeg-resnet50.yaml
for quality in $(seq 100 -10 20)
do
next_quality=$((quality-10))
echo 'jpeg quality=${next_quality}'
sed -i "s/jpeg_quality: ${quality}/jpeg_quality: ${next_quality}/" configs/ilsvrc2012/input_compression/jpeg-resnet50.yaml
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/jpeg-resnet50.yaml
done
# Object detection
echo 'jpeg quality=100'
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/jpeg-retinanet_resnet50_fpn.yaml
for quality in $(seq 100 -10 20)
do
next_quality=$((quality-10))
echo 'jpeg quality=${next_quality}'
sed -i "s/jpeg_quality: ${quality}/jpeg_quality: ${next_quality}/" configs/coco2017/input_compression/jpeg-retinanet_resnet50_fpn.yaml
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/jpeg-retinanet_resnet50_fpn.yaml
done
# Semantic segmentation
echo 'jpeg quality=100'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/jpeg-deeplabv3_resnet50.yaml
for quality in $(seq 100 -10 20)
do
next_quality=$((quality-10))
echo 'jpeg quality=${next_quality}'
sed -i "s/jpeg_quality: ${quality}/jpeg_quality: ${next_quality}/" configs/coco2017/input_compression/jpeg-deeplabv3_resnet50.yaml
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/jpeg-deeplabv3_resnet50.yaml
done
# Image classification
echo 'webp quality=100'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/webp-resnet50.yaml
for quality in $(seq 100 -10 20)
do
next_quality=$((quality-10))
echo 'webp quality=${next_quality}'
sed -i "s/webp_quality: ${quality}/webp_quality: ${next_quality}/" configs/ilsvrc2012/input_compression/webp-resnet50.yaml
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/webp-resnet50.yaml
done
# Object detection
echo 'webp quality=100'
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/webp-retinanet_resnet50_fpn.yaml
for quality in $(seq 100 -10 20)
do
next_quality=$((quality-10))
echo 'webp quality=${next_quality}'
sed -i "s/webp_quality: ${quality}/webp_quality: ${next_quality}/" configs/coco2017/input_compression/webp-retinanet_resnet50_fpn.yaml
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/webp-retinanet_resnet50_fpn.yaml
done
# Semantic segmentation
echo 'webp quality=100'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/webp-deeplabv3_resnet50.yaml
for quality in $(seq 100 -10 20)
do
next_quality=$((quality-10))
echo 'webp quality=${next_quality}'
sed -i "s/webp_quality: ${quality}/webp_quality: ${next_quality}/" configs/coco2017/input_compression/webp-deeplabv3_resnet50.yaml
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/webp-deeplabv3_resnet50.yaml
done
Install BPG following the instructions here.
If you do not place bpgenc and bpgdec at '~/manually_installed/libbpg-0.9.8/',
edit the encoder_path
and decoder_path
in bpg-resnet.yaml
like this.
# Image classification
echo 'bpg quality=50'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/bpg-resnet50.yaml
for quality in $(seq 50 -5 5)
do
next_quality=$((quality-5))
echo 'bpg quality=${next_quality}'
sed -i "s/bpg_quality: ${quality}/bpg_quality: ${next_quality}/" configs/ilsvrc2012/input_compression/bpg-resnet50.yaml
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/bpg-resnet50.yaml
done
# Object detection
echo 'bpg quality=50'
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/bpg-retinanet_resnet50_fpn.yaml
for quality in $(seq 50 -5 5)
do
next_quality=$((quality-5))
echo 'bpg quality=${next_quality}'
sed -i "s/bpg_quality: ${quality}/bpg_quality: ${next_quality}/" configs/coco2017/input_compression/bpg-retinanet_resnet50_fpn.yaml
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/bpg-retinanet_resnet50_fpn.yaml
done
# Semantic segmentation
echo 'bpg quality=50'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/bpg-deeplabv3_resnet50.yaml
for quality in $(seq 50 -5 5)
do
next_quality=$((quality-5))
echo 'bpg quality=${next_quality}'
sed -i "s/bpg_quality: ${quality}/bpg_quality: ${next_quality}/" configs/coco2017/input_compression/bpg-deeplabv3_resnet50.yaml
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/bpg-deeplabv3_resnet50.yaml
done
Johannes Ballé, David Minnen, Saurabh Singh, Sung Jin Hwang, and Nick Johnston. "Variational image compression with a scale hyperprior"
Make sure you have downloaded and unzipped the pretrained checkpoints for this model. The checkpoint files should be
placed at ./resource/ckpt/input_compression/
as specified in the yaml file.
# Image classification
echo 'beta=0.00015625'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/factorized_prior_ae_128ch-resnet50.yaml
prev_beta=0.00015625
for beta in 0.0003125 0.000625 0.00125 0.0025 0.005 0.01 0.02
do
echo 'beta=${beta}'
sed -i "s/beta_${prev_beta}/beta_${beta}/" configs/ilsvrc2012/input_compression/factorized_prior_ae_128ch-resnet50.yaml
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/factorized_prior_ae_128ch-resnet50.yaml
prev_beta=${beta}
done
# Object detection
echo 'beta=0.00015625'
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/factorized_prior_ae_128ch-retinanet_resnet50_fpn.yaml
prev_beta=0.00015625
for beta in 0.0003125 0.000625 0.00125 0.0025 0.005 0.01 0.02
do
echo 'beta=${beta}'
sed -i "s/beta_${prev_beta}/beta_${beta}/" configs/coco2017/input_compression/factorized_prior_ae_128ch-retinanet_resnet50_fpn.yaml
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/factorized_prior_ae_128ch-retinanet_resnet50_fpn.yaml
prev_beta=${beta}
done
# Semantic segmentation
echo 'beta=0.00015625'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/factorized_prior_ae_128ch-deeplabv3_resnet50.yaml
prev_beta=0.00015625
for beta in 0.0003125 0.000625 0.00125 0.0025 0.005 0.01 0.02
do
echo 'beta=${beta}'
sed -i "s/beta_${prev_beta}/beta_${beta}/" configs/coco2017/input_compression/factorized_prior_ae_128ch-deeplabv3_resnet50.yaml
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/factorized_prior_ae_128ch-deeplabv3_resnet50.yaml
prev_beta=${beta}
done
David Minnen, Johannes Ballé, and George D Toderici. "Joint autoregressive and hierarchical priors for learned image compression"
Make sure you have downloaded and unzipped the pretrained checkpoints for this model. The checkpoint files should be
placed at ./resource/ckpt/input_compression/
as specified in the yaml file.
# Image classification
echo 'beta=0.00015625'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/hierarchical_prior_ae_128ch-resnet50.yaml
prev_beta=0.00015625
for beta in 0.0003125 0.000625 0.00125 0.0025 0.005 0.01 0.02
do
echo 'beta=${beta}'
sed -i "s/beta_${prev_beta}/beta_${beta}/" configs/ilsvrc2012/input_compression/hierarchical_prior_ae_128ch-resnet50.yaml
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/input_compression/hierarchical_prior_ae_128ch-resnet50.yaml
prev_beta=${beta}
done
# Object detection
echo 'beta=0.00015625'
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/hierarchical_prior_ae_128ch-retinanet_resnet50_fpn.yaml
prev_beta=0.00015625
for beta in 0.0003125 0.000625 0.00125 0.0025 0.005 0.01 0.02
do
echo 'beta=${beta}'
sed -i "s/beta_${prev_beta}/beta_${beta}/" configs/coco2017/input_compression/hierarchical_prior_ae_128ch-retinanet_resnet50_fpn.yaml
pipenv run python object_detection.py -test_only --config configs/coco2017/input_compression/hierarchical_prior_ae_128ch-retinanet_resnet50_fpn.yaml
prev_beta=${beta}
done
# Semantic segmentation
echo 'beta=0.00015625'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/hierarchical_prior_ae_128ch-deeplabv3_resnet50.yaml
prev_beta=0.00015625
for beta in 0.0003125 0.000625 0.00125 0.0025 0.005 0.01 0.02
do
echo 'beta=${beta}'
sed -i "s/beta_${prev_beta}/beta_${beta}/" configs/coco2017/input_compression/hierarchical_prior_ae_128ch-deeplabv3_resnet50.yaml
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/input_compression/hierarchical_prior_ae_128ch-deeplabv3_resnet50.yaml
prev_beta=${beta}
done
If you want to train models yourself by the baselines, exclude -test_only
from the following commands.
Yoshitomo Matsubara, Marco Levorato. "Neural Compression and Filtering for Edge-assisted Real-time Object Detection in Challenged Networks"
Make sure you have downloaded and unzipped the pretrained checkpoints for this model. The checkpoint files should be
placed at ./resource/ckpt/ilsvrc2012/bq/
and ./resource/ckpt/coco2017/bq/
as specified in the yaml file.
# Image classification
for bch in 12 9 6 3 2
do
echo 'bottleneck channel=${bch}'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/bq/custom_resnet50-bq${bch}ch_from_resnet50.yaml
done
# Object detection
for bch in 6 3 2 1
do
echo 'bottleneck channel=${bch}'
pipenv run python object_detection.py -test_only --config configs/coco2017/bq/custom_retinanet_resnet50_fpn-bq${bch}ch_from_retinanet_resnet50_fpn.yaml
done
# Semantic segmentation
for bch in 9 6 3 2 1
do
echo 'bottleneck channel=${bch}'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/bq/custom_deeplabv3_resnet50_bq${bch}ch_from_deeplabv3_resnet50.yaml
done
Saurabh Singh, Sami Abu-El-Haija, Nick Johnston, Johannes Ballé, Abhinav Shrivastava, George Toderici. "End-to-End Learning of Compressible Features"
Make sure you have downloaded and unzipped the pretrained checkpoints for this model. The checkpoint files should be
placed at ./resource/ckpt/ilsvrc2012/singh_et_al/
and ./resource/ckpt/coco2017/shared_singh_et_al/
as specified in the yaml file.
# Image classification
for beta in 5.0e-11 1.0e-10 2.0e-10 4.0e-10 8.0e-10 1.6e-9 6.4e-9 1.28e-8
do
echo 'beta=${beta}'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/singh_et_al/bottleneck_resnet50-b24ch_igdn-beta${beta}_from_resnet50.yaml
done
# Object detection
for beta in 8.0e-10 1.6e-9 6.4e-9 1.28e-8
do
echo 'beta=${beta}'
pipenv run python object_detection.py -test_only --config configs/coco2017/shared_singh_et_al/retinanet_bottleneck_resnet50-b24ch_igdn-beta${beta}_fpn.yaml
done
# Semantic segmentation
for beta in 8.0e-10 1.6e-9 6.4e-9 1.28e-8
do
echo 'beta=${beta}'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/shared_singh_et_al/deeplabv3_bottleneck_resnet50-b24ch_igdn-beta${beta}.yaml
done
Make sure you have downloaded and unzipped the pretrained checkpoints for this model. The checkpoint files should be
placed at ./resource/ckpt/ilsvrc2012/entropic_student/
and ./resource/ckpt/coco2017/shared_entropic_student/
as specified in the yaml file.
If you want to train models yourself by the baselines, exclude -test_only
from the following commands.
# Image classification
for beta in 0.01 0.02 0.04 0.08 0.16 0.32 0.64 1.28 2.56
do
echo 'beta=${beta}'
pipenv run python image_classification.py -test_only --config configs/ilsvrc2012/entropic_student/bottleneck_resnet50-b24ch_igdn-beta${beta}_from_resnet50.yaml
done
# Object detection
for beta in 0.04 0.08 0.16 0.32 0.64 1.28 2.56
do
echo 'beta=${beta}'
pipenv run python object_detection.py -test_only --config configs/coco2017/shared_entropic_student/retinanet_bottleneck_resnet50-b24ch_igdn-beta${beta}_fpn_from_retinanet_resnet50_fpn.yaml
done
# Semantic segmentation
for beta in 0.04 0.08 0.16 0.32 0.64 1.28 2.56
do
echo 'beta=${beta}'
pipenv run python semantic_segmentation.py -test_only --config configs/coco2017/shared_entropic_student/deeplabv3_bottleneck_resnet50-b24ch_igdn-beta${beta}_from_deeplabv3_resnet50.yaml
done