Online Model Distillation for Efficient Video Inference

Mullapudi, Ravi Teja; Chen, Steven; Zhang, Keyi; Ramanan, Deva; Fatahalian, Kayvon

arXiv:1812.02699v2 (cs)

[Submitted on 6 Dec 2018 (v1), last revised 27 Jan 2020 (this version, v2)]

Abstract: High-quality computer vision models typically address the problem of understanding the general distribution of real-world images. However, most cameras observe only a very small fraction of this distribution. This offers the possibility of achieving more efficient inference by specializing compact, low-cost models to the specific distribution of frames observed by a single camera. In this paper, we employ the technique of model distillation (supervising a low-cost student model using the output of a high-cost teacher) to specialize accurate, low-cost semantic segmentation models to a target video stream. Rather than learn a specialized student model on offline data from the video stream, we train the student in an online fashion on the live video, intermittently running the teacher to provide a target for learning. Online model distillation yields semantic segmentation models that closely approximate their Mask R-CNN teacher with 7 to 17$\times$ lower inference runtime cost (11 to 26$\times$ in FLOPs), even when the target video's distribution is non-stationary. Our method requires no offline pretraining on the target video stream, achieves higher accuracy and lower cost than solutions based on flow or video object segmentation, and can exhibit better temporal stability than the original teacher. We also provide a new video dataset for evaluating the efficiency of inference over long-running video streams.
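The online training loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the rule-based `teacher_predict`, the one-feature logistic `Student`, the fixed `teacher_every` interval, and the synthetic "frames" (lists of scalar pixel values) are all assumptions made for illustration. The sketch shows only the control flow: the student labels every frame, while the teacher runs intermittently and its output supervises a few gradient steps on the student.

```python
import math
import random


def teacher_predict(frame):
    """Hypothetical stand-in for an expensive teacher: labels each 'pixel' by a fixed rule."""
    return [1 if x > 0.5 else 0 for x in frame]


class Student:
    """Tiny per-pixel logistic classifier standing in for a compact student model."""

    def __init__(self):
        self.w = 0.0
        self.b = 0.0

    def prob(self, x):
        return 1.0 / (1.0 + math.exp(-(self.w * x + self.b)))

    def predict(self, frame):
        return [1 if self.prob(x) > 0.5 else 0 for x in frame]

    def update(self, frame, target, lr=0.5, steps=8):
        """Take a few gradient steps on cross-entropy against the teacher's labels."""
        for _ in range(steps):
            gw = gb = 0.0
            for x, y in zip(frame, target):
                err = self.prob(x) - y
                gw += err * x
                gb += err
            self.w -= lr * gw / len(frame)
            self.b -= lr * gb / len(frame)


def agreement(a, b):
    """Fraction of pixels on which two label maps agree."""
    return sum(p == q for p, q in zip(a, b)) / len(a)


def online_distill(frames, teacher_every=4):
    """Run the student on every frame; run the teacher only on every `teacher_every`-th frame."""
    student = Student()
    teacher_calls = 0
    outputs = []
    for t, frame in enumerate(frames):
        if t % teacher_every == 0:
            target = teacher_predict(frame)  # intermittent, costly supervision
            teacher_calls += 1
            student.update(frame, target)    # online training on the live stream
        outputs.append(student.predict(frame))  # cheap inference on every frame
    return student, outputs, teacher_calls


random.seed(0)
frames = [[random.random() for _ in range(64)] for _ in range(400)]
student, outputs, teacher_calls = online_distill(frames)

# Agreement with the teacher over the last 50 frames (teacher is run here only for evaluation).
late_agreement = sum(
    agreement(o, teacher_predict(f)) for o, f in zip(outputs[-50:], frames[-50:])
) / 50
```

In this toy setting the teacher runs on one frame in four. A fixed interval is the simplest possible schedule and is used here only as a placeholder for "intermittently".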

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1812.02699 [cs.CV]
	(or arXiv:1812.02699v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.02699
Journal reference:	ICCV 2019

Submission history

From: Ravi Teja Mullapudi
[v1] Thu, 6 Dec 2018 18:29:59 UTC (8,120 KB)
[v2] Mon, 27 Jan 2020 21:57:10 UTC (8,423 KB)
