What Can Robotics Research Learn from Computer Vision Research?

Corke, Peter; Dayoub, Feras; Hall, David; Skinner, John; Sünderhauf, Niko

doi:10.1007/978-3-030-95459-8_61

Part of the book series: Springer Proceedings in Advanced Robotics (SPAR, volume 20)

Included in the following conference series:

The International Symposium of Robotics Research

Abstract

The fields of computer vision and robotics are both children of the artificial intelligence program that was spawned by the Dartmouth Conference in 1956. In recent decades the fields have diverged in terms of conferences and journals, research methodology and research rate. From a robotics perspective it seems that computer vision is in the fast lane while robotics is stuck in the slow lane. Roboticists hold a fundamental belief in the importance of experimentation, but could it be that experiments are actually holding us back? Or is it that we are doing experiments poorly?

Notes

1. A lexical database of English words started in 1985 to support text analysis and NLP.
2. https://www.unrealengine.com.
3. https://www.roboticvisionchallenge.org.
4. https://github.com/jskinn/Dataset_Synthesizer.

Acknowledgements

We thank the organizers of ISRR 2019 in Hanoi for the invitation to present a first pass of these ideas in a Distinguished Talk. This research was conducted by the Australian Research Council Centre of Excellence for Robotic Vision (project number CE140100016).

Author information

Authors and Affiliations

QUT Centre for Robotics, Queensland University of Technology, Brisbane, Australia
Peter Corke, Feras Dayoub, David Hall, John Skinner & Niko Sünderhauf

Corresponding author

Correspondence to Peter Corke.

Editor information

Editors and Affiliations

Institute for Anthropomatics and Robotics, Karlsruhe Institute of Technology, Karlsruhe, Baden-Württemberg, Germany
Tamim Asfour
Department of Information Technology and Human Factors, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan
Eiichi Yoshida
Seoul National University, Seoul, Korea (Republic of)
Jaeheung Park
Jacobs School of Engineering, Institute for Contextual Robotics, San Diego, CA, USA
Henrik Christensen
Department of Computer Science, Stanford University, Stanford, CA, USA
Oussama Khatib

About this paper

Cite this paper

Corke, P., Dayoub, F., Hall, D., Skinner, J., Sünderhauf, N. (2022). What Can Robotics Research Learn from Computer Vision Research? In: Asfour, T., Yoshida, E., Park, J., Christensen, H., Khatib, O. (eds) Robotics Research. ISRR 2019. Springer Proceedings in Advanced Robotics, vol 20. Springer, Cham. https://doi.org/10.1007/978-3-030-95459-8_61

DOI: https://doi.org/10.1007/978-3-030-95459-8_61
Published: 17 February 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95458-1
Online ISBN: 978-3-030-95459-8
eBook Packages: Intelligent Technologies and Robotics (R0)
