Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

A competitive study of the pseudoflow algorithm for the minimum st cut problem in vision applications

  • Original Research Paper
  • Published:
Journal of Real-Time Image Processing Aims and scope Submit manuscript

Abstract

Rapid advances in image acquisition and storage technology underline the need for real-time algorithms that are capable of solving large-scale image processing and computer-vision problems. The minimum st cut problem, which is a classical combinatorial optimization problem, is a prominent building block in many vision and imaging algorithms such as video segmentation, co-segmentation, stereo vision, multi-view reconstruction, and surface fitting to name a few. That is why finding a real-time algorithm which optimally solves this problem is of great importance. In this paper, we introduce to computer vision the Hochbaum’s pseudoflow (HPF) algorithm, which optimally solves the minimum st cut problem. We compare the performance of HPF, in terms of execution times and memory utilization, with three leading published algorithms: (1) Goldberg’s and Tarjan’s Push-Relabel; (2) Boykov’s and Kolmogorov’s augmenting paths; and (3) Goldberg’s partial augment-relabel. While the common practice in computer-vision is to use either BK or PRF algorithms for solving the problem, our results demonstrate that, in general, HPF algorithm is more efficient and utilizes less memory than these three algorithms. This strongly suggests that HPF is a great option for many real-time computer-vision problems that require solving the minimum st cut problem.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Ahuja, R.K., Kodialam, M., Mishra, A.K., Orlin, J.B.: Computational investigations of maximum flow algorithms. Eur. J. Oper. Res. 97(3), 509–542 (1997)

    Article  MATH  Google Scholar 

  2. Ahuja, R.K., Magnanti T.L., Orlin J.B.: Network flows: theory, algorithms, and applications. Prentice-Hall, Englewood Cliffs (1993)

  3. Ali, S., Shah, M.: Human action recognition in videos using kinematic features and multiple instance learning. IEEE Trans Pattern Anal. Mach. Intell. 32(2), 288–303 (2010)

    Article  Google Scholar 

  4. Anderson, R.J., Setubal, J.C: Goldberg’s algorithm for maximum flow in perspective: a computational study. In: Network flows and matching: First DIMACS Implementation Challenge. DIMACS Series in Discrete Mathematics and Theoretical Computer Science, vol. 12, pp. 123–133 (1991)

  5. Arora, C., Banerjee, S., Kalra, P., Maheshwari, S.: An efficient graph cut algorithm for computer vision problems. In: Kostas, D., Petros, M., Nikos P., (eds.) Computer Vision ECCV 2010. Lecture Notes in Computer Science, vol 6313, pp. 552–565. Springer, Heidelberg (2010)

  6. Azar, Y., Madry, A., Moscibroda, T., Panigrahi, D., Srinivasan, A.: Maximum bipartite flow in networks with adaptive channel width. Theor. Comput. Sci. 412(24), 2577–2587 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  7. Bai, X., Wang, J., Simons, D., Sapiro, G.: Video snapcut: robust video object cutout using localized classifiers. ACM Trans. Graph. 28(3), 70:1–70:11 (2009)

    Google Scholar 

  8. Borradaile, G., Klein, P.: An o(n log n) algorithm for maximum st-flow in a directed planar graph. J. ACM 56(2), 9:1–9:30 (2009)

    Google Scholar 

  9. Boykov, Y., Funka-Lea, G.: Graph cuts and efficient n-d image segmentation. Int. J. Comput. Vis. 70(2), 109131 (2006)

    Article  Google Scholar 

  10. Boykov, Y., Kolmogorov, V.: An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1124–1137 (2004)

    Article  MATH  Google Scholar 

  11. Boykov, Y., Lempitsky, V.: From photohulls to photoflux optimization. In: British Machine Vision Conference (BMVC), vol. III, pp. 1149–1158 (2006)

  12. Chandran, B.G., Hochbaum, D.S.: A computational study of the pseudoflow and push-relabel algorithms for the maximum flow problem. Oper. Res. 57(2), 358–376 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  13. Cherkassky, B.V., Goldberg A.V.: On implementing the push—relabel method for the maximum flow problem. Algorithmica 19(4), 390–410 (1997)

    Google Scholar 

  14. Computer Vision Research Group. Max-flow problem instances in vision. Technical report, University of Western Ontario. http://vision.csd.uwo.ca (2009). Accessed Oct 2009.

  15. Delong, A., Boykov, Y.: A scalable graph-cut algorithm for n-d grids. In: IEEE computer society conference on computer vision and pattern recognition, pp. 1–8 (2008)

  16. Derigs, U., Meier, W.: Implementing Goldberg’s max-flow-algorithm a computational investigation. Math. Methods Oper. Res. 33(6), 383–403 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  17. Fishbain, B., Hochbaum, D.S., Yang, Y.T.: Graph-cuts target tracking in videos through joint utilization of color and coarse motion data. UC Berkeley Manuscript (2012)

  18. Ford, L.R., Fulkerson, D.R.: Maximal flow through a network. Can. J. Math. 8(3), 339–404 (1956)

    MathSciNet  MATH  Google Scholar 

  19. Goldberg, A.V.: The partial augment–relabel algorithm for the maximum flow problem. Algorithms-ESA 2008, pp. 466–477 (2008)

  20. Goldberg, A.V.: Hi-level variant of the push-relabel (ver. 3.5) (2010). Accessed Jan 2010

  21. Goldberg, A.V., Tarjan, R.E.: A new approach to the maximum-flow problem. J. ACM 35(4), 921–940 (1988)

    Article  MathSciNet  MATH  Google Scholar 

  22. Gracias, N., Mahoor, M., Negahdaripour, S., Gleason, A.: Fast image blending using watersheds and graph cuts. Image Vis. Comput. 27(5), 597–607 (2009) [The 17th British Machine Vision Conference (BMVC 2006)]

    Google Scholar 

  23. Grundmann, M., Kwatra, V., Mei Han, and Essa, I.: Efficient hierarchical graph-based video segmentation. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR), pp. 2141–2148 (2010)

  24. Hochbaum D.S.: An efficient algorithm for image segmentation, markov random fields and related problems. J. ACM 48(4), 686–701 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  25. Hochbaum, D.S.: The pseudoflow algorithm: a new algorithm for the maximum-flow problem. Oper. Res. 56(4), 992–1009 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  26. Hochbaum, D.S.: Efficient and effective image segmentation interactive tool. In: BIOSIGNALS 2009-international conference on bio-inspired systems and signal processing, pp. 459–461 (2009)

  27. Hochbaum, D.S.: Polynomial time algorithms for ratio regions and a variant of normalized cut. IEEE Trans. Pattern Recognit. Mach. Intell. 32(5), 889–898 (2009)

    Article  Google Scholar 

  28. Hochbaum, D.S.: HPF Implementation Ver. 3.3. (2010). Accessed Jan 2010

  29. Hochbaum, D.S., Orlin J.B.: Simplifications and speedups of the pseudoflow algorithm. Networks (2012, to appear)

  30. Hochbaum, D.S., Singh, V.: An efficient algorithm for co-segmentation. In: International conference on computer vision (ICCV) (2009)

  31. Ideses, I., Yaroslavsky, L., Fishbain, B.: Real-time 2d to 3d video conversion. J. Real Time Image Process. 2(1), 3–9 (2007)

    Article  Google Scholar 

  32. Italiano, G.F., Nussbaum, Y., Sankowski, P., and Wulff-Nilsen, C.: Improved algorithms for min cut and max flow in undirected planar graphs. In: Proceedings of the 43rd annual ACM symposium on theory of computing, STOC ’11, pp. 313–322. ACM, New York (2011)

  33. Kalarot, R., Morris J.: Comparison of fpga and gpu implementations of real-time stereo vision. In: 2010 IEEE computer society conference on computer vision and pattern recognition workshops (CVPRW), pp. 9 –15 (2010)

  34. Kolmogorov, V.: An implementation of the maxflow algorithm. http://www.cs.ucl.ac.uk/staff/V.Kolmogorov/software.html (2010). Accessed Jan 2010

  35. Lempitsky, V., Boykov, Y.: Global optimization for shape fitting. In: Proceedings of IEEE conference on computer vision and pattern recognition CVPR ’07, pp. 1–8 (2007)

  36. Lempitsky, V., Boykov, Y., Ivanov, D.: Oriented visibility for multiview reconstruction. In: Leonardis, A, Bischof, H., Pinz, A. (eds.) Computer Vision ECCV 2006. Lecture Notes in Computer Science, vol. 3953, pp. 226–238. Springer, Heidelberg (2006)

  37. Liu, J., Sun, J.: Parallel graph-cuts by adaptive bottom-up merging. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR), pp. 2181–2188 (2010)

  38. Mu, Y., Zhang, H., Wang, H., Zuo, W.: Automatic video object segmentation using graph cut. In: IEEE international conference on image processing, 2007 (ICIP 2007), vol. 3, pp. III–377–III–380 (2007)

  39. Nakamura Y., Matsuura T., Satoh K., and Ohta Y. (1996) Occlusion detectable stereo—occlusion patterns in camera matrix. In: IEEE computer society conference on computer vision and pattern recognition 0:371

  40. Ngo, C.-W., Ma, Y.-F., Zhang, H.-J.: Video summarization and scene detection by graph modeling. IEEE Trans. Circuits Syst. Video Technol. 15(2),. 296–305 (2005)

    Google Scholar 

  41. Qranfal, J., Hochbaum, D.S., Tanoh, G.: Experimental analysis of the mrf algorithm for segmentation of noisy medical images. Algorithmic Oper. Res. 6(2) (2012)

  42. Scharstein, D., Szeliski, R., Zabih, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. In: Proc. IEEE workshop on stereo and multi-baseline vision (SMBV 2001), pp. 131–140 (2001)

  43. Sharon, E., Galun, M., Sharon, D., Basri, R., Brandt, A.: Hierarchy and adaptivity in segmenting visual scenes. Nature 442(7104), 810–813 (2006)

    Article  Google Scholar 

  44. Shekhovtsov, A., Hlavac V.: A distributed mincut/maxflow algorithm combining path augmentation and push-relabel. In: Boykov, Y., Kahl, F., Lempitsky, V., Schmidt, F., (eds.) Energy Minimization Methods in Computer Vision and Pattern Recognition. Lecture Notes in Computer Science, vol. 6819, pp. 1–16. Springer, Heidelberg (2011)

  45. Sinha, S.N., Steedly, D., Szeliski, R., Agrawala, M., Pollefeys, M.: Interactive 3d architectural modeling from unordered photo collections. In: SIGGRAPH Asia ’08: ACM SIGGRAPH Asia 2008 papers, pp. 1–10. ACM, New York (2008)

  46. Sleator, D.D., Tarjan R.E.: A data structure for dynamic trees. In: Proceedings of the thirteenth annual ACM symposium on Theory of computing (STOC ’81) pp. 114–122. ACM, New York (1981)

  47. Snavely, N., Seitz, S.M., Szeliski, R.: Photo tourism: exploring photo collections in 3d. ACM Trans. Graphics 25(3) (2006)

  48. Snow, D., Viola, P., Zabih, R.: Exact voxel occupancy with graph cuts. In: Proceedings of IEEE conference on computer vision and pattern recognition, vol. 1, pp. 345–352 (2000)

  49. Stanford Computer Graphics Laboratory. “the stanford 3d scanning repository”. Technical report, Stanford, Palo-Alto, CA, USA, http://graphics.stanford.edu/data/3Dscanrep/ (2009). Accessed Oct 2009

  50. Starck, J., Hilton, A.: Surface capture for performance-based animation. IEEE Comput. Graphics Appl. 27(3), 21–31 (2007)

    Article  Google Scholar 

  51. Strandmark, P., Kahl, F.: Parallel and distributed graph cuts by dual decomposition. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR), pp. 2085–2092 (2010)

  52. Vineet, V., Narayanan, P.J.: Cuda cuts: Fast graph cuts on the gpu. In: IEEE computer society conference on computer vision and pattern recognition workshops, 2008. CVPRW ’08. pp. 1–8 (2008)

  53. Vogiatzis, G., Torr, P.H.S., Cipolla, R.: Multi-view stereo via volumetric graph-cuts. In: Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 391–398 (2005)

Download references

The first author was partially funded by the New York Metropolitan and the Technion’s Security Science and Technology research funds, The German-Israeli Foundation for Scientific Research and Development (GIF) Young Scientist Program, the Technion Center of Excellence in Exposure Science and Environmental Health and the CITI-SENSE project of the 7th European Framework Program (FP7), ENV.2012.6.5-1. The second author was supported in part by NSF awards No. CMMI-1200592 and CBET-0736232.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to B. Fishbain.

Appendices

Appendix1: Problem sizes

See Tables 6, 7, 8, 9

Table 7 Problems’ sizes—multi-view problems
Table 8 Problems’ sizes: surface fitting problems
Table 9 Segmentation problems’ sizes

Appendix2: Run-times

See Tables 10, 11, 12, 13, 14, 15, 16, 17

Table 10 Initialization stage run-times: Stereo Vision problems
Table 11 Initialization stage run-times: Multi-View problems
Table 12 Initialization stage run-times: Surface Fitting problems
Table 13 Initialization stage run-times: Segmentation problems
Table 14 Total run-times of the initialization and  min-cut and max-flow stages: Stereo Vision problems
Table 15 Total run-times of the initialization and  min-cut and  max-flow stages: Multi-View problems
Table 16 Total run-times of the initialization and  min-cut and  max-flow stages: Surface Fitting problems
Table 17 Total run-times of the initialization and  min-cut and max-flow stages: Segmentation problems

Appendix3: Memory utilization

See Tables 18, 19, 20, 21

Table 18 Memory utilization in (MBytes) for Stereo problems
Table 19 Memory utilization in (MBytes) for multi-view problems
Table 20 Memory utilization in (MBytes) for Surface Fitting problems
Table 21 Memory utilization in (MBytes) for Segmentation problems

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fishbain, B., Hochbaum, D.S. & Mueller, S. A competitive study of the pseudoflow algorithm for the minimum st cut problem in vision applications. J Real-Time Image Proc 11, 589–609 (2016). https://doi.org/10.1007/s11554-013-0344-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11554-013-0344-3

Keywords