Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2386188.2386202guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A decomposition approach for optimizing large-scale parallel image composition on multi-core MPP systems

Published: 29 March 2009 Publication History

Abstract

In recent years, multi-core processor architecture has emerged as the predominant hardware architecture for high performance computing (HPC) systems. In addition, computational nodes based on SMP (symmetric multiprocessing) and NUMA (non-uniform memory architecture) have become increasingly common. Traditional parallel image composition algorithms were not primarily designed to take advantage of the combined message passing and shared address space parallelism provided by modern massively parallel processing (MPP) systems. This therefore might result in undesirable performance loss. In this study, we have investigated the use of a simple decomposition approach to take advantage of these different hardware characteristics for optimizing the parallel image composition process. Performance evaluation was carried out on a multi-core, multi-processor architecture based T2K Open Supercomputer, and we obtained encouraging results showing the effectiveness of the proposed approach. This approach also seems promising to tackle the large-scale image composition problem on nextgeneration HPC systems where an ever increasing number of processing cores are expected.

References

[1]
AHRENS J., PAINTER J.: Efficient sort-last rendering using compression-based image compositing. In Proceedings of the 2nd Eurographics Workshop on Parallel Graphics and Visualization (1998), pp. 145-151.
[2]
AVS: Advanced Visual Systems. http://www.avs.com/.
[3]
CEI: Computational Engineering International. http://www.ensight.com/.
[4]
CHEN L., FUJISHIRO I., NAKAJIMA K.: Optimizing parallel performance of unstructured volume rendering for the earth simulator. Parallel Comput. 29, 3 (2003), 355-371.
[5]
CAVIN X., MION C., FILBOIS A.: COTS cluster-based sort-last rendering: Performance evaluation and pipelined implementation. In Proceedings of the IEEE Visualization Conference (2005), pp. 111-118.
[6]
EILEMANN S., PAJAROLA R.: Direct send compositing for parallel sort-last rendering. In Proceedings of the Eurographics Symposium on Parallel Graphics and Visualization (2007), pp. 29-36.
[7]
HSU W. M.: Segmented ray casting for data parallel volume rendering. In PRS'93: Proceedings of the 1993 Symposium on Parallel Rendering (1993), pp. 7-14.
[8]
KWAN-LIU MA AND C. W., YU H., TIKHONOVA A.: In-situ processing and visualization for ultrascale simulations. Journal of Physics: Conference Series (Proceedings of DOE SciDAC 2007 Conference) 78 (2007), 012043.
[9]
LIN C.-F., CHUNG Y.-C., YANG D.-L.: TRLE-an efficient data compression scheme for image composition of volume rendering on distributed memory multicomputers. J. Supercomput. 39, 3 (2007), 321-345.
[10]
LOMBEYDA S., MOLL L., SHAND M., BREEN D., HEIRICH A.: Scalable interactive volume rendering using off-the-shelf components. In Proceedings of the IEEE 2001 Symposium on Parallel and Large-Data Visualization and Graphics (2001), pp. 115-121.
[11]
LEE T.-Y., RAGHAVENDRA C. S., NICHOLAS J. B.: Image composition schemes for sort-last polygon rendering on 2D mesh multicomputers. IEEE Transactions on Visualization and Computer Graphics 2, 3 (1996), 202-217.
[12]
MOLNAR S., COX M., ELLSWORTH D., FUCHS H.: A sorting classification of parallel rendering. IEEE Computer Graphics and Applications 14, 4 (1994), 23-32.
[13]
MURAKI S., OGATA M., MA K.-L., KOSHIZUKA K., KAJIHARA K., LIU X., NAGANO Y., SHIMOKAWA K.: Next-generation visual supercomputing using PC Clusters with volume graphics hardware devices. In Proceedings of the 2001 ACM/IEEE Conference on Supercomputing (CDROM) (2001), pp. 51-51.
[14]
MA K.-L., PAINTER J. S., HANSEN C. D., KROGH M. F.: Parallel volume rendering using binary-swap image composition. Computer Graphics and Application 14, 4 (1994), 59-68.
[15]
NEUMANN U.: Parallel volume-rendering algorithm performance on mesh-connected multicomputers. In PRS'93: Proceedings of the 1993 Symposium on Parallel Rendering (1993), pp. 97-104.
[16]
NONAKA J., KUKIMOTO N., SAKAMOTO N., HAZAMA H., WATASHIBA Y., LIU X., OGATA M., KANAZAWA M., KOYAMADA K.: Hybrid hardware-accelerated image composition for sort-last parallel rendering on graphics clusters with commodity image compositor. In VolViS 2004: Proceedings of the IEEE/SIGGRAPH Symposium on Volume Visualization and Graphics 2004 (2004), pp. 17-24.
[17]
PUGMIRE D., MONROE L., DAVENPORT C. C., DUBOIS A., DUBOIS D., POOLE S.: NPU-based image compositing in a distributed visualization system. IEEE Transactions on Visualization and Computer Graphics 13, 4 (2007), 798-809.
[18]
PETERKA T., YU H., ROSS R., MA K.-L.: Parallel volume rendering on the IBM Blue Gene/P. In Proceedings of the Eurographics/ACM SIGGRAPH Symposium on Parallel Graphics and Visualization (2008).
[19]
RAO A. R., CECCHI G., MAGNASCO M.: High performance computing environment for multidimensional image analysis. BMC Cell Biology 8, Suppl 1 (2007), S9.
[20]
REINHARD E., HANSEN C.: A comparison of parallel compositing techniques on shared memory architectures. In Proceedings of the Third Eurographics Workshop on Parallel Graphics and Visualisation (2000), pp. 115-123.
[21]
SANO K., KOBAYASHI Y., NAKAMURA T.: Differential coding scheme for efficient parallel image composition on a PC cluster system. Parallel Comput. 30, 2 (2004), 285-299.
[22]
STOMPEL A., MA K.-L., LUM E. B., AHRENS J., PATCHETT J.: SLIC: Scheduled linear image compositing for parallel volume rendering. In PVG'03: Proceedings of the 2003 IEEE Symposium on Parallel and Large-Data Visualization and Graphics (2003), p. 6.
[23]
STRENGERT M., MAGALLÓN M., WEISKOPF D., GUTHE S., ERTL T.: Large volume visualization of compressed time-dependent datasets on GPU Clusters. Parallel Comput. 31, 2 (2005), 205-219.
[24]
T2K: T2K Open Supercomputer Alliance. http://www.open-supercomputer.org/.
[25]
TAY Y. C.: A comparison of pixel complexity in composition techniques for sort-last rendering. Journal of Parallel and Distributed Computing 62, 1 (2002), 152-171.
[26]
TAKEUCHI A., INO F., HAGIHARA K.: An improved binary-swap compositing for sort-last parallel rendering on distributed memory multiprocessors. Parallel Comput. 29, 11-12 (2003), 1745-1762.
[27]
TOP500: Top500 supercomputer sites. http://www.top500.org/.
[28]
TU T., YU H., RAMIREZ-GUZMAN L., BIELAK J., GHATTAS O., MA K.-L., O'HALLARON D. R.: From mesh generation to scientific visualization: An end-to-end approach to parallel supercomputing. In SC'06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing (2006), p. 91.
[29]
YU H., WANG C., MA K.-L.: Massively parallel volume rendering using 2-3 swap image compositing. In SC'08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing (2008), pp. 1-11.
[30]
YANG D.-L., YU J.-C., CHUNG Y.-C.: Efficient compositing methods for the sort-last-sparse parallel volume rendering system on distributed memory multicomputers. J. Supercomput. 18, 2 (2001), 201-220.

Cited By

View all
  • (2011)Revisiting parallel rendering for shared memory machinesProceedings of the 11th Eurographics conference on Parallel Graphics and Visualization10.5555/2386230.2386236(31-40)Online publication date: 10-Apr-2011

Index Terms

  1. A decomposition approach for optimizing large-scale parallel image composition on multi-core MPP systems

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Guide Proceedings
      EG PGV'09: Proceedings of the 9th Eurographics conference on Parallel Graphics and Visualization
      March 2009
      110 pages
      ISBN:9783905674156

      Sponsors

      • SimTech: SimTech
      • Intel: Intel

      Publisher

      Eurographics Association

      Goslar, Germany

      Publication History

      Published: 29 March 2009

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 10 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2011)Revisiting parallel rendering for shared memory machinesProceedings of the 11th Eurographics conference on Parallel Graphics and Visualization10.5555/2386230.2386236(31-40)Online publication date: 10-Apr-2011

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media