Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
survey

Cost Optimization for Cloud Storage from User Perspectives: Recent Advances, Taxonomy, and Survey

Published: 13 July 2023 Publication History
  • Get Citation Alerts
  • Abstract

    With the development and maturity of cloud storage, it has attracted a large number of users. Although cloud users do not need to concern themselves with the infrastructure used for storage, thus saving on equipment and maintenance costs, the sheer volume of data still generates significant cloud storage usage costs, which motivates cloud storage users to look for ways to further save costs. In this article, we analyze the whole process of using cloud storage to exhaustively explore opportunities, motivations, and challenges of cost optimization from user perspectives. Then we provide a comprehensive taxonomy and summary of recent advances in terms of storage efficiency (i.e., cost optimization by improving storage efficiency), cloud storage services (i.e., cost optimization by leveraging the features of cloud storage services), and emerging storage paradigms (i.e., cost optimization by leveraging emerging storage paradigms like edge storage). Finally, we present future directions for cost optimization from user perspectives and present our conclusion. This article offers a thorough survey of recent advances focusing on how to optimize the cost of using cloud storage for cloud users, and it has an opportunity to attract a broad audience in the cost-effective cloud storage market.

    References

    [1]
    Edge. 2022. Edge Networking. Retrieved June 22, 2022 from https://edge.network.
    [2]
    Hussam Abu-Libdeh, Lonnie Princehouse, and Hakim Weatherspoon. 2010. RACS: A case for cloud storage diversity. In Proceedings of ACM SoCC. 229–240.
    [3]
    Ahmed Ali-Eldin, Bin Wang, and Prashant J. Shenoy. 2021. The hidden cost of the edge: A performance comparison of edge and cloud latencies. In Proceedings of SC. Article 23, 12 page.
    [4]
    Amazon. 2022. Amazon ElastiCache. Retrieved February 15, 2022 from https://aws.amazon.com/elasticache.
    [5]
    Amazon. 2022. AWS Lambda. Retrieved February 15, 2022 from https://aws.amazon.com/lambda.
    [6]
    Amazon. 2022. Cloud Storage on AWS: S3. Retrieved June 22, 2022 from https://aws.amazon.com/products/storage.
    [7]
    Amazon. 2022. Amazon EC2 Spot Instances. Retrieved February 15, 2022 from https://aws.amazon.com/ec2/spot.
    [8]
    Bo An, Yan Li, Junming Ma, Gang Huang, Xiangqun Chen, and Donggang Cao. 2019. DCStore: A deduplication-based cloud-of-clouds storage service. In Proceedings of IEEE ICWS. 291–295.
    [9]
    Atakan Aral and Tolga Ovatman. 2018. A decentralized replica placement algorithm for edge computing. IEEE Trans. Netw. Serv. Manag. 15, 2 (2018), 516–529.
    [10]
    Microsoft Azure. 2022. Azure Blob Storage. Retrieved June 22, 2022 from https://azure.microsoft.com/services/storage/blobs.
    [11]
    Han Bao, Yijie Wang, and Fangliang Xu. 2020. An adaptive erasure code for JointCloud storage of Internet of Things big data. IEEE Internet Things J. 7, 3 (2020), 1613–1624.
    [12]
    Ben Blamey, Fredrik Wrede, Johan Karlsson, Andreas Hellander, and Salman Toor. 2019. Adapting the secretary hiring problem for optimal hot-cold tier placement under top-K workloads. In Proceedings of IEEE/ACM CCGRID. 576–583.
    [13]
    Djillali Boukhelef, Kamel Boukhalfa, Jalil Boukhobza, Hamza Ouarnoughi, and Laurent Lemarchand. 2017. COPS: Cost based object placement strategies on hybrid storage system for DBaaS cloud. In Proceedings of IEEE/ACM CCGRID. 659–664.
    [14]
    Djillali Boukhelef, Jalil Boukhobza, Kamel Boukhalfa, Hamza Ouarnoughi, and Laurent Lemarchand. 2019. Optimizing the cost of DBaaS object placement in hybrid storage systems. Future Gener. Comput. Syst. 93 (2019), 176–187.
    [15]
    James Broberg, Rajkumar Buyya, and Zahir Tari. 2009. MetaCDN: Harnessing ‘storage clouds’ for high performance content delivery. J. Netw. Comput. Appl. 32, 5 (2009), 1012–1022.
    [16]
    Robson A. Campêlo, Marco A. Casanova, Dorgival O. Guedes, and Alberto H. F. Laender. 2020. A brief survey on replica consistency in cloud environments. J. Internet Serv. Appl. 11, 1 (2020), 1.
    [17]
    E. Cao, P. Wang, C. Yan, and C. Jiang. 2020. A cloudedge-combined data placement strategy based on user access regions. In Proceedings of BigDIA. 243–250.
    [18]
    Damiano Carra, Giovanni Neglia, and Pietro Michiardi. 2019. TTL-based cloud caches. In Proceedings of IEEE INFOCOM. 685–693.
    [19]
    Damiano Carra, Giovanni Neglia, and Pietro Michiardi. 2020. Elastic provisioning of cloud caches: A cost-aware TTL approach. IEEE/ACM Trans. Netw. 28, 3 (2020), 1283–1296.
    [20]
    Ceph. 2022. Ceph Docs. Retrieved February 15, 2022 from https://docs.ceph.com.
    [21]
    Lixing Chen, Sheng Zhou, and Jie Xu. 2018. Computation peer offloading for energy-constrained mobile edge computing in small-cell networks. IEEE/ACM Trans. Netw. 26, 4 (2018), 1619–1632.
    [22]
    Min Chen, Yixue Hao, Kai Lin, Zhiyong Yuan, and Long Hu. 2018. Label-less learning for traffic control in an edge network. IEEE Netw. 32, 6 (2018), 8–14.
    [23]
    Yingying Cheng, Fan Zhang, Gang Hu, Yiwen Wang, Hanhui Yang, Gong Zhang, and Zhuo Cheng. 2021. Block popularity prediction for multimedia storage systems using spatial-temporal-sequential neural networks. In Proceedings of ACM MM. 3390–3398.
    [24]
    Giovanni Cherubini, Yusik Kim, Mark Lantz, and Vinodh Venkatesan. 2017. Data prefetching for large tiered storage systems. In Proceedings of IEEE ICDM. 823–828.
    [25]
    Amina Chikhaoui, Laurent Lemarchand, Kamel Boukhalfa, and Jalil Boukhobza. 2021. StorNIR, a multi-objective replica placement strategy for cloud federations. In Proceedings of ACM/SIGAPP SAC. 50–59.
    [26]
    Asaf Cidon, Stephen M. Rumble, Ryan Stutsman, Sachin Katti, John K. Ousterhout, and Mendel Rosenblum. 2013. Copysets: Reducing the frequency of data loss in cloud storage. In Proceedings of USENIX ATC. 37–48.
    [27]
    Alibaba Cloud. 2022. Edge Node Service. Retrieved June 22, 2022 from https://www.alibabacloud.com/product/edge-node-service.
    [28]
    Alibaba Cloud. 2022. Persistent Memory Usage. Retrieved February 15, 2022 from https://www.alibabacloud.com/help/en/doc-detail/188251.htm.
    [29]
    Yong Cui, Ningwei Dai, Zeqi Lai, Minming Li, Zhenhua Li, Yuming Hu, Kui Ren, and Yuchi Chen. 2019. TailCutter: Wisely cutting tail latency in cloud CDNs under cost constraints. IEEE/ACM Trans. Netw. 27, 4 (2019), 1612–1628.
    [30]
    Hariharan Devarajan, Anthony Kougkas, and Xian-He Sun. 2020. HReplica: A dynamic data replication engine with adaptive compression for multi-tiered storage. In Proceedings of IEEE Big Data. 256–265.
    [31]
    Thaleia Dimitra Doudali and Ada Gavrilovska. 2018. Mnemo: Boosting memory cost efficiency in hybrid memory systems. In Proceedings of ACM SoCC. 523.
    [32]
    Abhinav Duggal, Fani Jenkins, Philip Shilane, Ramprasad Chinthekindi, Ritesh Shah, and Mahesh Kamat. 2019. Data domain cloud tier: Backup here, backup there, deduplicated everywhere! In Proceedings of USENIX ATC. 647–660.
    [33]
    Jinlong E, Yong Cui, Zhenhua Li, Mingkang Ruan, and Ennan Zhai. 2020. HyCloud: Tweaking hybrid cloud storage services for cost-efficient filesystem hosting. IEEE/ACM Trans. Netw. 28, 6 (2020), 2629–2642.
    [34]
    Abdelkarim Erradi and Yaser Mansouri. 2020. Online cost optimization algorithms for tiered cloud storage services. J. Syst. Softw. 160 (2020), 110457.
    [35]
    Ghazaleh Eslami and Abolfazl Toroghi Haghighat. 2017. A new surrogate placement algorithm for cloud-based content delivery networks. J. Supercomput. 73, 12 (2017), 5310–5331.
    [36]
    Flexera. 2021. Flexera Releases 2021 State of the Cloud Report. Retrieved November 22, 2022 from https://www.flexera.com/about-us/press-center/flexera-releases-2021-state-of-the-cloud-report.
    [37]
    Market Data Forecast. 2021. Cloud Storage Market Research Report. Retrieved June 22, 2022 from https://www.marketdataforecast.com/market-reports/cloud-storage-market.
    [38]
    Yinjin Fu, Nong Xiao, Tao Chen, and Jian Wang. 2022. Fog-to-multicloud cooperative ehealth data management with application-aware secure deduplication. IEEE Trans. Dependable Secur. Comput. 19, 5 (2022), 3136–3148.
    [39]
    Yinjin Fu, Nong Xiao, Hong Jiang, Guyu Hu, and Weiwei Chen. 2019. Application-aware big data deduplication in cloud environment. IEEE Trans. Cloud Comput. 7, 4 (2019), 921–934.
    [40]
    Ammar Gharaibeh, Abdallah Khreishah, Bo Ji, and Moussa Ayyash. 2016. A provably efficient online collaborative caching algorithm for multicell-coordinated systems. IEEE Trans. Mob. Comput. 15, 8 (2016), 1863–1876.
    [41]
    Amal Ghorbel, Mahmoud Ghorbel, and Mohamed Jmaiel. 2017. Privacy in cloud computing environments: A survey and research challenges. J. Supercomput. 73, 6 (2017), 2763–2800.
    [42]
    Navneet Kaur Gill and Sarbjeet Singh. 2016. A dynamic, cost-aware, optimized data replication strategy for heterogeneous cloud data centers. Future Gener. Comput. Syst. 65 (2016), 10–32.
    [43]
    Sukhpal Singh Gill, Minxian Xu, Carlo Ottaviani, Panos Patros, Rami Bahsoon, Arash Shaghaghi, Muhammed Golec, et al. 2022. AI for next generation computing: Emerging trends and future directions. Internet of Things 19 (2022), 100514.
    [44]
    Gluster. 2022. GlusterFS Documentation. Retrieved February 15, 2022 from https://docs.gluster.org.
    [45]
    Xinxin Han, Guichen Gao, Yang Wang, Hing-Fung Ting, Ilsun You, and Yong Zhang. 2021. Online data caching in edge computing. Concurr. Comput. Early view, July 11, 2021.
    [46]
    Binbing Hou and Feng Chen. 2017. GDS-LC: A latency- and cost-aware client caching scheme for cloud storage. ACM Trans. Storage 13, 4 (2017), Article 40, 33 pages.
    [47]
    Tingting Hou, Gang Feng, Shuang Qin, and Wei Jiang. 2018. Proactive content caching by exploiting transfer learning for mobile edge computing. Int. J. Commun. Syst. 31, 11 (2018), e3706.
    [48]
    Ta Yuan Hsu and Ajay D. Kshemkalyani. 2019. A proactive, cost-aware, optimized data replication strategy in geo-distributed cloud datastores. In Proceedings of IEEE/ACM UCC. 143–153.
    [49]
    Ying-Feng Hsu, Ryo Irie, Shuuichirou Murata, and Morito Matsuoka. 2018. A novel automated cloud storage tiering system through hot-cold data classification. In Proceedings of IEEE CLOUD. 492–499.
    [50]
    Huayi Jin, Chentao Wu, Xin Xie, Jie Li, Minyi Guo, Hao Lin, and Jianfeng Zhang. 2019. Approximate code: A cost-effective erasure coding framework for tiered video storage in cloud systems. In Proceedings of ICPP. Article 95, 10 pages.
    [51]
    Nesrine Kaaniche and Maryline Laurent. 2017. Data security and privacy preservation in cloud storage environments based on cryptographic mechanisms. Comput. Commun. 111 (2017), 120–141.
    [52]
    Hourieh Khalajzadeh, Dong Yuan, Bing Bing Zhou, John C. Grundy, and Yun Yang. 2020. Cost effective dynamic data placement for efficient access of social networks. J. Parallel Distrib. Comput. 141 (2020), 82–98.
    [53]
    Tahseen Khan, Wenhong Tian, Guangyao Zhou, Shashikant Ilager, Mingming Gong, and Rajkumar Buyya. 2022. Machine learning (ML)-centric resource management in cloud computing: A review and future directions. J. Netw. Comput. Appl. 204 (2022), 103405.
    [54]
    Abdennacer Khelaifa, Saber Benharzallah, Laïd Kahloul, Reinhardt Euler, Abdelkader Laouid, and Ahcène Bounceur. 2019. A comparative analysis of adaptive consistency approaches in cloud storage. J. Parallel Distributed Comput. 129 (2019), 36–49.
    [55]
    Ana Klimovic, Yawen Wang, Patrick Stuedi, Animesh Trivedi, Jonas Pfefferle, and Christos Kozyrakis. 2018. Pocket: Elastic ephemeral storage for serverless analytics. In Proceedings of OSDI. 427–444.
    [56]
    Phu Lai, Qiang He, John Grundy, Feifei Chen, Mohamed Abdelrazek, John G. Hosking, and Yun Yang. 2022. Cost-effective app user allocation in an edge computing environment. IEEE Trans. Cloud Comput. 10, 3 (2022), 1701–1713.
    [57]
    Chunlin Li, Jingpan Bai, Yuan Ge, and Youlong Luo. 2020. Heterogeneity-aware elastic provisioning in cloud-assisted edge computing systems. Future Gener. Comput. Syst. 112 (2020), 1106–1121.
    [58]
    Chunlin Li, Jun Liu, Bo Lu, and Youlong Luo. 2021. Cost-aware automatic scaling and workload-aware replica management for edge-cloud environment. J. Netw. Comput. Appl. 180 (2021), 103017.
    [59]
    Chunlin Li, Mingyang Song, Min Zhang, and Youlong Luo. 2020. Effective replica management for improving reliability and availability in edge-cloud computing environment. J. Parallel Distrib. Comput. 143 (2020), 107–128.
    [60]
    Chunlin Li, Chengyi Wang, Hengliang Tang, and Youlong Luo. 2019. Scalable and dynamic replica consistency maintenance for edge-cloud system. Future Gener. Comput. Syst. 101 (2019), 590–604.
    [61]
    Chunlin Li, YaPing Wang, Hengliang Tang, and Youlong Luo. 2019. Dynamic multi-objective optimized replica placement and migration strategies for SaaS applications in edge cloud. Future Gener. Comput. Syst. 100 (2019), 921–937.
    [62]
    Said Limam, Riad Mokadem, and Ghalem Belalem. 2019. Data replication strategy with satisfaction of availability, performance and tenant budget requirements. Clust. Comput. 22, 4 (2019), 1199–1210.
    [63]
    Chuan Lin, Qiang Cao, Jianzhong Huang, Jie Yao, Xiaoqian Li, and Changsheng Xie. 2018. HPDV: A highly parallel deduplication cluster for virtual machine images. In Proceedings of IEEE/ACM CCGRID. 472–481.
    [64]
    Guoxin Liu, Haiying Shen, and Harrison Chandler. 2016. Selective data replication for online social networks with distributed datacenters. IEEE Trans. Parallel Distrib. Syst. 27, 8 (2016), 2377–2393.
    [65]
    Guoxin Liu, Haiying Shen, and Haoyu Wang. 2017. An economical and slo-guaranteed cloud storage service across multiple cloud service providers. IEEE Trans. Parallel Distrib. Syst. 28, 9 (2017), 2440–2453.
    [66]
    Jinwei Liu, Haiying Shen, Hongmei Chi, Husnu S. Narman, Yongyi Yang, Long Cheng, and Wingyan Chung. 2021. A low-cost multi-failure resilient replication scheme for high-data availability in cloud storage. IEEE/ACM Trans. Netw. 29, 4 (2021), 1436–1451.
    [67]
    Jinwei Liu, Haiying Shen, and Husnu S. Narman. 2019. Popularity-aware multi-failure resilient and cost-effective replication for high data durability in cloud storage. IEEE Trans. Parallel Distrib. Syst. 30, 10 (2019), 2355–2369.
    [68]
    Mingyu Liu, Li Pan, and Shijun Liu. 2019. To transfer or not: An online cost optimization algorithm for using two-tier storage-as-a-service clouds. IEEE Access 7 (2019), 94263–94275.
    [69]
    Mingyu Liu, Li Pan, and Shijun Liu. 2021. Keep hot or go cold: A randomized online migration algorithm for cost optimization in STaaS clouds. IEEE Trans. Netw. Serv. Manag. 18, 4 (2021), 4563–4575.
    [70]
    Mingyu Liu, Li Pan, and Shijun Liu. 2022. Effeclouds: A cost-effective cloud-of-clouds framework for two-tier storage. Future Gener. Comput. Syst. 129 (2022), 33–49.
    [71]
    Songbin Liu, Xiaomeng Huang, Haohuan Fu, and Guangwen Yang. 2013. Understanding data characteristics and access patterns in a cloud storage system. In Proceedings of IEEE/ACM CCGRID. 327–334.
    [72]
    Ying Liu, Qiang He, Dequan Zheng, Mingwei Zhang, Feifei Chen, and Bin Zhang. 2019. Data caching optimization in the edge computing environment. In Proceedings of IEEE ICWS. 99–106.
    [73]
    Saiqin Long, Zhetao Li, Zihao Liu, Qingyong Deng, Sangyoon Oh, and Nobuyoshi Komuro. 2020. A similarity clustering-based deduplication strategy in cloud storage systems. In Proceedings of IEEE ICPADS. 35–43.
    [74]
    Yung-Feng Lu, Chin-Fu Kuo, Shih-Chun Chou, Jhong-Syuan Li, and Yan-Wei Lai. 2017. Cost-aware software-defined hybrid object-based storage system. In Proceedings of PDCAT. 477–482.
    [75]
    Najme Mansouri and Mohammad Masoud Javidi. 2018. A new prefetching-aware data replication to decrease access latency in cloud environment. J. Syst. Softw. 144 (2018), 197–215.
    [76]
    Najme Mansouri, Mohammad Masoud Javidi, and Behnam Mohammad Hasani Zade. 2020. Using data mining techniques to improve replica management in cloud environment. Soft Comput. 24, 10 (2020), 7335–7360.
    [77]
    Najme Mansouri, M. Kuchaki Rafsanjani, and Mohammad Masoud Javidi. 2017. DPRS: A dynamic popularity aware replication strategy with parallel download scheme in cloud environments. Simul. Model. Pract. Theory 77 (2017), 177–196.
    [78]
    Najme Mansouri, Behnam Mohammad Hasani Zade, and Mohammad Masoud Javidi. 2020. A multi-objective optimized replication using fuzzy based self-defense algorithm for cloud computing. J. Netw. Comput. Appl. 171 (2020), 102811.
    [79]
    Yaser Mansouri and Rajkumar Buyya. 2019. Dynamic replication and migration of data objects with hot-spot and cold-spot statuses across storage data centers. J. Parallel Distrib. Comput. 126 (2019), 121–133.
    [80]
    Yaser Mansouri and Abdelkarim Erradi. 2018. Cost optimization algorithms for hot and cool tiers cloud storage services. In Proceedings of IEEE CLOUD. 622–629.
    [81]
    Yaser Mansouri, Adel Nadjaran Toosi, and Rajkumar Buyya. 2018. Data storage management in cloud environments: Taxonomy, survey, and future directions. ACM Comput. Surv. 50, 6 (2018), Article 91, 51 pages.
    [82]
    Yaser Mansouri, Adel Nadjaran Toosi, and Rajkumar Buyya. 2019. Cost optimization for dynamic replication and migration of data in cloud data centers. IEEE Trans. Cloud Comput. 7, 3 (2019), 705–718.
    [83]
    Ahmed Awad Mohamed, Rashed K. Salem, Hatem Abdel Kader, and Mustafa Abdul Salam. 2021. A novel intelligent approach for dynamic data replication in cloud environment. IEEE Access 9 (2021), 40240–40254.
    [84]
    Rekha Nachiappan, Bahman Javadi, Rodrigo N. Calheiros, and Kenan M. Matawie. 2017. Cloud storage reliability for big data applications: A state of the art survey. J. Netw. Comput. Appl. 97 (2017), 35–47.
    [85]
    Kwangsung Oh, Nan Qin, Abhishek Chandra, and Jon B. Weissman. 2020. Wiera: Policy-driven multi-tiered geo-distributed cloud storage system. IEEE Trans. Parallel Distrib. Syst. 31, 2 (2020), 294–305.
    [86]
    Openstack. 2022. Swift Docs. Retrieved February 15, 2022 from https://docs.openstack.org/swift.
    [87]
    Hojin Park, Gregory R. Ganger, and George Amvrosiadis. 2020. More IOPS for less: Exploiting burstable storage in public clouds. In Proceedings of USENIX HotCloud.
    [88]
    Milan Patel, Brian Naughton, Caroline Chan, Nurit Sprecher, Sadayuki Abeta, and Adrian Neal. 2014. Mobile-Edge Computing. Introductory Technical White Paper. Mobile-Edge Computing (MEC) Industry Initiative.
    [89]
    Lingjun Pu, Lei Jiao, Xu Chen, Lin Wang, Qinyi Xie, and Jingdong Xu. 2018. Online resource allocation, content placement and request routing for cost-efficient edge caching in cloud radio access networks. IEEE J. Sel. Areas Commun. 36, 8 (2018), 1751–1767.
    [90]
    Qifan Pu, Shivaram Venkataraman, and Ion Stoica. 2019. Shuffling, fast and slow: Scalable analytics on serverless infrastructure. In Proceedings of NSDI. 193–206.
    [91]
    Han Qiu, Hassan Noura, Meikang Qiu, Zhong Ming, and Gerard Memmi. 2021. A user-centric data protection method for cloud storage based on invertible DWT. IEEE Trans. Cloud Comput. 9, 4 (2021), 1293–1304.
    [92]
    Han Qiu, Chentao Wu, Jie Li, Minyi Guo, Tong Liu, Xubin He, Yuanyuan Dong, and Yafei Zhao. 2020. EC-fusion: An efficient hybrid erasure coding framework to improve both application and recovery performance in cloud storage systems. In Proceedings of IEEE IPDPS. 191–201.
    [93]
    Shweta Saharan, Gaurav Somani, Gaurav Gupta, Robin Verma, Manoj Singh Gaur, and Rajkumar Buyya. 2020. QuickDedup: Efficient VM deduplication in cloud computing environments. J. Parallel Distrib. Comput. 139 (2020), 18–31.
    [94]
    Mohammad A. Salahuddin, Jagruti Sahoo, Roch H. Glitho, Halima Elbiaze, and Wessam Ajib. 2018. A survey on content placement algorithms for cloud-based content delivery networks. IEEE Access 6 (2018), 91–114.
    [95]
    Mahadev Satyanarayanan. 2017. The emergence of edge computing. Computer 50, 1 (2017), 30–39.
    [96]
    Amazon Web Services. 2022. ElastiCache Data Tiering. Retrieved February 15, 2022 from https://aws.amazon.com/elasticache/pricing/#Data_tiering.
    [97]
    Ali Shakarami, Mostafa Ghobaei-Arani, Ali Shahidinejad, Mohammad Masdari, and Hamid Shakarami. 2021. Data replication schemes in cloud computing: A survey. Clust. Comput. 24, 3 (2021), 2545–2579.
    [98]
    Yanling Shao, Chunlin Li, Zhao Fu, Leyue Jia, and Youlong Luo. 2019. Cost-effective replication management and scheduling in edge computing. J. Netw. Comput. Appl. 129 (2019), 46–61.
    [99]
    Young-Joo Shin, Dongyoung Koo, and Junbeom Hur. 2017. A survey of secure data deduplication schemes for cloud storage systems. ACM Comput. Surv. 49, 4 (2017), Article 74, 38 pages.
    [100]
    Le Sun, Hai Dong, Farookh Khadeer Hussain, Omar Khadeer Hussain, and Elizabeth Chang. 2014. Cloud service selection: State-of-the-art and future research directions. J. Netw. Comput. Appl. 45 (2014), 134–150.
    [101]
    Sheng-Yao Sun, Wenbin Yao, and Xiaoyong Li. 2018. DARS: A dynamic adaptive replica strategy under high load cloud-P2P. Future Gener. Comput. Syst. 78 (2018), 31–40.
    [102]
    Sheng-Yao Sun, Wenbin Yao, Baojun Qiao, Ming Zong, Xin He, and Xiaoyong Li. 2019. RRSD: A file replication method for ensuring data reliability and reducing storage consumption in a dynamic cloud-P2P environment. Future Gener. Comput. Syst. 100 (2019), 844–858.
    [103]
    Choon Beng Tan, Mohd. Hanafi Ahmad Hijazi, Yuto Lim, and Abdullah Gani. 2018. A survey on proof of retrievability for cloud data integrity and availability: Cloud storage state-of-the-art, issues, solutions and future trends. J. Netw. Comput. Appl. 110 (2018), 75–86.
    [104]
    Jun Tang, Yong Cui, Qi Li, Kui Ren, Jiangchuan Liu, and Rajkumar Buyya. 2016. Ensuring security and privacy preservation for cloud data services. ACM Comput. Surv. 49, 1 (2016), Article 13, 39 pages.
    [105]
    Keitaro Uehara, Yu Xiang, Yih-Farn Robin Chen, Matti A. Hiltunen, Kaustubh Joshi, and Richard D. Schlichting. 2018. SuperCell: Adaptive software-defined storage for cloud storage workloads. In Proceedings of IEEE/ACM CCGRID. 103–112.
    [106]
    Philipp Waibel, Johannes Matt, Christoph Hochreiner, Olena Skarlat, Ronny Hans, and Stefan Schulte. 2017. Cost-optimized redundant data storage in the cloud. Serv. Oriented Comput. Appl. 11, 4 (2017), 411–426.
    [107]
    Ao Wang, Jingyuan Zhang, Xiaolong Ma, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, and Yue Cheng. 2020. InfiniCache: Exploiting ephemeral serverless functions to build a cost-effective memory cache. In Proceedings of FAST. 267–281.
    [108]
    Cheng Wang, Bhuvan Urgaonkar, Aayush Gupta, George Kesidis, and Qianlin Liang. 2017. Exploiting spot and burstable instances for improving the cost-efficacy of in-memory caches on the public cloud. In Proceedings of EuroSys. 620–634.
    [109]
    Haoyu Wang, Haiying Shen, Zijian Li, and Shuhao Tian. 2021. GeoCol: A geo-distributed cloud storage system with low cost and latency using reinforcement learning. In Proceedings of IEEE ICDCS. 149–159.
    [110]
    Haoyu Wang, Haiying Shen, Qi Liu, Kevin Zheng, and Jie Xu. 2020. A reinforcement learning based system for minimizing cloud storage service cost. In Proceedings of ICPP. Article 30, 10 pages.
    [111]
    Huaimin Wang, Peichang Shi, and Yiming Zhang. 2017. JointCloud: A cross-cloud cooperation architecture for integrated Internet service customization. In Proceedings of IEEE ICDCS. 1846–1855.
    [112]
    Shucheng Wang, Ziyi Lu, Qiang Cao, Hong Jiang, Jie Yao, Yuanyuan Dong, and Puyuan Yang. 2020. BCW: Buffer-controlled writes to HDDs for SSD-HDD hybrid storage server. In Proceedings of FAST. 253–266.
    [113]
    Wei Wang, Ben Liang, and Baochun Li. 2015. To reserve or not to reserve: Optimal online multi-instance acquisition in IaaS clouds. IEEE Trans. Parallel Distrib. Syst. 26, 12 (2015), 3407–3419.
    [114]
    Chenggang Wu, Vikram Sreekanti, and Joseph M. Hellerstein. 2021. Autoscaling tiered cloud storage in Anna. VLDB J. 30, 1 (2021), 25–43.
    [115]
    Huijun Wu, Chen Wang, Yinjin Fu, Sherif Sakr, Kai Lu, and Liming Zhu. 2018. A differentiated caching mechanism to enable primary storage deduplication in clouds. IEEE Trans. Parallel Distrib. Syst. 29, 6 (2018), 1202–1216.
    [116]
    Huijun Wu, Chen Wang, Yinjin Fu, Sherif Sakr, Liming Zhu, and Kai Lu. 2017. HPDedup: A hybrid prioritized data deduplication mechanism for primary storage in the cloud. In Proceedings of MSST.
    [117]
    Suzhen Wu, Kuan-Ching Li, Bo Mao, and Minghong Liao. 2017. DAC: Improving storage availability with deduplication-assisted cloud-of-clouds. Future Gener. Comput. Syst. 74 (2017), 190–198.
    [118]
    Zhe Wu, Curtis Yu, and Harsha V. Madhyastha. 2015. CosTLO: Cost-effective redundancy for lower latency variance on cloud storage services. In Proceedings of NSDI. 543–557.
    [119]
    Qiufen Xia, Zichuan Xu, Weifa Liang, Shui Yu, Song Guo, and Albert Y. Zomaya. 2019. Efficient data placement and replication for QoS-aware approximate query evaluation of big data analytics. IEEE Trans. Parallel Distrib. Syst. 30, 12 (2019), 2677–2691.
    [120]
    Wen Xia, Hong Jiang, Dan Feng, Fred Douglis, Philip Shilane, Yu Hua, Min Fu, Yucheng Zhang, and Yukun Zhou. 2016. A comprehensive study of the past, present, and future of data deduplication. Proc. IEEE 104, 9 (2016), 1681–1710.
    [121]
    Xiaoyu Xia, Feifei Chen, Qiang He, Guangming Cui, Phu Lai, Mohamed Abdelrazek, John Grundy, and Hai Jin. 2020. Graph-based data caching optimization for edge computing. Future Gener. Comput. Syst. 113 (2020), 228–239.
    [122]
    Xiaoyu Xia, Feifei Chen, Qiang He, John Grundy, Mohamed Abdelrazek, and Hai Jin. 2021. Online collaborative data caching in edge computing. IEEE Trans. Parallel Distrib. Syst. 32, 2 (2021), 281–294.
    [123]
    Xiaoyu Xia, Feifei Chen, Qiang He, John C. Grundy, Mohamed Abdelrazek, and Hai Jin. 2021. Cost-effective app data distribution in edge computing. IEEE Trans. Parallel Distrib. Syst. 32, 1 (2021), 31–44.
    [124]
    Jiwei Xu, Wenbo Zhang, Shiyang Ye, Jun Wei, and Tao Huang. 2014. A lightweight virtual machine image deduplication backup approach in cloud environment. In Proceedings of IEEE COMPSAC. 503–508.
    [125]
    Zichen Xu, Christopher Stewart, Nan Deng, and Xiaorui Wang. 2016. Blending on-demand and spot instances to lower costs for in-memory storage. In Proceedings of IEEE INFOCOM. 1–9.
    [126]
    Chi Yang and Jinjun Chen. 2017. A scalable data chunk similarity based compression approach for efficient big sensing data processing on cloud. IEEE Trans. Knowl. Data Eng. 29, 6 (2017), 1144–1157.
    [127]
    Jie Yang, Haibin Zhu, and Tieqiao Liu. 2019. Secure and economical multi-cloud storage policy with NSGA-II-C. Appl. Soft Comput. 83 (2019), 105649.
    [128]
    Shengsong Yang, Li Pan, Qingyang Wang, Shijun Liu, and Shuo Zhang. 2018. Subscription or pay-as-you-go: Optimally purchasing IaaS instances in public clouds. In Proceedings of IEEE ICWS. 219–226.
    [129]
    Zhengyu Yang, Morteza Hoseinzadeh, Allen Andrews, Clay Mayers, David Thomas Evans, Rory Thomas Bolt, Janki Bhimani, Ningfang Mi, and Steven Swanson. 2017. AutoTiering: Automatic data placement manager in multi-tier all-flash datacenter. In Proceedings of IEEE IPCCC. 1–8.
    [130]
    Zhengyu Yang, Yufeng Wang, Janki Bhamini, Chiu Chiang Tan, and Ningfang Mi. 2018. EAD: Elasticity aware deduplication manager for datacenters with multi-tier storage systems. Clust. Comput. 21, 3 (2018), 1561–1579.
    [131]
    Zhen Ye, Shanping Li, and Xiaozhen Zhou. 2013. GCplace: Geo-cloud based correlation aware data replica placement. In Proceedings of ACM SAC. 371–376.
    [132]
    Jianwei Yin, Yan Tang, ShuiGuang Deng, Ying Li, Wei Lo, Kexiong Dong, Albert Y. Zomaya, and Calton Pu. 2017. ASSER: An efficient, reliable, and cost-effective storage scheme for object-based cloud storage systems. IEEE Trans. Comput. 66, 8 (2017), 1326–1340.
    [133]
    Jianwei Yin, Yan Tang, Shuiguang Deng, Bangpeng Zheng, and Albert Y. Zomaya. 2021. MUSE: A multi-tiered and SLA-driven deduplication framework for cloud storage systems. IEEE Trans. Comput. 70, 5 (2021), 759–774.
    [134]
    Xingjia Yuan and Yuelong Zhao. 2019. RPMSP: A novel replica placement method inspired by self-similarity propagation of plants. In Proceedings of IEEE ISPA/BDCloud/SocialCom/SustainCom. 596–601.
    [135]
    Victor Zakhary, Lawrence Lim, Divy Agrawal, and Amr El Abbadi. 2021. Cache on Track (CoT): Decentralized elastic caches for cloud environments. In Proceedings of EDBT. 217–228.
    [136]
    Lingfang Zeng, Shijie Xu, Yang Wang, Kenneth B. Kent, David Bremner, and Cheng-Zhong Xu. 2017. Toward cost-effective replica placements in cloud storage systems with QoS-awareness. Softw. Pract. Exp. 47, 6 (2017), 813–829.
    [137]
    Lei Zhang, Xuejun Li, Hourieh Khalajzadeh, Yan Yang, Ruiyue Zhu, Xia Ji, Chuanhui Ju, and Yun Yang. 2018. Cost-effective and traffic-optimal data placement strategy for cloud-based online social networks. In Proceedings of IEEE CSCWD. 110–115.
    [138]
    Panfeng Zhang, Ping Huang, Xubin He, Hua Wang, and Ke Zhou. 2017. Resemblance and mergence based indexing for high performance data deduplication. J. Syst. Softw. 128 (2017), 11–24.
    [139]
    Wei Zhao, Jiajia Liu, Hongzhi Guo, and Takahiro Hara. 2018. ETC-IoT: Edge-node-assisted transmitting for the cloud-centric Internet of Things. IEEE Netw. 32, 3 (2018), 101–107.
    [140]
    Jiang Zhou, Yong Chen, Wei Xie, Dong Dai, Shuibing He, and Weiping Wang. 2020. PRS: A pattern-directed replication scheme for heterogeneous object-based storage. IEEE Trans. Comput. 69, 4 (2020), 591–605.

    Cited By

    View all
    • (2024)ZeroEA: A Zero-Training Entity Alignment Framework via Pre-Trained Language ModelProceedings of the VLDB Endowment10.14778/3654621.365464017:7(1765-1774)Online publication date: 30-May-2024
    • (2024)Semi-automated computer vision-based tracking of multiple industrial entities: a framework and dataset creation approachJournal on Image and Video Processing10.1186/s13640-024-00623-62024:1Online publication date: 22-Mar-2024
    • (2024)Leveraging Pretrained Language Models for Enhanced Entity MatchingInternational Journal of Intelligent Systems10.1155/2024/19412212024Online publication date: 15-Apr-2024
    • Show More Cited By

    Index Terms

    1. Cost Optimization for Cloud Storage from User Perspectives: Recent Advances, Taxonomy, and Survey

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Computing Surveys
        ACM Computing Surveys  Volume 55, Issue 13s
        December 2023
        1367 pages
        ISSN:0360-0300
        EISSN:1557-7341
        DOI:10.1145/3606252
        Issue’s Table of Contents

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 13 July 2023
        Online AM: 03 February 2023
        Accepted: 30 January 2023
        Revised: 15 January 2023
        Received: 22 June 2022
        Published in CSUR Volume 55, Issue 13s

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Storage-as-a-Service (STaaS)
        2. cloud storage
        3. cost optimization

        Qualifiers

        • Survey

        Funding Sources

        • National Key R& D Program of China
        • Key Research and Development Program of Shandong Province
        • Shandong Provincial Natural Science Foundation

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)714
        • Downloads (Last 6 weeks)57
        Reflects downloads up to 09 Aug 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)ZeroEA: A Zero-Training Entity Alignment Framework via Pre-Trained Language ModelProceedings of the VLDB Endowment10.14778/3654621.365464017:7(1765-1774)Online publication date: 30-May-2024
        • (2024)Semi-automated computer vision-based tracking of multiple industrial entities: a framework and dataset creation approachJournal on Image and Video Processing10.1186/s13640-024-00623-62024:1Online publication date: 22-Mar-2024
        • (2024)Leveraging Pretrained Language Models for Enhanced Entity MatchingInternational Journal of Intelligent Systems10.1155/2024/19412212024Online publication date: 15-Apr-2024
        • (2024)Hybrid Prompt Learning for Generating Justifications of Security Risks in Automation RulesACM Transactions on Intelligent Systems and Technology10.1145/3675401Online publication date: 29-Jun-2024
        • (2024)Research on Financial Fraud Text Classification Based on PET-BiLSTMProceedings of the 2024 3rd International Conference on Cyber Security, Artificial Intelligence and Digital Economy10.1145/3672919.3672982(341-345)Online publication date: 1-Mar-2024
        • (2024)Revealing the Unseen: AI Chain on LLMs for Predicting Implicit Data Flows to Generate Data Flow Graphs in Dynamically-Typed CodeACM Transactions on Software Engineering and Methodology10.1145/3672458Online publication date: 12-Jun-2024
        • (2024)Towards a Catalog of Prompt Patterns to Enhance the Discipline of Prompt EngineeringACM SIGAda Ada Letters10.1145/3672359.367236443:2(43-51)Online publication date: 7-Jun-2024
        • (2024)A Reasoning and Value Alignment Test to Assess Advanced GPT ReasoningACM Transactions on Interactive Intelligent Systems10.1145/3670691Online publication date: 3-Jun-2024
        • (2024)Towards AI for Software SystemsProceedings of the 1st ACM International Conference on AI-Powered Software10.1145/3664646.3664767(79-84)Online publication date: 10-Jul-2024
        • (2024)A Unified Review of Deep Learning for Automated Medical CodingACM Computing Surveys10.1145/3664615Online publication date: 17-May-2024
        • Show More Cited By

        View Options

        Get Access

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        Full Text

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media