Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Forecasting Price Trend of Bulk Commodities Leveraging Cross-domain Open Data Fusion

Published: 21 January 2020 Publication History

Abstract

Forecasting price trend of bulk commodities is important in international trade, not only for markets participants to schedule production and marketing plans but also for government administrators to adjust policies. Previous studies cannot support accurate fine-grained short-term prediction, since they mainly focus on coarse-grained long-term prediction using historical data. Recently, cross-domain open data provides possibilities to conduct fine-grained price forecasting, since they can be leveraged to extract various direct and indirect factors of the price. In this article, we predict the price trend over upcoming days, by leveraging cross-domain open data fusion. More specifically, we formulate the price trend into three classes (rise, slight-change, and fall), and then we predict the specific class in which the price trend of the future day lies. We take three factors into consideration: (1) supply factor considering sources providing bulk commodities,<?brk?> (2) demand factor focusing on vessel transportation with reflection of short time needs, and (3) expectation factor encompassing indirect features (e.g., air quality) with latent influences. A hybrid classification framework is proposed for the price trend forecasting. Evaluation conducted on nine real-world cross-domain open datasets shows that our framework can forecast the price trend accurately, outperforming multiple state-of-the-art baselines.

References

[1]
BDI. 2017. Baltic Dry Index. Retrieved from http://www.balticexchange.com.
[2]
Chao Chen, Shuhai Jiao, Shu Zhang, Weichen Liu, Liang Feng, and Yasha Wang. 2018. TripImputor: Real-time imputing taxi trip purpose leveraging multi-sourced urban data. IEEE Trans. Intell. Transport. Syst. 99 (2018), 1--13.
[3]
Chao Chen, Daqing Zhang, Xiaojuan Ma, Bin Guo, Leye Wang, Yasha Wang, and Edwin Sha. 2016. Crowddeliver: Planning city-wide package delivery paths leveraging the crowd of taxis. IEEE Trans. Intell. Transport. Syst. 18, 6 (2016), 1478--1496.
[4]
Longbiao Chen, Daqing Zhang, Xiaojuan Ma, Leye Wang, Shijian Li, Zhaohui Wu, and Gang Pan. 2016. Container port performance measurement and comparison leveraging ship GPS traces and maritime open data. IEEE Trans. Intell. Transport. Syst. 17, 5 (2016), 1227--1242.
[5]
Longbiao Chen, Daqing Zhang, Gang Pan, Xiaojuan Ma, Dingqi Yang, Kostadin Kushlev, Wangsheng Zhang, and Shijian Li. 2015. Bike sharing station placement leveraging heterogeneous urban open data. In Proceedings of the ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 571--575.
[6]
Longbiao Chen, Daqing Zhang, Gang Pan, Leye Wang, Xiaojuan Ma, Chao Chen, and Shijian Li. 2014. Container throughput estimation leveraging ship GPS traces and open data. In Proceedings of the ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 847--851.
[7]
Longbiao Chen, Daqing Zhang, Leye Wang, Dingqi Yang, Xiaojuan Ma, Shijian Li, Zhaohui Wu, Gang Pan, Thi-Mai-Trang Nguyen, and Jeremie Jakubowicz. 2016. Dynamic cluster-based over-demand prediction in bike sharing systems. In Proceedings of the ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 841--852.
[8]
Javier Contreras, Rosario Espinola, Francisco J. Nogales, and Antonio J. Conejo. 2003. ARIMA models to predict next-day electricity prices. IEEE Trans. Power Syst. 18, 3 (2003), 1014--1020.
[9]
Shanghai Baosteel Group Corporation. 2019. Retrieved from http://www.baosteel.com.
[10]
Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Mach. Learn. 20, 3 (1995), 273--297.
[11]
Pedro Domingos and Michael Pazzani. 1997. On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn. 29 (1997), 103--130.
[12]
NetEase Finance. 2017. Quotations. Retrieved from http://quotes.money.163.com.
[13]
Yoav Freund and Robert E. Schapire. 1997. A desicion-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55, 1 (1997), 119--139.
[14]
Jerome H. Friedman. 2001. Greedy function approximation: A gradient boosting machine. Ann. Stat. 29, 5 (2001), 1189--1232.
[15]
Bin Fu, Jialiu Lin, Lei Li, Christos Faloutsos, Jason Hong, and Norman Sadeh. 2013. Why people hate your app: Making sense of user feedback in a mobile app store. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1276--1284.
[16]
Yanjie Fu, Yong Ge, Yu Zheng, Zijun Yao, Yanchi Liu, Hui Xiong, and Jing Yuan. 2014. Sparse real estate ranking with online user reviews and offline moving behaviors. In Proceedings of the IEEE International Conference on Data Mining. IEEE, 120--129.
[17]
B. Guo, Y. Liu, W. Wu, Z. Yu, and Q. Han. 2017. ActiveCrowd: A framework for optimized multitask allocation in mobile crowdsensing systems. IEEE Trans. Hum.-Mach. Syst. 47, 3 (June 2017), 392--403.
[18]
Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2008. The Elements of Statistical Learning. Springer.
[19]
Stefan Hellmer and Joakim Ekstrand. 2013. The iron ore world market in the early twenty-first century-the impact of the increasing Chinese dominance. Mineral Econ. 25, 2--3 (2013), 89--95.
[20]
Chih-Wei Hsu and Chih-Jen Lin. 2002. A comparison of methods for multi-class support vector machines. IEEE Trans. Neural Netw. 13 (2002), 415--425.
[21]
Weimin Ma, Xiaoxi Zhu, and Miaomiao Wang. 2013. Forecasting iron ore import and consumption of China using grey model optimized by particle swarm optimization algorithm. Resources Policy 38, 4 (2013), 613--620.
[22]
Yiqun Ma. 2013. Iron ore spot price volatility and change in forward pricing mechanism. Resources Policy 38, 4 (2013), 621--627.
[23]
Malik Magdon-Ismail. 2000. No free lunch for noise prediction. Neural Comput. 12, 3 (2000), 547--564.
[24]
Bloomberg Markets. 2017. BDIY Quote—Baltic Dry Index. Retrieved from https://www.bloomberg.com/quote/BDIY:IND.
[25]
Nicolai Meinshausen and Peter Bühlmann. 2010. Stability selection. J. Roy. Stat. Soc.: Ser. B (Stat. Methodol.) 72, 4 (2010), 417--473.
[26]
Maria Isabel Wolf Motta Morandi, Luis Henrique Rodrigues, Daniel Pacheco Lacerda, and Isaac Pergher. 2014. Foreseeing iron ore prices using system thinking and scenario planning. Syst. Pract. Action Res. 27, 3 (2014), 287--306.
[27]
Andrew Y. Ng and Michael I. Jordan. 2002. On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. Advances in Neural Information Processing Systems (2002), 841--848.
[28]
National Bureau of Statistics. 2017. Retrieved from http://www.stats.gov.cn/.
[29]
Fleet Online. 2017. Ship dataset. Retrieved from http://www.hifleet.com.
[30]
OpenSea. 2018. Retrieved from https://opensea.pro/.
[31]
Y. Ouyang, B. Guo, X. Lu, Q. Han, T. Guo, and Z. Yu. 2018. CompetitiveBike: Competitive analysis and popularity prediction of bike-sharing apps using multi-source data. IEEE Trans. Mobile Comput. (2018), 1--1.
[32]
Online Air Quality Monitoring Platform. 2017. Air Quality. Retrieved from https://www.aqistudy.cn.
[33]
Platts. 2017. Platts Iron Iron Index. Retrieved from http://platts.com.
[34]
Alexander Pustov, Alexander Malanichev, and Ilya Khobotilov. 2013. Long-term iron ore price modeling: Marginal costs vs. incentive price. Resources Policy 38, 4 (2013), 558--567.
[35]
Qianzhan. 2017. IRON ORE. Retrieved from http://d.qianzhan.com/xdata.
[36]
Clarkson Research. 2017. Shipping Intelligence Network. Retrieved from https://sin.clarksons.net.
[37]
Jingbo Shang, Yu Zheng, Wenzhu Tong, Eric Chang, and Yong Yu. 2014. Inferring gas consumption and pollution emission of vehicles throughout a city. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1027--1036.
[38]
Xuan Song, Quanshi Zhang, Yoshihide Sekimoto, and Ryosuke Shibasaki. 2016. Prediction and simulation of human mobility following natural disasters. ACM Trans. Intell. Syst. Technol. 8, 2 (2016), 29:1--29:23.
[39]
SWS. 2018. SWS Research. Retrieved from http://www.swsresearch.com.
[40]
Linda Wårell. 2014. The effect of a change in pricing regime on iron ore prices. Resources Policy 41 (2014), 16--22.
[41]
Kun Wang, Hezhong Tian, Shenbing Hua, Chuanyong Zhu, Jiajia Gao, Yifeng Xue, Jiming Hao, Yong Wang, and Junrui Zhou. 2016. A comprehensive emission inventory of multiple air pollutants from iron and steel industry in China: Temporal trends and spatial variation characteristics. Sci. Total Environ. 559 (2016), 7--14.
[42]
Yi-Hsien Wang. 2009. Nonlinear neural network forecasting model for stock index option price: Hybrid GJR--GARCH approach. Expert Syst. Appl. 36, 1 (2009), 564--570.
[43]
Jason Weston, Sayan Mukherjee, Olivier Chapelle, Massimiliano Pontil, Tomaso Poggio, and Vladimir Vapnik. 2001. Feature selection for SVMs. In Advances in Neural Information Processing Systems. MIT Press, 668--674.
[44]
David H. Wolpert. 1996. The lack of a priori distinctions between learning algorithms. Neural Comput. 8, 7 (1996), 1341--1390.
[45]
David H. Wolpert, William G. Macready et al. 1995. No Free Lunch Theorems for Search. Technical Report. Technical Report SFI-TR-95-02-010, Santa Fe Institute.
[46]
David H. Wolpert, William G. Macready et al. 1997. No free lunch theorems for optimization. IEEE Trans. Evolution. Comput. 1, 1 (1997), 67--82.
[47]
WorldSteel. 2017. Steel Statistical Yearbook 2016. Retrieved from https://www.worldsteel.org.
[48]
Kuang Xiao, Yuku Wang, Guang Wu, Bin Fu, and Yuanyuan Zhu. 2018. Spatiotemporal characteristics of air pollutants (PM10, PM2.5, SO2, NO2, O3, and CO) in the inland basin city of Chengdu, southwest China. Atmosphere 9, 2 (2018), 74.
[49]
Anamika Yadav, Rajagopal Peesapati, and Niranjan Kumar. 2017. Electricity price forecasting and classification through wavelet-dynamic weighted PSO-FFNN approach. IEEE Syst. J. 12, 4 (2018), 3075--3084.
[50]
Dongqing Zhang, Guangming Zang, Jing Li, Kaiping Ma, and Huan Liu. 2018. Prediction of soybean price in China using QR-RBF neural network model. Comput. Electron. Agric. 154 (2018), 10--17.
[51]
Sha Zhao, Shijian Li, Julian Ramos, Zhiling Luo, Ziwen Jiang, Anind K. Dey, and Gang Pan. 2019. User profiling from their use of smartphone applications: A survey. Pervas. Mobile Comput. 59 (2019), 101052.
[52]
Sha Zhao, Julian Ramos, Jianrong Tao, Ziwen Jiang, Shijian Li, Zhaohui Wu, Gang Pan, and Anind K. Dey. 2016. Discovering different kinds of smartphone users through their application usage behaviors. In Proceedings of the ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 498--509.
[53]
Sha Zhao, Yizhi Xu, Xiaojuan Ma, Ziwen Jiang, Zhiling Luo, Shijian Li, Laurence T. Yang, Anind K. Dey, and Gang Pan. 2019. Gender profiling from a single snapshot of apps installed on a smartphone: An empirical study. IEEE Trans. Industr. Info. (Early Access). (2019).
[54]
Yu Zheng and Xing Xie. 2011. Learning travel recommendations from user-generated GPS traces. ACM Trans. Intell. Syst. Technol. 2, 1 (Jan. 2011).
[55]
Yu Zheng, Xiuwen Yi, Ming Li, Ruiyuan Li, Zhangqing Shan, Eric Chang, and Tianrui Li. 2015. Forecasting fine-grained air quality based on big data. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, Sydney, Australia, 2267--2276.
[56]
Yu Zheng, Lizhu Zhang, Xing Xie, and Wei-Ying Ma. 2009. Mining interesting locations and travel sequences from GPS trajectories. In Proceedings of the 18th International Conference on World Wide Web. ACM, 791--800.
[57]
Zhihua Zhou. 2016. Machine Learning. TsingHua Press, Beijing.

Cited By

View all
  • (2023)Cloud-Based Knowledge Extraction (CBEx) System: A Solution for Real-time Data Analysis for Rapid Decision Making2023 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)10.1109/CCECE58730.2023.10289046(324-330)Online publication date: 24-Sep-2023
  • (2023)Spatio-temporal analysis of urban crime leveraging multisource crowdsensed dataPersonal and Ubiquitous Computing10.1007/s00779-020-01456-627:3(599-612)Online publication date: 1-Jun-2023
  • (2023)Hybrid deep learning cost evaluation using CNN with ANN for the plastic injection industryNeural Computing and Applications10.1007/s00521-023-08947-635:31(23153-23175)Online publication date: 1-Nov-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 11, Issue 1
February 2020
304 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/3375625
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 January 2020
Accepted: 01 August 2019
Revised: 01 July 2019
Received: 01 May 2019
Published in TIST Volume 11, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Price trend
  2. bulk commodity
  3. cross-domain data
  4. data fusion
  5. multi-class prediction

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • National Key Research and Development Plan
  • National Natural Science Foundation of China

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)4
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Cloud-Based Knowledge Extraction (CBEx) System: A Solution for Real-time Data Analysis for Rapid Decision Making2023 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)10.1109/CCECE58730.2023.10289046(324-330)Online publication date: 24-Sep-2023
  • (2023)Spatio-temporal analysis of urban crime leveraging multisource crowdsensed dataPersonal and Ubiquitous Computing10.1007/s00779-020-01456-627:3(599-612)Online publication date: 1-Jun-2023
  • (2023)Hybrid deep learning cost evaluation using CNN with ANN for the plastic injection industryNeural Computing and Applications10.1007/s00521-023-08947-635:31(23153-23175)Online publication date: 1-Nov-2023
  • (2022)Text‐based soybean futures price forecasting: A two‐stage deep learning approachJournal of Forecasting10.1002/for.290942:2(312-330)Online publication date: 13-Sep-2022
  • (2021)Forecasting Brazilian Ethanol Spot Prices Using LSTMEnergies10.3390/en1423798714:23(7987)Online publication date: 30-Nov-2021
  • (2021)ESCORT: Fine-Grained Urban Crime Risk Inference Leveraging Heterogeneous Open DataIEEE Systems Journal10.1109/JSYST.2020.302376215:3(4656-4667)Online publication date: Sep-2021

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media