research-article

Open access

Prediction Method of Government Economic Situation based on Big Data Analysis

Authors:

Yisheng Liu,

Anying TangAuthors Info & Claims

Digital Government: Research and Practice, Volume 3, Issue 4

Article No.: 27, Pages 1 - 16

https://doi.org/10.1145/3563042

Published: 05 December 2022 Publication History

All formats PDF

Abstract

In order to improve the forecasting accuracy of economic situation, a government economic situation forecasting method based on big data analysis is proposed. According to the hardware structure of the system, STC12C5608AD is used as the data acquisition terminal chip to simplify the circuit. The proposed forecasting method can give real-time early warning to the government's economic situation. The software part screens the influencing factors of government economic development, constructs a government economic development index system, collects government economic index data, cleans, clusters, classifies, and standardizes the government economic index data, and extracts the preprocessed government economic index data from the preprocessed government economic index data through data mining. The economic development features are extracted and then input into the neural network. After training and learning, the predicted value of the economic situation is output, and the economic situation level is classified. The experimental results show that the proposed method reduces the error rate of economic situation forecast, shortens the forecast time, improves the forecast accuracy and efficiency, with the peak error ratio not exceeding 15%.

1 Introduction

Macroeconomic forecast refers to various economic forecasts based on the economic activities of the national economy. Considering the consistency, stability, and causality of macroeconomic development, a macroeconomic forecast must be scientific and reasonable, because it has become an important basis for formulating macroeconomic policies, preparing and checking economic development plans, and adjusting economic structure. Microeconomic refers to individual economic activities, that is, the economic activities of a single economic unit. It refers to individual enterprises, business units, and their economic activities, such as the production, supply and marketing, and exchange prices of individual enterprises. The operation of microeconomics, guided by price and market signals, adjusts and balances itself through competition. During the operation of the macro-economy, there are many areas where the role of market mechanism cannot be given full play to, and the state needs to use various means to carry out macro-regulation and control from the overall interests of society. From the perspective of the trend change of GDP, macroeconomic will not only affect the financial situation of the government, and enterprise profits [1], but also directly affect the income and expenditure of households and individuals. Therefore, the prediction, analysis, and evaluation of macroeconomic aggregate is particularly important [2].

Reference [3] proposes a macroeconomic prediction method based on an optimized wavelet neural network. The wolf swarm algorithm in the intelligent group algorithm is used to optimize the weight of wavelet neural network model. The normalized economic data of a province is used to learn and train the proposed model to output the macroeconomic prediction results. In Reference [4], by taking the GDP from 1978 to 2018 as the basic data, the nonlinear autoregressive with exogenous inputs (NARX) dynamic neural network and autoregressive integrated moving average model (ARIMA) model are used as single forecasts to establish the combined prediction model and predict the macro-economy.

After long-term development, updating, and improvement, the traditional macroeconomic forecasting models have become complete and mature. Although both theory-driven modeling and data-driven modeling play an important role in the field of macroeconomic forecasting [5], they rely heavily on the continuity of the laws of the economic system. The basic logic of the existing macroeconomic prediction models is to find the basic laws of economic operation through historical data and predict the future economic situation through the laws concluded based on historical data. However, the economic operation law is less likely to change in a short time, but with the increase in time interval, the deviates from the original economic operation law will be greater and greater. The effectiveness of traditional macroeconomic forecasting models is not sufficiently “good” [6]. There are serious problems in the statistical data used in the process of traditional macroeconomic prediction and analysis, which is also an important reason for the large “error” in the macroeconomic prediction and analysis, and also a fundamental problem in the process of traditional macroeconomic prediction and analysis.

Therefore, we should fully study the economic conditions of different governments, link the government economy with the overall economic development, clarify the direction of future economic development, and promote the sustainability of government economic development. Based on the above analyses, this article proposes a government economic situation prediction method based on big data analysis. The overall technical route of this method is as follows:

(1)

The hardware structure of the system is designed, and the data acquisition terminal chip adopts STC12C5608AD, which simplifies the circuit. Through the government's economic early the warning system, it can carry out real-time early warning of the government's economic situation.

(2)

After the government's economic development indicators are pre-processed and classified, the data are extracted from the government's economic development indicators. Extract the characteristics of economic development, and input the characteristics of economic development into a neural network. After training and learning, output the predicted value of the economic situation, classify the level of the economic situation, and output the predicted results.

(3)

Taking RMSE and MAE as experimental indicators, the comparison between prediction results and actual values is carried out to verified the effectiveness and accuracy of the proposed method. Moreover, the proposed method is compared with different traditional methods.

2 System Hardware Structure Design

In the second section, the hardware of the system designed. The perfection of the hardware engineering directly determines the working performance of the system. Therefore, we should strengthen the hardware function of the system to improve the effectiveness of prediction.

Figure 1 shows the hardware structure design scheme of the government economic situation prediction system based on big data analysis, mainly consisting of a data acquisition terminal, communication network, and host computer monitoring equipment.

Fig. 1.

As can be seen from Figure 1, each lower computer detects the prediction signal of the government's economic situation in real time, sends it to the GPRS module after data processing, and connects to the Internet through the mobile communication network [7]. GPRS module adopts high-performance industrial wireless module and an embedded processor, takes the real-time operating system as the software support platform, and embeds the TCP/IP protocol with independent intellectual property rights to provide users with a high-speed, stable, reliable and online transparent data transmission channel. The host communicates with the lower computer through the fixed TCP/IP address of the bridge, and carries out wireless remote transmission to the host. The monitoring equipment is a remote monitor in the system, which mainly realizes the functions of government economic situation data monitoring, fault alarm, government economic situation data query, event recording, user management, sampling control, and so on [8]. The host adopts forcecontrol6.0 configuration software to set, add, modify, and delete the basic information of the equipment. If the measured value of the monitoring equipment received through the host exceeds the set upper and lower limits, the system will automatically alarm [9]. The government economic situation data alarmed by the system each time will be output to the electronic report form to record the operator's login action.

2.1 Data Acquisition Terminal

The government economic situation data acquisition terminal shown in Figure 2 is responsible for signal acquisition, data processing, and wireless transmission.

Fig. 2.

As can be seen from Figure 2, the system hardware consists of a government economic situation signal detection processing board, an A/D converter, a microcontroller CPU, and a GPRS module. Through signal conditioning and A/D conversion, the substation bus voltage signal detection and processing and GPRS module control are realized. The GPRS module of the connection board realizes the data transmission among the CPU, GPRS module, and the mobile company SIM card [10]. According to the data acquisition and processing speed and performance requirements of the government's economic situation, the control chip STC12C5608AD is adopted, which has high speed, low power consumption, and strong anti-interference ability. STC12C5608AD is an enhanced 51 single-chip microcomputer with an AD function. All IO ports of the on-chip RAM768Byte can be configured. The high-level drive capability can reach 20mA. It has EEPROM function with only 20 or 28 pins, so it can reduce the volume. There is a reset circuit under the same crystal frequency; the working speed is 12 times that of the traditional 51 single-chip microcomputer. The instruction code takes into account 8,051, but its speed is 8–12 times faster than 8,051. Using this chip, the usage of A/D converter and peripheral equipment can be avoided, which greatly reduces the difficulty in circuit design and software programming [11].

2.2 Design of Government Economic Situation Early Warning System

The government economic situation early warning system consists of a main port and multiple branch ports. The main port is connected to multiple stations at the same time, and the branch port is connected to the main port through a network channel or serial port channel. The port consists of measurement and control device and telemetry terminal equipment. The measurement and control device are connected with the primary equipment installed on the bus through the line, collecting the government economic situation data from the primary equipment. Figure 3 shows the government's economic situation and early warning system.

Fig. 3.

It can be seen from Figure 3 that the system has the advantages of clear function, low equipment cost, convenient carrying, accurate detection, and judgment, avoids the interference of local economy and policies, and greatly shortens the elimination time of interference factors.

3 System Software Design

In system software, the big data analysis method is mainly used to design the government economic situation prediction algorithm.

3.1 Factors Affecting Government Economic Development

Combined with the cost-effectiveness of government infrastructure, the influencing factors of government economic development are extracted. Cost-benefit mainly includes the construction cost of infrastructure and operation cost. The former includes municipal, road, real estate, communication, and security facilities, which usually account for 5% to 10% of the total investment. The latter includes the consumption of labor, energy, and other materials required for the daily operation and maintenance of municipal, road, real estate and safety systems, and the maintenance costs of infrastructure, power system, heating system, water supply system, and other costs.

When analyzing the benefits of government infrastructure, it is assumed that the service life of government infrastructure is \({A }\) years, the completion time of government infrastructure and superstructure is the initial time \({t = 0}\), and the government infrastructure operator purchases infrastructure at the initial time. In time \({A }\), assuming that the income \({C}\) and annual usage \({D}\) are constant, and the project investment cost is \({E}\), including infrastructure construction cost and facility present value; under the usage \({D}\) state, the fixed cost remains unchanged, only variable cost is considered, and all costs are calculated by opportunity cost. The conditional expression of positive net present value is obtained as follows:

\begin{equation} [F(D) - G(D)]e - \sum\limits_0^A {H{e}^{ - Ct}} \ge E. \end{equation}

(1)

Where, \({F(D) }\) is the annual social income of the project, \({G(D)}\) is the annual operation and maintenance cost based on \({D}\), and \({H }\) is the investment cost, \({e}\) represents cost error. According to formula (1), during the operation of government infrastructure construction, operation and maintenance costs, labor costs, energy consumption, and so on. will be generated.

3.2 Collect Government Economic Development Index Data

Screen the influencing factors of government economic development, build the government economic development index system, and collect the government economic index data. Combined with relevant indicators and data of economic development, 9 factors such as per capita GDP, per capita industrial production, per capita fixed asset investment, per capita fiscal income, per capita resident savings deposit balance, per capita total retail sales of social consumer goods, per capita net income of rural residents, the proportion of fiscal income in GDP and the speed of economic development are selected. This article discusses the economic development in the radiation zone of government infrastructure. The comprehensive development level of the node is characterized by the comprehensive quality index and measured by composite indicators. 14 indicators are selected to build the comprehensive development level index system from the three levels of economic development, social development, and urban construction. The maximum method is used to standardize the data, and the comprehensive development level is preliminarily measured with the help of SPSS software. The index system is shown in Table 1.

Table 1.

Target layer	Criterion layer	Index layer
Economic situation	Economic development	Regional GDP
		Per capita GDP
		Proportion of output value of secondary and tertiary industries
		Public revenue
		Investment in fixed assets
		Resident savings deposit balance
	Social development	Number of employees
		Total retail sales of social consumer goods
		Urban per capita disposable income
		Post and telecommunications business volume
		Number of beds in health institutions
	Urban construction	Per capita park green space area
		Per capita Road area
		Greening coverage rate of built-up area

Table 1. Economic Situation Index System

The basic data are collected from the government statistical yearbook, i.e., China Urban Statistical Yearbook 2020.

In addition to the statistical yearbook as a collection source, government economic index data can also be collected through web crawlers. Take the government's economic development as the keyword, search the relevant web pages of situation indicators on the Internet, join the crawler queue from the seed Uniform Resource Locator (URL), analyze and download the web pages, grab the URL and obtain a new URL. The data pages containing the government's economic situation are considered relevant, including all economic related pages. However, in order to simplify the calculation, only the top 10 pages of the Internet search are collected, and the same economic situation data are not collected. In the real network traffic data, count the usage heat of various indicators, read the web page on the front page of the web page, find other link addresses in the web page, find the next web page, and set the access layers of different web pages until all web pages of the web site are captured [12]. Preprocess the captured pages, take the government's economic development as the theme content, filter out the pages with inconsistent theme, use the table tag to repair and sort out the wrong or irregular tags, store the repaired complete pages in the HTML document, select the HTML file as the root node, construct the tag tree, and use the visual information of the web page to process the web page in blocks. In line with the forecast demand of the government's economic development, remove the redundant information of the web page, link the useful information together, find the text file related to the subject content, mark the hypertext, and integrate the web page [13]. Finally, through the HTTP protocol, assist the browser to download the web page, capture the effective information in the web page, including sound, text, image and other documents, obtain the government economic index data in the field of government economic development, add the government economic development related content, collect the video, audio, database, picture, text data and other types of data in the web page, eliminate new URLs, add a new crawling queue and cycle the above operations. So far, the collection of government economic development index data is completed.

3.3 Preprocessing Government Economic Development Index Data

The massive government economic development index data collected are preprocessed to enable the government economic index data to accurately express the government's economic situation.

3.3.1 Cluster Processing of Government Economic Index Data.

The distributed k-means algorithm is used to cluster the relevant government economic development index data. First, clean up the original data of economic situation indicators, and deal with wrong data, data noise, and invalid data. Then, the time series of government economic development data in historical data are counted, and the time series are classified and processed in quarterly order to maintain the continuity of government economic development index data, and then the missing data are filled in. Through attribute mapping, the character data of the original dataset is converted into digital standardized data. The mapping formula is as follows:

\begin{equation} m{\rm{ = }}\frac{{({m}_{\max } - {m}_{\min })(n - {n}_{\min })}}{{{n}_{\max } - {n}_{\min }}}{\rm{ + }}{m}_{\min }. \end{equation}

(2)

Where, \({m }\) is the processed standardized government economic index data, \({{m}_{\max}}\) and \({{m}_{\min}}\) are the maximum and minimum values of the processed government economic index data, respectively, \({n}\) is the original historical data of the government economic development index, and \({n_{\max}}\) and \({n_{\min}}\) are the maximum and minimum values of the original data, respectively. Randomly select \({k }\) data objects in the dataset as the initial clustering center of government economic development index data, and compare the initial clustering center with the remaining data objects by using Mahalanobis distance. Mahalanobis distance is the covariance distance of data, which is an effective method to calculate the similarity of two unknown sample sets. Unlike Euclidean distance, it takes into account the relationship between various characteristics, so as to improve the effectiveness of calculation. The calculation formula of Mahalanobis distance is

\begin{equation} G = j \times (j - {\rm{1}}) \times (j - {\rm{2}}). \end{equation}

(3)

Suppose \({{H}_{ij}}\) is the Mahalanobis distance between the government economic indicator data \({i}\) and the government economic indicator data \({j}\). If the Mahalanobis distance \({H_{ij}}\) is closer to 1 or −1, the higher the correlation degree, the closer the distance between the two governments economic index data. If \({H_{ij} }\) is closer to 0, the lower the correlation degree, the farther the distance. The remaining government economic index data objects are classified into the nearest initial cluster center, and then the cluster center is re-selected. It is iterated for many times until the criterion function converges, while the \({k}\) cluster centers remain unchanged. The definition formula of criterion function J is as follows:

\begin{equation} J = {H}_{ij}\int\limits_{{r = 1}}^{k}{{{Z}_r - {E}_r}}dr. \end{equation}

(4)

Where, \({Z_{r} }\) is the central point value of the cluster center of class \({r}\), and \({E_r}\) is the average value of \({Z_r}\). Clean the clustered government economic indicator data, and delete records irrelevant to government economic development, including picture content requests, file requests and crawler requests. When HTTP requests are initiated, separate illogical sessions and record a large amount of government economic development information through HTTP headers. Based on the government economic index data after piecewise clustering, the collected data are finely classified to obtain different local data tuples. The refined data items after segmentation are shown in Table 2.

Table 2.

Field code	Field type	Meaning
MCI	VarChar2	Regional GDP
MC	VarChar	Per capita GDP
SJY	VarChar	Proportion of output value of secondary and tertiary industries
SJYLX	VarChar	Public revenue
SJFL	VarChar	Investment in fixed assets
TXTZ	VarChar2	Resident savings deposit balance
ZXZBX	Date	Number of employees
ZXZBY	Date	Total retail sales of social consumer goods
YSZBX	VarChar2	Urban per capita disposable income
YSZBY	Number	Post and telecommunications business volume
BLC	Number	Number of beds in health institutions
BB	Number	Per capita Park green space area
SYQX	VarChar2	Per capita Road area
SJYSM	VarChar2	Greening coverage rate of built-up area

Table 2. Data Items of Economic Situation Indicators

Build a distributed SQL database to represent the attribute structure of data items, and provide data support for government economic situation prediction through various refined datasets. So far, the clustering processing of government economic index data is completed.

3.3.2 Ranking the Primary and Secondary Relationship of Government Economic Index Data.

The SPRINT classification algorithm is used to sort the primary and secondary relationship of government economic development index data. SPRINT classification algorithm is easy to understand and has a low degree of time complexity, which is the main advantage. It can be used for processing of small datasets, and the missing value of the algorithm is insensitive, which can effectively extract the characteristics of related data. The maximum minimum normalization formula is used to discretize the continuous numerical attribute of government economic index data, and the government economic index data is linearly transformed. The calculation formula is

\begin{equation} V{\rm{ = }}\frac{{\beta (L + M)}}{{\left| {M - N} \right|}}. \end{equation}

(5)

Where, \({L}\) is the data value of government economic indicators, \({M}\) and \({N}\) are the maximum and minimum values of government economic indicator data with the same attribute, respectively; \({\beta}\) is the mapping interval, and \({V}\) is the mapped value of government economic indicator data. The neural network center is used to replace the continuous value of government economic index data, convert the data attributes into discrete values, display regular rules on the basis of ensuring the relative attributes, and reduce the number of values of the same attribute data [14]. The sprint classification algorithm is adopted to sort the primary and secondary relationship of government economic development index data, classify the economic development level of governments in surrounding areas, divide the governments in surrounding areas into multiple sub groups, regard the governments in surrounding areas with different development levels as different categories, and distinguish the economic development level of governments in surrounding areas. It is worth noting that the economic development level of the same subdivision government is close to each other. The classification of government economic indicator data is realized through the decision tree. The attribute with the highest priority is selected as the root in the government economic indicator data to provide the preprocessed attribute set. Search for commonness from the government economic indicator data, make a series of sorting decisions, split the decision tree nodes, and then split the government economic indicator data attributes, so that the attributes are accurately associated with the child nodes, and the attribute value segmentation dataset can be obtained [15]. If the number of dataset categories is \({c }\) and the number of dataset categories is equal to the number of leaf node categories, the calculation formula of splitting parameter \({F }\) is

\begin{equation} F = V\sum\limits_I^c {1 - p_I^2}. \end{equation}

(6)

Where, \({p_{I} }\) is the relative frequency of dataset category \({I }\). Select a data node in the dataset, take the logical judgment of the economic development level of the surrounding government as the internal node of the decision tree, take the branch result of the logical judgment as the edge of the decision tree, and associate the data attributes to the root node of the decision tree, so as to construct a multi tree decision tree. When all the government economic indicator data belong to the same category, the class label is used to define the leaf node. When the government economic indicator data do not belong to the same category, the data attribute is measured according to the information entropy, and the data in the original attribute set is deleted. When the candidate set is empty, the leaf node is returned and marked as a common category. For different types of government economic index data, the calculation formula of information entropy \({W }\) is

\begin{equation} W = \lg \frac{{{C}_I}}{{\left| \xi \right|}}\sum\limits_I^c {\xi {C}_I}. \end{equation}

(7)

Where, \({\xi}\) is the dataset given by the decision tree, and \({{C}_I}\) is the set of datasets belonging to class \({I }\) objects. Classify the dataset \({\xi}\) according to the attribute characteristics to obtain multiple different objects. The weighted sum of the information entropy \({W}\) is obtained through partition entropy, based on which the information gain attribute of government economic index data can be calculated according to the formula as follows:

\begin{equation} K = \sum\limits_{I = 1}^\eta {\frac{{W{C}_I}}{{\left| \xi \right|}}}. \end{equation}

(8)

Where, \({K }\) is the information gain of government economic index data, and \({\eta}\) is the number of attribute characteristics of the dataset. In the attribute set, select the attribute with the highest information gain \({K }\), mark the leaf node, get the score of the attribute with the highest information gain, and make the subset elements of the dataset meet the score. When the categories at the nodes are the same, and the remaining attributes cannot be subdivided, or the given score has no data, create a class label, terminate the division of the decision tree, and complete the classification of the economic development level of the governments in the surrounding areas. So far, the sorting of the primary and secondary relationships of government economic index data is completed, and the preprocessing of government economic development index data is completed.

3.4 The Characteristics of Government Economic Development

Based on the semantic attention of government economic indicator data, this study highlights the semantically similar government economic development indicator data, and uses it to predict the government economic situation. According to the above basis, analyze the factors that affect the semantic distance between data items including economic development, social development, and urban construction. According to the analysis results of the above influencing factors, the influencing factors are taken as the dynamic characteristics of the government economic indicator data, so that the data of the government economic indicators change over time, showing different characteristics of government economic development. Obtain multidimensional data information according to dynamic features, conduct data exploration to reduce the dimension of government economic indicator data, and convert multidimensional dynamic features into two-dimensional dynamic features according to the dimension combination obtained by data exploration [16]. Assuming that the characteristic dimension of government economic indicator data is \({z }\), the calculation formula for the combination exploration condition \({R }\) of government economic indicator data is

\begin{equation} R = \sum\limits_{z > 0}^z {z \times (z - {\rm{1}}) \times (z - 2)}. \end{equation}

(9)

The abstract features of government economic index data in information space are extracted, and the abstract feature types are divided into three categories: time series, network, and level. The calculation parameters of semantic distance are determined according to the structural relationship among the three types of government economic indicator data. Suppose that the dynamic characteristic object of the government economic development index is \({s}\) and the data object of any government economic index in information space is \({x }\), then the semantic distance \({d( {s,x} ) }\) between \({s }\) and \({x }\) is

\begin{equation} d(s,x) = w\left[ {f(s,x) + g(s,x) + l(s,x)} \right]. \end{equation}

(10)

Where, \({ f( {s,x} ) }\) is the two-dimensional display of \({x }\) on the combination of \({s }\) dynamic feature dimensions. The implicit intention is used to determine the impact of government economic index data on the prediction of government economic situation, the explicit intention is used to clarify the prediction intention of government economic situation.

\({g(s,x) }\) represents the association relationship between \({x }\) and \({s }\).

\({l(s,x) }\) is the center distance after semantic representation of \({x }\) and \({s }\), and \({w }\) represents the weight.

The semantic distance is taken as an important parameter of the semantic attention of government economic indicator data. Through \({d(s,x) }\), the distance between \({x }\) and \({s }\) at the semantic level is adopted, based on which it is possible to determine the a priori importance of different data items of government economic indicator data in the prediction of government economic situation, set the semantic attention threshold, and limit the collection of government economic indicator data items. The semantic attention \({P }\) of government economic indicator data item \({x }\) with respect to \({s }\) is

\begin{equation} P = \frac{{k(s, x)C}}{{d(s, x)}}. \end{equation}

(11)

Where, \({k( {s,x} ) }\) is the a priori importance of \({x }\) with respect to \({s }\), and \({C }\) is the semantic attention threshold. The greater the semantic attention, the closer the semantics of \({x }\) and \({s }\). Aggregate the data items of government economic indicators with similar semantics to assist in the prediction of the government economic situation.

For the government economic development index data with similar semantics, the association rules of the government economic situation in the surrounding areas are mined, and the economic development model of each government is determined according to the relationship between different attributes and characteristics of the government economic index data. In the dataset, the attribute information of government economic development index data is extracted and divided into three sets: continuous attribute set, original invariant attribute set and nominal attribute set. This study uses the knowledge base of HowNet, defines the words existing in the semantic dictionary, and takes the def item in HowNet as the concept of words. According to the above concept of words, words with similar meanings need to be replaced, so that words have semantic relevance. HowNet (English name is HowNet) is a common-sense knowledge base that takes the concepts represented by Chinese and English words as the description object and regards reminding the relationship between concepts and the attributes of concepts as the basic content. At the same time, the semantic similarity interval between words is considered in this study, and the minimum semantic similarity and the maximum semantic similarity are adopted. Calculate the semantic similarity of different government economic indicator data according to the minimum and maximum semantics. The specific calculation formula is

(12)

Where, \({{K}_a,{K}_b }\) is the concept of semantic word \({a,b }\) of government economic indicator data, \({{K}_a \cap {K}_b}\) is the number of words with the same definition of the two concepts. The value of concept similarity is within [0, 1]. The smaller the similarity is, the lower the possibility of concept semantic correlation between the mined feature attributes and the prediction of government economic situation is, and the greater the similarity is, the closer the concept semantics is. Set the semantic similarity threshold, select the feature attribute with \({M }\) greater than the threshold, extract the government economic index data and determine the similarity between data. The semantic similarity matrix is used to represent the semantic similarity of all government economic index data. Combined with the semantic elements of government economic situation prediction, we mine the index features of deep semantic connection, analyze the common parts of the semantic elements of index feature attributes, and obtain the semantic connection key points and the semantic information describing the characteristics of government economic development. According to the semantic bias of economic development characteristics to economic situation prediction, semantically process the data mining results of government economic indicators, and define the characteristics of government economic development. So far, the excavation of the characteristics of government economic development is completed.

3.5 Training of Government Prediction Model

The experimental data were collected from the National Bureau of statistics. The types of data collected include residents’ income, production level, socio-economic level, and so on. Input the government economic development data into BP neural network to predict the government economic situation. The BP neural network applies the radial basis function of Multivariable Interpolation, selects the three-layer forward network as the typical structure of the neural network, and transforms the characteristic attributes of government economic development in the surrounding areas extracted from the input layer in the middle layer, so as to make the category of the characteristic attributes of government economic development in the surrounding areas closer to the center of the network. If the output value of the \({i }\)th neuron is \({{x}_i }\) and the sample point of the \({j }\)th network center is \({{G}_j }\), the corrected new network center \({B }\) is

\begin{equation} B = M\sum\limits_{i = 1}^n {{x}_i - } {G}_j. \end{equation}

(13)

The characteristic attributes of government economic development in surrounding areas are divided into new network centers, and the collection of network centers is used as the value domain to replace the characteristic values of government economic development in surrounding areas, so as to eliminate the impact of different dimensions of data on the prediction of government economic situation and find out the change law of government economic situation. In the prediction of the government's economic situation, with the increase of the prediction length, the error of the prediction value will become larger and larger. Therefore, the BP neural network adopts the learning training of the fitting error difference to ensure the prediction accuracy of nonlinear factors. The learning algorithm of the BP neural network is composed of four processes on the premise of the error back propagation algorithm in the neural network. The input mode in the first stage is the forward propagation of the input layer to the output layer through the middle layer, the expected output of the network and the actual output of the network in the second stage is the error inverse propagation of the error signal to the input layer through the middle layer, and the connection weight of the neural network is corrected layer by layer; the third stage is the repeated alternation of the error inverse propagation and the mode forward propagation; and the fourth stage is the convergence of the neural network, and the learning convergence process of network global error tending to a minimum [17, 18]. The overall process of predicting the government's economic situation by BP neural network is shown in Figure 4.

Fig. 4.

As shown in the three-layer BP network structure in Figure 4, the number of nodes in the input layer is set to 2, the number of nodes in the hidden layer is set to 6, the number of nodes in the output layer is set as the number of output vectors, and the number of output vectors of the target value of the neural network is set to 1, that is, the prediction result of the government's economic situation. Taking the characteristic attribute of the government's economic development as the training data and test data, BP neural network training is carried out based on training data, and the predicted value of the government's economic situation is output. The predicted value of economic situation is divided into 1 ∼ 5 levels, as shown in Table 3.

Table 3.

Estimate	Level	Meaning
0∼20	1	The government's economic situation is preferable
20∼40	2	The government's economic situation is good
40∼60	3	The government's economic situation is general
60∼80	4	The government's economic situation is slightly poor
80∼100	5	The government's economic situation is poor

Table 3. Level of Government Economic Situation

According to Table 3, the level of government economic situation is determined. So far, the prediction of the government economic situation has been completed, and the design of the government economic situation prediction method based on big data analysis has been realized.

4 Experimental Analysis

The proposed government economic situation prediction method based on big data, the structural vector autoregressive model in Reference [3], and the autoregressive moving average model in Reference [4] are comparatively analyzed.

4.1 Experimental Data

Taking the regional economic belt around regional government infrastructure as the experimental object, the absolute economic difference is expanding year by year from 2010 to 2020, and the relative economic difference is expanding in fluctuation. It can be roughly divided into two stages, including the stage of narrowing the economic gap from 2010 to 2013 and the stage of continuous expansion of economic differences from 2013 to 2019. In the first stage, the coefficient of variation and Searle index are lower than the average level. The Searle index reached the lowest value of 0.3065 in 2012 and the coefficient of variation reached the lowest value of 0.85352 in 2013. In the second stage, the coefficient of variation and sear index in 2019 are 1.18 and 1.32 times higher than those in 2000, respectively. The experimental data comes from the economic data published by the National Bureau of statistics, and there are a total of 3,000 sets of energy consumption data, including 2,000 groups of data are randomly selected as training data and 1,000 groups of validation data. This method is used to validate the model. The number of particle swarm optimization is set to 36 and the learning factor is 2.05. 100 independent simulation experiments were carried out for each method, and 300 iterative calculations were carried out for each simulation.

The average value of the relative error between the prediction result and the actual value is regarded as the evaluation index of the model accuracy. The model simulation verification results are shown in Table 4.

Table 4.

Training times	Proposed method		Reference [3] method		Reference [4] method
Training times	RMSE	MAE	RMSE	MAE	RMSE	MAE
1	0.46	0.16	2.56	1.98	2.99	1.32
2	0.37	0.35	2.32	1.33	2.36	1.02
3	0.12	0.22	2.68	1.52	2.32	1.66
Average	0.32	0.24	2.52	1.61	2.56	1.33

Table 4. Comparison of RMSE and MAE

4.2 Analysis of Prediction Trends

Random factors may interfere with the output results in the process of data training. While calculating the training error value of each prediction model, add the average value of calculation error to avoid affecting the evaluation of the prediction model and improve the stability of output results. The comparison results of RMSE, MAE, and average error of the output results of each model are shown in Table 4.

As can be seen from Table 4, the RMSE, MAE and average error of the proposed prediction model are all less than 0.5, which has obvious advantages among the three methods. The average value of RMSE, MAE, and error of the prediction model in Reference [3] is not less than 1.61, and the average value of RMSE, MAE, and error of the prediction model in Reference [4] is not less than 1.33. It can be seen from the above comparison that the prediction model of the proposed method has a low error rate and strong prediction performance.

The proposed method, the Reference [3] method, and the Reference [4] method were compared in terms of prediction accuracy. The comparison results of prediction accuracy are shown in Figure 5.

Fig. 5.

As seen from Figure 5, the economic situation prediction accuracy of the proposed model in this article is high, which remains above 94%. The economic prediction accuracy of the Reference [3] method and Reference [4] method shows an overall deviation trend. From the above comparison, we can see that the proposed model can accurately predict the economic situation without significant deviation, and the prediction accuracy is high at any time node. It is further verified that the proposed method has high prediction accuracy and a low error rate.

Through in-depth analysis of the above prediction results, the percentage distribution of the prediction results of the proposed method, Reference [3] method, and Reference [4] method within different relative error ranges is shown in Figure 6.

Fig. 6.

It can be seen from Figure 6 that when the proposed method is used to predict the relative error, the proportion fluctuation of the error in different ranges is small, and the peak value of the overall error rate is no more than 15%, which shows that the changing trend of power supply energy efficiency predicted by the proposed method is not disturbed by external factors, and the output result is stable. The error rate of the methods in Reference [3] and Reference [4] fluctuates obviously in different ranges, and the peak error rate of the method in Reference [3] is not less than 25%. The peak error rate of the method in Reference [4] is not less than 30%. Through the above comparison, it can be seen that there is a large gap between the methods in Reference [3] and Reference [4] and the proposed methods, indicating the stability and practicability of the proposed methods.

5 Conclusion

With the development of macroeconomic theory, macroeconomic forecasting has become another important aspect of empirical analysis and economic model analysis. The core idea of traditional macroeconomic forecasting methods is to find the internal law of statistical data through specific models and methods, so as to predict the future. Therefore, this article proposes a government economic situation prediction method based on big data analysis to divide the level of economic development. Experiments show that the proposed method can accurately predict the government economic situation, and the peak value of the overall error rate is no more than 15%. In the future research work, a new algorithm should be used to optimize the calculation process in order to reduce the error rate.

References

[1]

J. Lu and A. Ms. 2019. The impact of innovation on IPO short-term performance – Evidence from the Chinese markets. Pacific-Basin Finance Journal 53, 2 (2019), 208–235.

Abstract

1 Introduction

2 System Hardware Structure Design

2.1 Data Acquisition Terminal

2.2 Design of Government Economic Situation Early Warning System

3 System Software Design

3.1 Factors Affecting Government Economic Development

3.2 Collect Government Economic Development Index Data

3.3 Preprocessing Government Economic Development Index Data

3.3.1 Cluster Processing of Government Economic Index Data.

3.3.2 Ranking the Primary and Secondary Relationship of Government Economic Index Data.

3.4 The Characteristics of Government Economic Development

3.5 Training of Government Prediction Model

4 Experimental Analysis

4.1 Experimental Data

4.2 Analysis of Prediction Trends

5 Conclusion

References

Cited By

Index Terms

Recommendations

Research on the Influence of East Asia Free Trade Area on China's Economic Effect Based on Big Data Analysis

Contrasting strategies are pursued by big three economic powerhouses

Analysis of regional economic development differences of China: A comparison between Zhejiang and Shaanxi based on variation coefficient

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

HTML Format

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations