Keywords

1 Introduction

Ever-increasing mobile-traffic demand requires operators to deploy more base stations and to keep updating their radio access network (RAN). In the future, 5G RAN is set to provide an even larger variety of services with widely varying requirements on bandwidth and latency. Therefore, future RAN design is expected to satisfy the very stringent constraints in terms of tolerable latency and required data rate [1], e.g., end-to-end latency requirements below 1 ms, more than 250 Gb/s/km\(^2\) in dense-urban areas and device density in the order of thousands per km\(^2\) [2]. Overall, a very large amount of baseband processing resources and network resources will be required to be deployed in the RAN.

Fig. 1.
figure 1

Illustration of 2-layer and 3-layer C-RAN architecture

In 4-th generation LTE networks, Distributed Radio Access Network (D-RAN) is the dominant RAN paradigm, where the Base Station (BS) comprises two modules: Remote Radio Head (RRH) for transmission and reception of radio signals, frequency up/down conversion and power amplification, and Baseband Unit (BBU) to perform digital processing of baseband signal. The BBU and RRH were traditionally co-located in the same housing facility, so the maintenance and investment costs of networks increase linearly with the number of BSs, which leads to a not scalable solution. An alternative RAN architecture, called Centralized Radio Access Network (C-RAN), has been proposed as a scalable solution in terms of both power and cost efficiency. In C-RAN, the BBUs are centralized into larger housing facilities, called BBU pools, which are connected with the RRHs through a high-capacity and low-latency “fronthaul” network [3]. Optical metro/aggregation networks based on wavelength division multiplexing (WDM) are considered as a promising candidate for transport network of RAN. Besides performing traffic aggregation and switching (SW), the central offices (COs) at different hierarchical stages of the metro network can be also equipped with processing resources to process baseband signal. The procedure of moving BBUs from cell sites to COs is called BBU hotelling, as shown in Fig. 1(left). From cell site to access/main CO, to core CO, the fewer COs need to be used as BBU hotels, and the more “consolidation” is achieved through BBU hotelling. As the motivations of BBU hotelling are to allow operators to save costly installation and maintenance of processing facilities inside COs, centralizing more BBU into COs in higher stages of networks can increase these benefits.

However, due to the high fronthaul cost of C-RAN [4], RAN architecture is further evolving towards decentralizing part of baseband processing functions at the network edge, in order to decrease the fronthaul bandwidth. BBU is now divided into two parts: Distributed Unit (DU) and Central Unit (CU). The DU contains some real-time baseband processing functions, i.e., Hybrid Automatic Repeat Request (HARQ) processing, radio coordination, while CU contains some non real-time baseband processing functions. Original RRH with part of PHY-layer functions comprises Remote Unit (RU). It follows that the RAN architecture evolves from a 2-layer (BBU-RRH) to 3-layer (CU-DU-RU) architecture. Through flexible distribution of CUs and DUs in the metro transport network, service requirements can be satisfied according to different DU/CU hotelling schemes. As shown in the right side of Fig. 1, the RU replaces the RRH at the cell sites, and DUs/CUs can be placed at any locations starting from the cell sites up to the COs at the different stages of the metro network. Compared with 2-layer RAN architecture, a new network segment is introduced, called midhaul, which connects DU and CU. In this paper, we re-consider the DU/CU placement problem in the 3-layer RAN architecture by minimizing the number of active COs for CU/DU hotelling. We also provide a mathematical model for this placement problem, and we apply this model over a limited, yet realistic network scenario. This model allows us to show the interplay between DU/CU placement and front/mid/back-haul, and to investigate the relation between the consolidation of baseband processing functions and the processing/bandwidth capacity constraints, as well as service latency. For the metro transport solutions, we consider OTN (more specifically, we assume a version of OTN optimized for mobile transport, called M3C-OTN, aiming at Mobile optimized, Multi-service and Metro Cloud OTN solution for the 5G scenario) and overlay, in line with the assumptions used in [8].

The remainder of this paper is organized as follows. Section 2 introduces the related works. Section 3 introduces functional split based 3-layer RAN architecture. Section 4 illustrates joint DU/CU placement problem in the metro networks. Section 5 shows the illustrative numerical results. Section 6 concludes the work.

2 Related Works

The problem of baseband functions placement and traffic routing has been investigated in recent years. In traditional C-RAN architecture, authors in [5] propose the energy-efficient virtual base station (vBS) formation problem, where they combine the virtual passive optical network (vPON) with vBS to form a virtual RAN (vRAN) and minimize the number of active digital units (DUs) in the DU pool and the number of wavelengths in the PON at the same time. In [6], a “BBU aggregation” problem is proposed to minimize the number of the active BBUs for energy saving. BBU aggregation is modeled as an evolved 2D bin-packing problem and two heuristics to solve this problem are proposed. Authors in [7] motivate energy-efficient BBU aggregation problem in an AWGR-based passive WDM network. They introduce the AWGR decomposition into the BBU aggregation to help reduce the cost of the tunable transceivers and BBUs. The problem of BBU placement over a metro/aggregation network was first proposed in [8], where the relation between latency and the consolidation of BBUs is discussed under the OTN and overlay cases separately. In addition to BBU placement problem, BBU pool allocation and selection problem for C-RAN is also proposed in [9], the authors investigated how to deploy BBU pool among the optical network nodes and how to choose BBU pool to host the BBU of each traffic request with the objective of maximizing traffic acceptance ratio and minimizing network resource usage.

For baseband function placement in a functional spilt RAN architecture, Ref. [10] proposes a graph-based framework for flexible baseband function splitting and placement problem, and determines how to split the baseband function chain for each cell to maximize the utilization of processing resources under the constraint of latency and processing capacity using genetic algorithm. In [11], the authors propose a fully flexible functional split RAN architecture and define a new baseband entity, called flexible unit (FU). Based on this, they motivate a minimal number of active central office problem through the energy-efficient placement of FUs.

In summary, most existing works consider baseband function placement in the 2-layer RAN, while there are no works providing the analysis of the CU and DU placement problem in 3-layer C-RAN based on the 3GPP standard functional split options and the relation between the consolidation of CU/DU and constraints of latency, network capacity, as the one provided in this paper.

3 Functional Split Based 3-Layer C-RAN Architecture

According to 3GPP [12] and other standardization bodies, the 5G RAN architecture has been defined as a 3-layer architecture consisting of a CU, a DU, and a RU. Accordingly, the network between cell sites and mobile core is also divided into three segments: fronthaul, midhaul and backhaul. To address the strict bandwidth and latency requirements in 5G-RAN, 3GPP has proposed multiple functional spilt options, typically listed from option 1 to option 8, as shown in Fig. 2. In this paper we consider that the baseband processing entities (CU, DU and RU) are connected via two interfaces, F1 and Fx, as the split at Option 2 (interface F1) and Option 7 (interface Fx) have been selected by ITU [14] as the standard split options. According to [13], different baseband functions have various characteristics in terms of bandwidth requirement and processing complexity. The bandwidth, processing and latency requirement of CU, DU and RU can be summarized as follows.

Fig. 2.
figure 2

Example of function split options

Transport Bandwidth: In the cell level, the F1 interface scales and dynamically varies with the air interface traffic load, so midhaul bandwidth scales with traffic load as backhaul and requires only slightly more bandwidth than backhaul. The fronthaul bandwidth at Fx also varies with the air interface traffic load but requires higher rates (typically by up to an order of magnitude compared to backhaul). In the user level, the bandwidth requirement of a user is related to the resource blocks (RB) occupied by this user in the carrier spectrum.

Processing Complexity: The processing complexity of baseband functions can be measured in Giga Operation Per Second (GOPS) [19]. The processing complexity of channel estimation, resource (de)mapping, FFT/IFFT is related to carrier bandwidth and number of antennas (called cell-processing functions), whereas the processing complexity of MAC/RLC/PDCP layer and the other functions in the PHY layer are related also to the traffic load besides carrier bandwidth and number of antennas (called user-processing functions). According to 3GPP [12], for option 2, RRC and PDCP are in the CU, while RLC, MAC, part of physical layer are in the DU. For the option 7 OFDM and MIMO precoding reside in the DU, FFT, resource mapping and RF resides in RU. So the processing complexities of DU and CU are all user load dependent. The details for the calculation of bandwidth requirement between CU, DU, RU and processing complexity of each entity used in this paper can be found in [13].

Transport Latency: Transport latency requirements for backhaul and midhaul links are determined by service latency requirements, i.e. around 10 ms for eMBB, about 1 ms for URLLC and ranging from 1 ms to several 10 ms for mMTC [16]. For the fronthaul links at Option 7, the latencies are determined by the requirements of the RAN technology. To satisfy the HARQ processing latency requirement, a total round-trip latency budget of RTT\(_{BBU-RRH}\) = 3 ms is available between a BBU and its corresponding RRH. It means that the NACK/ACK should be transported on the fronthaul link within hundreds of microseconds [15]. As we know the HARQ processing function is located in the low-MAC layer and MAC layer is in the DU, so latency requirement between DU and RU should be the same as HARQ processing latency requirement. According to [17], the reference values of the latency in different parts of networks are given as follows:

  • Signal processing latency: The BBU completes the processing and send ACK/NACK usually within 2.75 ms. In the 3-layer architecture, no matter where the DU and CU are placed, the total processing can be seen as a fixed value.

  • Signal propagation latency: For HARQ processing, propagation latency is related to the distance between RU and DU, and for the whole service, the propagation latency is related to the distance between the server and the RU (we ignore wireless transmission latency).

  • Switching latency: We assume an active solution for the fronthaul, for OTN encapsulation, the latency is about 40 \(\upmu \)s, for the non-OTN encapsulation, the latency is about few \(\mu \)sec.

  • Encapsulation (like CPRI) processing latency: Before the CU or DU transmit or receive the data, the data must be encapsulated or de-encapsulated with CPRI protocol. This procedure will cost around 10 \(\upmu \)s.

4 Joint DU/CU Placement in Metro Networks

4.1 Problem Statement

The DU/CU placement problem over metro/aggregation networks can be stated as follows. Given network topology, number of wavelengths per link and their line-rate capacity, set of traffic requests, maximum fronthaul latency (HARQ latency) and service latency, decide the placement of DUs and CUs in different COs that maximizes the consolidation of the baseband processing functions (i.e., minimizes the number of CO housing processing) under latency and capacity constraints.

The placement of CU/DU is not only restricted by the processing capacity and network capacity, but also subjected to the RAN latency and service latency constraints. Figure 3 shows an example of CU/DU placement with two service requests from different cell sites. For service 1, the DU and CU are placed in main COs for a higher consolidation, whereas for service 2, restricted by the bandwidth capacity, the DU of service 2 must be placed in the access CO. Next, we will show the mathematic model of the DU/CU placement problem.

Fig. 3.
figure 3

Illustration of possible baseband function placement

4.2 Model

  1. (1)

    Given sets and parameters

    • \(V \): set of COs

    • \(S \): set of service requests

    • \(N \): set of processing elements: RU, DU, CU, server

    • \(C ^i_p\): processing capacity of COs \(i \)(GOPS)

    • \(C _w\): bandwidth of wavelength (Gbps)

    • \(G _n\): computing resource requirement of baseband function \(n \)

    • \(B _n\): bandwidth requirement between baseband function \(n \) and \(n+1 \)

    • \(T ^{i,j}\): transport latency between CO \(i \) and \(j \)

    • \(T _{oe}\): latency for switching and encapsulation

    • \(T _{HARQ}\): latency requirement for HARQ processing

    • \(T _s\): latency requirement for service request

    • \(K _{s,i}\): service \(s \) accessed into CO \(i \)

    • \(Max \): a large positive value

  2. (2)

    Decision Variables

    • \(Y ^{s,n}_i\): 1, if the function \(n \) of service \(s \) is located in CO \(i \), otherwise 0

    • \(X ^{i,j,w}_{s,n}\): 1, if the function \(n \) and \(n+1 \) of service \(s \) is located in the CO \(i,j \) separately using wavelength \(w \)

    • \(D _i\): 1, if the baseband processing functions (either CU or DU) are placed in CO \(i \), otherwise 0

    • \(H _{s,r}\): 1, if the service \(s \) is processed in the CO \(r \)

  3. (3)

    Objective

    Minimize the number of active COs:

    $$\begin{aligned} \begin{aligned} \min \{\sum \nolimits _{i \in V} {{D_i}} \} \end{aligned} \end{aligned}$$
    (1)
  4. (4)

    Constraints

    Routing:

    $$\begin{aligned} Y_{i}^{s,n}=\left\{ \begin{matrix} K_{s,i},&{}if \ n=1 \\ 1,&{}if \ i=dest,n=N \end{matrix}\right. ,\forall s\in S,i\in V,N \in N \end{aligned}$$
    (2)
    $$\begin{aligned} \begin{aligned} \sum \nolimits _{n \in N,i \in V,w \in W} {X_{s,n}^{i,k,w}} - \sum \nolimits _{n \in N,j \in V,w \in W} {X_{s,n}^{k,j,w}} \\ = \left\{ \begin{matrix} -1,if \ K_{s,k}=1 \\ 1,if \ k = 1 \\ 0,\ otherwise \end{matrix}\right. \forall s \in S,k \in V, l \in L \end{aligned} \end{aligned}$$
    (3)

Baseband function placement:

$$\begin{aligned} \begin{aligned} Y_r^{s,k} \leqslant \sum \nolimits _{j \in V,w \in W,k \leqslant l \leqslant N - 1} {X_{s,n}^{i,j,w}\cdot {M^{r,j}}} \leqslant Y_i^{s,k}, \\ \forall s \in S,k \in N,r \in V,{K_{s,r}} = 1\ \end{aligned} \end{aligned}$$
(4)
$$\begin{aligned} \begin{aligned} 2Y_r^{s,k} \leqslant \sum \nolimits _{i \in V,w \in W,0 \leqslant l \leqslant k - 1} {X_{s,l}^{i,r,w} \cdot {M^{i,r}}} \\ + \sum \nolimits _{j \in V,w \in W,k \leqslant h \leqslant N - 1} {X_{s,h}^{r,j,w} \cdot {M^{r,j}}} \leqslant Y_r^{s,k} + 1, \\ \forall s \in S,k \in N,r \in V,{K_{s,r}} \ne 1 \end{aligned} \end{aligned}$$
(5)
$$\begin{aligned} {D_i} \leqslant \sum \nolimits _{s \in S,n \in N} {Y_i^{s,n}} \leqslant Max \cdot {D_i},{} {} {} \forall i \in V\ \end{aligned}$$
(6)
$$\begin{aligned} \sum \nolimits _{i \in V} {Y_i^{s,n}} = 1,\forall s \in S,n \in N\ \end{aligned}$$
(7)
$$\begin{aligned} {H_{s,r}} \leqslant \sum \nolimits _{n \in N} {Y_r^{s,n}} \leqslant Max \cdot {H_{s,r}},\forall s \in S,r \in V \end{aligned}$$
(8)

Capacity:

$$\begin{aligned} \sum \nolimits _{s \in S,n \in N} {{G_n} \cdot Y_i^{s,n}} \leqslant C_{p}^i,\forall i \in V\ \end{aligned}$$
(9)
$$\begin{aligned} \sum \nolimits _{s \in S,n \in N} {{B_n} \cdot X_{s,n}^{i,j,w}} \leqslant {C_w},\forall i,j \in V,i \ne j,w \in W\ \end{aligned}$$
(10)

Latency:

$$\begin{aligned} \sum \nolimits _{i,j \in V,w \in W} {{T^{i,j}} \cdot X_{s,0}^{i,j,w}} \leqslant {T_{HARQ}},\forall s \in S \end{aligned}$$
(11)
$$\begin{aligned} \begin{aligned} \sum \nolimits _{i,j \in V,n \in N,w \in W} {X_{s,n}^{i,j,w} \cdot {T^{i,j}}} + \sum \nolimits _{r \in V} {{H_{s,r}} \cdot {T_{oe}}} \leqslant {T_s}, \\ \forall s \in S \end{aligned} \end{aligned}$$
(12)

Additional bandwidth capacity constraint for overlay network:

$$\begin{aligned} \sum \nolimits _{s \in S,n \in N} {X_{s,n}^{i,j,w}} \leqslant 1,\forall i,j \in V,w \in W\ \end{aligned}$$
(13)

Equations (2), (3) enable the routing of requests over the lightpaths, the source/ destination of a request is given and the baseband functions are flexibly placed in the intermediate nodes. Equations (4) and (5) restricts the services starting from the source node of a request and the links between the intermediate nodes. Equation (6) indicate that if the CO i is active. Equation (7) ensures that one baseband processing entity must be located in only one CO. Equation (8) indicates that if the service is processed in the CO r. Equations (9 and 10) is the capacity constraints of processing and bandwidth. Equation (11) ensures that the placement of DU is restricted by the HARQ processing latency. Equation (12) is for the service latency. Equation (13) is the additional bandwidth constraint for the overlay solution.

Table 1. No. users of the considered geotypes

5 Illustrative Numerical Results

5.1 Evaluation Settings

We consider a 15-node WDM metro/aggregation network topology as shown in Fig. 4, under three different geographical type areas (geotypes) - Dense urban, Urban, and Sub-urban - with 32 cell sites distributed in that area. The total area of cell sites for different geotypes and the total number of users in the corresponding area are shown in Table 1 [18]. For the cell site, we assume a radio configuration with 20 MHz, \(2\times 2\) MIMO antenna, 64QAM modulation scheme and full system load, and the reference value for processing complexity [19] of different baseband entities and bandwidth requirements [13] of different network segments for the mentioned radio configuration is shown in Table 2. We assume that the resource blocks of a cell are uniformly allocated to all users, that are normally distributed in the whole area, so the processing complexity and bandwidth requirement is equally divided by all the users associated with the same cell site. For the transport network, the COs in different stages are equipped with different levels of baseband processing and switching capacity. COs are organized in a ring topology and are connected via bidirectional monofiber links, carrying W wavelengths at 10 Gb/s. For the latency constraint, according to [17], maximum HARQ latency (RU-DU) is set to be 246 \(\upmu \)s and maximum service latency is set to be 1000 \(\upmu \)s. When below 246 \(\upmu \)s, the HARQ latency requirement and service latency requirement are equal, and when above 246 \(\upmu \)s, the HARQ latency is fixed at 246 \(\upmu \)s. We define a performance metric R, called consolidation factor, which is used to evaluate multiplexing gain of function in our simulation. \(R=N_{co}/N_{cs}\), \(N_{co}\) is the numbers of active COs and cell sites, \(N_{cs}\) is the number of cell sites. R=1 indicates no consolidation, that is all the baseband functions are located in the cell sites, whereas 1/\(N_{cs}\) indicates the highest consolidation degree.

Table 2. The value of processing complexity (GOPS) and bandwidth requirement (Gbps) with radio configuration of 20 MHz, \(2\times 2\) MIMO, 64QAM (Downlink)
Fig. 4.
figure 4

(a) R value for increasing \(T_s\), and different RAN architectures (2-layer vs 3-layer, Q = 75%); (b) R values for increasing Q under different geotypes (overlay case; latency = 800 \(\upmu \)s); (c) R values for the increasing W (OTN case; Q = 50%)

5.2 Evaluation Results

Figure 4(a) shows the relationship between R and service latency requirement \(T_s\), for the 2-layer and 3-layer RAN architecture (left and right bars, respectively), when considering overlay transport technology, urban geotype case and actual processing capacity set to be 75 % of maximum processing capacity. The value of R represented as the sum of four contributions, corresponding to consolidation degree in each stage of the network. With the increase of \(T_s\), CUs and DUs can be consolidated in fewer COs to minimize the number of active COs, due to the fact that less stringent latency requirement allows the services to route along a longer path and end up in the COs of higher stages. For example, when the \(T_s\) \(\ge \) 600 \(\upmu \)s, there is no baseband function located in the cell sites (red color). We can also find that R value of 3-layer RAN architecture is less than the one of 2-layer RAN architecture, because flexible functional split divides the traditional BBU into multiple parts which can be deployed into COs more flexibly.

Figure 4(b) shows R as a function of Q = GOPS\(_a\)/GOPS\(_m\) for the different geotypes and RAN architectures when considering \(T_s = 800\,\upmu \)s and overlay case. GOPS\(_a\)/GOPS\(_m\) is the total actual processing capacity of all the COs divided by the maximum processing capacity in the networks. We can observe that higher consolidation is obtained when Q is increasing, especially for the suburban case. Because with the increasing processing capacity of COs, the DU and CU can be finally placed in the main or core COs when the latency requirement is relatively relaxed. For example, when Q goes near to 1, the value of R reaches the lowest level among almost all the scenarios. Note also that, due to the lower number of requests in the suburban area, the DUs/CUs are more easily deployed in the COs of higher stage. From this result, we can also find that 3-layer architecture benefits more in terms of consolidation that 2-layer architecture because of the flexibility in placement.

Figure 4(c) shows the relationship between R and network capacity in terms of number of wavelengths W. We can find that the relation between R and latency depends on the bandwidth constraint. When the number of wavelengths is limited, no matter how loose the maximum latency requirement is, consolidation factor R will not decrease when it arrives at a critical point. This can be explained since when if enough wavelengths are provided, the provision of baseband functions can be more flexible; whereas if the bandwidth between different COs is limited, the location of baseband functions tends to be closer to the cell sites.

6 Conclusion

In this work, we have modeled an optimized DU/CU placement problem for C-RAN deployment over a WDM metro/aggregation network and formalized it into an ILP model. Compared to the original 2-layer RAN architecture, we proved that the 3-layer RAN architecture has higher consolidation of baseband functions thanks to the increased placement flexibility. We also observed that: (1) looser latency constraint leads to a high degree consolidation of baseband functions; (2) the processing capacity of COs also influences the consolidation of baseband functions; (3) adopting overlay transport solution can lead to a higher baseband function consolidation. In this work, the functional split options between RU, DU and CU are fixed. In the future, we will investigate the DU/CU placement problem when the functional split options is flexible, so the relation between flexible DU/CU placement and flexible functional split option needs to be jointly evaluated. Also, the heuristics for this problem will be proposed for the realistic scenario with larger network topology. Moreover, compared to the static DU/CU placement, dynamic DU/CU allocation problem according to real-time service requests from mobile users is worthy to be investigated.