Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Initial tensor construction and dependence of the tensor renormalization group on initial tensors

Katsumasa Nakayama katsumasa.nakayama@riken.jp RIKEN Center for Computational Science, Kobe, 650-0047, Japan    Manuel Schneider National Yang Ming Chiao Tung University (NYCU), Hsinchu, 30010, Taiwan
(19 July 2024)
Abstract

We propose a method to construct a tensor network representation of partition functions without singular value decompositions nor series expansions. The approach is demonstrated for one- and two-dimensional Ising models and we study the dependence of the tensor renormalization group (TRG) on the form of the initial tensors and their symmetries. We further introduce variants of several tensor renormalization algorithms. Our benchmarks reveal a significant dependence of various TRG algorithms on the choice of initial tensors and their symmetries. However, we show that the boundary TRG technique can eliminate the initial tensor dependence for all TRG methods. The numerical results of TRG calculations can thus be made significantly more robust with only a few changes in the code. Furthermore, we study a three-dimensional 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory without gauge-fixing and confirm the applicability of the initial tensor construction. Our method can straightforwardly be applied to systems with longer range and multi-site interactions, such as the next-nearest neighbor Ising model.

I Introduction

Since its introduction about two decades ago Levin and Nave (2007), the tensor renormalization group (TRG) method was widely applied to statistical physics problems, including quantum field theories such as the CP(1) model Nakayama et al. (2022), the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory Liu et al. (2013); Kuramashi and Yoshimura (2019), the Schwinger model Shimizu and Kuramashi (2014a, b, 2018), and many more Yu et al. (2014); Zou et al. (2014); Yang et al. (2016); Takeda and Yoshimura (2015); Yoshimura et al. (2018); Bazavov et al. (2019); Kuramashi and Yoshimura (2020); Hirasawa et al. (2021); Akiyama and Kadoh (2021); Yosprakob et al. (2023); Akiyama et al. (2024); Yosprakob and Okunishi (2024). The partition function is written in the form of a tensor network. Then, the partition function itself and other physical quantities can be calculated by contracting the tensor network, which means summing over all its indices. This is only possible using approximate methods which truncate the exponential growths of information with increasing the system size. The tensor network is contracted in subsequent coarse-graining steps. In each step a truncation is applied, typically making use of a singular value decomposition (SVD). Since common algorithms coarse-grain the lattice in one direction only, the directions are exchanged after each step. The way this change is done affects the accuracy of the method and should therefore be carefully chosen. We discuss this effect in more details in App. G. Finally, physical quantities are extracted from the trace of the coarse-grained tensors. Since the TRG is free of sampling problems Nakayama et al. (2022); Shimizu and Kuramashi (2014b, 2018); Yang et al. (2016); Takeda and Yoshimura (2015); Kuramashi and Yoshimura (2020); Hirasawa et al. (2021); Yosprakob et al. (2023); Shimizu and Kuramashi (2014b), we can study systems for which Monte Carlo methods suffer from the sign problem Nagata (2022).

Overview of TRG algorithms.

The TRG was originally introduced by Levin and Nave Levin and Nave (2007), and it was since improved by truncation methods that reduce the numerical costs Halko et al. (2011); Nakamura et al. (2019); Morita et al. (2018); Okanohara (2014). The tensor network renormalization (TNR) additionally introduces disentanglers to improve the accuracy of the TRG Evenbly and Vidal (2015), an idea that originates in the multi-scale entanglement renormalization ansatz (MERA) Jiang et al. (2008). For systems with relatively small volumes, the core TRG (CTRG) can also reduce the computational requirements Lan and Evenbly (2019).

For higher dimensional systems, the TRG was extended to higher-order TRG (HOTRG) Xie et al. (2012). Recently, various alternatives were also studied, such as the anisotropic TRG (ATRG) Adachi et al. (2020), the triad TRG (TTRG) Kadoh and Nakayama (2019), and the minimally decomposed TRG (MDTRG) Nakayama (2023). These methods can reduce the numerical costs, and allow for contractions of three- and higher-dimensional tensor network systems in feasible computational time. We explain several TRG algorithms in appendices D and E

Initial tensor construction.

In order to apply the TRG methods efficiently, we have to represent the physical quantities by a locally connected tensor network. This means that each index only appears on two neighboring tensors. Different geometries can arise for this network, depending on the connectivity of the interactions. We focus on square and cubic lattices. The TRG coarse-grains these lattices to a network with the same geometry and can thus be used iteratively.

Common approaches to construct a locally connected tensor network make use of SVDs or series expansions such as the Taylor expansion Liu et al. (2013); Baumgartner and Wenger (2015); Marchis and Gattringer (2018). The expansion creates new variables, the power indices of each term. These can be used as indices of the initial tensor of the tensor network, by integrating out the original degrees of freedom. We give two examples of this construction in sections III and IV.

However, the choice of the initial tensors describing a given system is not unique. We propose another approach to construct the tensor network, based on a trivial decomposition with an identity matrix. The procedure does not require problem-specific and more involved decompositions, expansions, and variable transformations from spin indices to new tensor indices. We consider the spin indices as the indices of the initial tensor and localize the network by a matrix decomposition without approximations, inserting an identity matrix. This method generally generates a local tensor network representation for any theory which can be described by a translationally invariant Lagrangian or Hamiltonian. However, the index dimension can be large, depending on the dimension of the local degrees of freedom and the range of the interaction. The method is very efficient for local interactions. The general resource scaling is discussed in Sec. V.

Initial tensor dependence of TRG methods.

Since the coarse graining steps include local truncations, the accuracy of TRG algorithms possibly depends on the form of the initial tensors. Although the tensor construction based on the expansion is widely and successfully used for different models, it might not be the optimal choice for a given system and contraction method. We benchmark the accuracy of different TRG algorithms for the two-dimensional Ising model. Our results show that HOTRG-like methods, which use isometries for the coarse-graining step, are highly dependant on the symmetry of the initial tensors. We find that this problem does not apply when isometries in the algorithms are replaced by so-called squeezers Adachi et al. (2020), an idea originating from the boundary TRG Iino et al. (2019). This is possible for any isometry based TRG algorithm. Thus, we suggest to make use of this method in order to remove the dependence on the form of the initial tensors. In this case, our simple construction of the initial tensors leads to the same accuracy as other, more involved or problem-specific techniques. Appendix D discusses the technical details of how to implement squeezers in coarse-graining algorithms.

2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory.

As an example for higher dimensional systems, the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory in three spatial dimensions was studied with HOTRG and TRG Kuramashi and Yoshimura (2019). There, the tensor network representation based on a Taylor expansion was used with gauge-fixing. The critical temperature was calculated with high accuracy. However, the representation in Kuramashi and Yoshimura (2019) has two-different tensors. Because the SVD is an optimization of local tensors, a smaller unit cell could generally be preferable. We show the applicability of our tensor network construction to the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory, where only one initial tensor appears in the network. We calculate the free energy, specific heat and critical temperature without gauge-fixing, and find good agreement with previous calculations.

Structure of this paper.

This paper is organized as follows. We introduce our initial tensor construction in Sec. II for the one-dimensional Ising model with next-nearest neighbor interaction (NNNI) as a simple example. We can reproduce the exact solution with our method. After this, we apply the method to the two-dimensional Ising model and study the initial tensor dependence of the TRG and HOTRG in Sec. III. The accuracy of the HOTRG depends on the symmetricity of the initial tensor. In Sec. IV we apply the initial tensor construction method without gauge-fixing to the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory and calculate the free energy and specific heat. Section V explains how the method can be applied to general models and how the index sizes of the initial tensors scale. We conclude our study in Sec. VI.

II One-dimensional Ising model with next-nearest neighbor interactions

We first introduce our method for the one-dimensional Ising model with next-nearest neighbor interactions and periodic boundary conditions as a simple example. The idea has similarities to the initial tensor construction for a particular Ising model on a triangular lattice in Zhao et al. (2010). Our method allows for other interaction terms than local interactions, such as next nearest interactions in the case considered here. The Ising model with NNNI in one spatial dimension with N𝑁Nitalic_N sites can be described by the partition function

Z=σ=±1x=1NTσx,σx+1,σx+2(1d).𝑍subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥1𝑁superscriptsubscript𝑇subscript𝜎𝑥subscript𝜎𝑥1subscript𝜎𝑥21dZ=\sum_{\sigma=\pm 1}\prod_{x=1}^{N}T_{\sigma_{x},\sigma_{x+1},\sigma_{x+2}}^{% \mathrm{(1d)}}.italic_Z = ∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT . (1)

The sum σ=±1subscript𝜎plus-or-minus1\sum_{\sigma=\pm 1}∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT indicates a summation over all combinations of the spins at all sites. The tensor T(1d)superscript𝑇1dT^{\mathrm{(1d)}}italic_T start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT can be constructed with the spin indices σxsubscript𝜎𝑥\sigma_{x}italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT at sites x𝑥xitalic_x and depends on the inverse temperature β𝛽\betaitalic_β and the coupling constants g1subscript𝑔1g_{1}italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and g2subscript𝑔2g_{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT:

Tσx,σx+1,σx+2(1d)eβ(g1σxσx+1+g2σxσx+2).superscriptsubscript𝑇subscript𝜎𝑥subscript𝜎𝑥1subscript𝜎𝑥21dsuperscript𝑒𝛽subscript𝑔1subscript𝜎𝑥subscript𝜎𝑥1subscript𝑔2subscript𝜎𝑥subscript𝜎𝑥2T_{\sigma_{x},\sigma_{x+1},\sigma_{x+2}}^{\mathrm{(1d)}}\equiv e^{-\beta(g_{1}% \sigma_{x}\sigma_{x+1}+g_{2}\sigma_{x}\sigma_{x+2})}.italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT ≡ italic_e start_POSTSUPERSCRIPT - italic_β ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT + italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT . (2)

This formulation does not form a locally connected network: a spin index σxsubscript𝜎𝑥\sigma_{x}italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT at a given site x𝑥xitalic_x occurs on three different tensors (Tσx2,σx1,σx(1d)Tσx1,σx,σx+1(1d)Tσx,σx+1,σx+2(1d)superscriptsubscript𝑇subscript𝜎𝑥2subscript𝜎𝑥1subscript𝜎𝑥1dsuperscriptsubscript𝑇subscript𝜎𝑥1subscript𝜎𝑥subscript𝜎𝑥11dsuperscriptsubscript𝑇subscript𝜎𝑥subscript𝜎𝑥1subscript𝜎𝑥21dT_{\sigma_{x-2},\sigma_{x-1},\sigma_{x}}^{\mathrm{(1d)}}T_{\sigma_{x-1},\sigma% _{x},\sigma_{x+1}}^{\mathrm{(1d)}}T_{\sigma_{x},\sigma_{x+1},\sigma_{x+2}}^{% \mathrm{(1d)}}italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x - 2 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x - 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x - 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT) instead of only two neighboring tensors. Therefore, the partition function in Eq. 1 cannot be used in typical coarse-graining algorithms. We have to find an alternative initial tensor formulation with only locally connected tensors. For example, we want to find initial tensors T(1d)superscript𝑇1dT^{\prime\mathrm{(1d)}}italic_T start_POSTSUPERSCRIPT ′ ( 1 roman_d ) end_POSTSUPERSCRIPT which only depend on two neighboring indices for a one-dimensional system. For this, we first decompose the tensor T(1d)superscript𝑇1dT^{\mathrm{(1d)}}italic_T start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT into A𝐴Aitalic_A and B𝐵Bitalic_B without approximation by introducing a new index a𝑎aitalic_a:

Tσx,σx+1,σx+2(1d)=ax+1=±1Aσx,σx+1ax+1Bσx+2ax+1.superscriptsubscript𝑇subscript𝜎𝑥subscript𝜎𝑥1subscript𝜎𝑥21dsubscriptsubscript𝑎𝑥1plus-or-minus1superscriptsubscript𝐴subscript𝜎𝑥subscript𝜎𝑥1subscript𝑎𝑥1superscriptsubscript𝐵subscript𝜎𝑥2subscript𝑎𝑥1T_{\sigma_{x},\sigma_{x+1},\sigma_{x+2}}^{\mathrm{(1d)}}=\sum_{a_{x+1}=\pm 1}A% _{\sigma_{x},\sigma_{x+1}}^{a_{x+1}}B_{\sigma_{x+2}}^{a_{x+1}}.italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT = ± 1 end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_B start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (3)

We can apply a SVD or other methods for this decomposition, as it was previously done for an Ising model with a magnetic field in Zhao et al. (2010). However, we can also easily construct a tensor of the above form by choosing B𝐵Bitalic_B to be the identity matrix:

Tσx,σx+1,σx+2(1d)=ax+1=±1Tσx,σx+1,ax+1(1d)δσx+2ax+1.superscriptsubscript𝑇subscript𝜎𝑥subscript𝜎𝑥1subscript𝜎𝑥21dsubscriptsubscript𝑎𝑥1plus-or-minus1superscriptsubscript𝑇subscript𝜎𝑥subscript𝜎𝑥1subscript𝑎𝑥11dsuperscriptsubscript𝛿subscript𝜎𝑥2subscript𝑎𝑥1T_{\sigma_{x},\sigma_{x+1},\sigma_{x+2}}^{\mathrm{(1d)}}=\sum_{a_{x+1}=\pm 1}T% _{\sigma_{x},\sigma_{x+1},a_{x+1}}^{\mathrm{(1d)}}\delta_{\sigma_{x+2}}^{a_{x+% 1}}.italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT = ± 1 end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (4)

We then define the localized tensor in terms of the tensors A𝐴Aitalic_A and B𝐵Bitalic_B, where the indices of tensor B𝐵Bitalic_B are shifted by one lattice site compared to Eq. 3:

Tσx,σx+1,ax,ax+1(1d)Aσx,σx+1ax+1Bσx+1ax.superscriptsubscript𝑇subscript𝜎𝑥subscript𝜎𝑥1subscript𝑎𝑥subscript𝑎𝑥11dsuperscriptsubscript𝐴subscript𝜎𝑥subscript𝜎𝑥1subscript𝑎𝑥1superscriptsubscript𝐵subscript𝜎𝑥1subscript𝑎𝑥T_{\sigma_{x},\sigma_{x+1},a_{x},a_{x+1}}^{\prime\mathrm{(1d)}}\equiv A_{% \sigma_{x},\sigma_{x+1}}^{a_{x+1}}B_{\sigma_{x+1}}^{a_{x}}.italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ( 1 roman_d ) end_POSTSUPERSCRIPT ≡ italic_A start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_B start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (5)

Exploiting the translational invariance of the system, the partition function can be rewritten as a locally connected tensor network consisting of these new tensors:

Z=σ=±1a=±1x=1NTσx,σx+1,ax,ax+1(1d).𝑍subscript𝜎plus-or-minus1subscript𝑎plus-or-minus1superscriptsubscriptproduct𝑥1𝑁subscriptsuperscript𝑇1dsubscript𝜎𝑥subscript𝜎𝑥1subscript𝑎𝑥subscript𝑎𝑥1Z=\sum_{\sigma=\pm 1}\sum_{a=\pm 1}\prod_{x=1}^{N}T^{\prime\mathrm{(1d)}}_{% \sigma_{x},\sigma_{x+1},a_{x},a_{x+1}}.italic_Z = ∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_a = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT ′ ( 1 roman_d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (6)

By defining the combined indices [aσ]aσdelimited-[]𝑎𝜎tensor-product𝑎𝜎[a\sigma]\equiv a\otimes\sigma[ italic_a italic_σ ] ≡ italic_a ⊗ italic_σ, we obtain a one-dimensional system with size-four indices:

Z=[aσ]=14x=1NT[aσ]x,[aσ]x+1(1d).𝑍superscriptsubscriptdelimited-[]𝑎𝜎14superscriptsubscriptproduct𝑥1𝑁subscriptsuperscript𝑇1dsubscriptdelimited-[]𝑎𝜎𝑥subscriptdelimited-[]𝑎𝜎𝑥1Z=\sum_{[a\sigma]=1}^{4}\prod_{x=1}^{N}T^{\prime\mathrm{(1d)}}_{[a\sigma]_{x},% [a\sigma]_{x+1}}.italic_Z = ∑ start_POSTSUBSCRIPT [ italic_a italic_σ ] = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_x = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT ′ ( 1 roman_d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_a italic_σ ] start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , [ italic_a italic_σ ] start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (7)

Since T(1d)superscript𝑇1dT^{\prime\mathrm{(1d)}}italic_T start_POSTSUPERSCRIPT ′ ( 1 roman_d ) end_POSTSUPERSCRIPT is a 4×4444\times 44 × 4 matrix, we can easily find its eigenvalues by exact diagonalization. This gives the exact solution of the one-dimensional Ising model with NNNI, which is known from previous studies Pini and Rettori (1993); Taherkhani et al. (2011).

Different ways of constructing the locally connected tensor network lead to different tensors A𝐴Aitalic_A and B𝐵Bitalic_B and thus different Tsuperscript𝑇T^{\prime}italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. In general, we can relate different tensor representations of the same system using a unitary matrix:

Txx(new)k,kUxkTkk(1d)Ukx.subscriptsuperscript𝑇new𝑥superscript𝑥subscript𝑘superscript𝑘subscript𝑈𝑥𝑘subscriptsuperscript𝑇1d𝑘superscript𝑘subscriptsuperscript𝑈superscript𝑘superscript𝑥T^{\mathrm{(new)}}_{xx^{\prime}}\equiv\sum_{k,k^{\prime}}U_{xk}T^{\mathrm{(1d)% }}_{kk^{\prime}}U^{\dagger}_{k^{\prime}x^{\prime}}.italic_T start_POSTSUPERSCRIPT ( roman_new ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≡ ∑ start_POSTSUBSCRIPT italic_k , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_U start_POSTSUBSCRIPT italic_x italic_k end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ( 1 roman_d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_U start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT . (8)

Although the partition function is analytically not changed by this transformation, the numerical accuracy of the coarse-graining steps can depend on the form of T(1d)superscript𝑇1dT^{\prime\mathrm{(1d)}}italic_T start_POSTSUPERSCRIPT ′ ( 1 roman_d ) end_POSTSUPERSCRIPT. This will be confirmed and studied in more detail in Sec. III.

The presented approach can be straightforwardly extended to an interaction with nintsubscript𝑛int{n_{\mathrm{int}}}{}italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT distinct hopping interactions. Since each hopping term introduces a new index that gets combined with the spin index, the matrix size of T(1d)superscript𝑇1dT^{\prime\mathrm{(1d)}}italic_T start_POSTSUPERSCRIPT ′ ( 1 roman_d ) end_POSTSUPERSCRIPT grows as dnint{d}{}^{{n_{\mathrm{int}}}{}}italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT, where d𝑑{d}{}italic_d is the dimension of the spin index. See Sec. V for more details.

One-dimensional models with several interactions are studied in the context of frustrated systems and antiferromagnetism Guimaraes and Plascak (2002); Taherkhani et al. (2011); Jurcisinoca and Jurcisin (2014); Karlova et al. (2018); Kassan-Ogly et al. (2012); Kwek et al. (2009); Niemeijer (1971); Ozerov et al. (2010); Raymond and Wong (2012); Sandvik (2010); Capriotti et al. (2003), and the approach presented here could be useful as a simple candidate to construct a locally connected network. Also, two-dimensional systems are widely studied to understand the phenomena of spin statistical systems Wang et al. (2016); Wang and Sandvik (2018); Richter et al. (2015); Sirker et al. (2006); Yoshiyama and Hukushima (2023); Li and Yang (2021). Our initial tensor construction can be readily applied to these systems. We show the explicit form of the initial tensors for the two-dimensional J1J2subscript𝐽1subscript𝐽2J_{1}-J_{2}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and J1J3subscript𝐽1subscript𝐽3J_{1}-J_{3}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT Ising models in appendices A and B respectively. In addition, we discuss more general systems including higher dimensions and long-range interaction in Sec. V. In principle, our construction can be extended to any dimension, and to various kinds of interaction terms.

III Two-dimensional Ising model and initial tensor dependence of the TRG methods

We use the two-dimensional Ising model with periodic boundary conditions in a volume of N×N𝑁𝑁N\times Nitalic_N × italic_N as a testing ground for the initial tensor dependence of different TRG methods. The partition function is

Z𝑍\displaystyle Zitalic_Z =\displaystyle== σ=±1x,y=1Neβhσx,yeβg2σx,y(σx+1,y+σx,y+1)subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥𝑦1𝑁superscript𝑒𝛽subscript𝜎𝑥𝑦superscript𝑒𝛽𝑔2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦1\displaystyle\sum_{\sigma=\pm 1}\prod_{x,y=1}^{N}e^{\beta h\sigma_{x,y}}e^{% \frac{\beta g}{2}\sigma_{x,y}(\sigma_{x+1,y}+\sigma_{x,y+1})}∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_h italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT divide start_ARG italic_β italic_g end_ARG start_ARG 2 end_ARG italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT (9)
=\displaystyle== σ=±1x,y=1NKσx,y,σx+1,y,σx,y+1,subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥𝑦1𝑁subscript𝐾subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦1\displaystyle\sum_{\sigma=\pm 1}\prod_{x,y=1}^{N}K_{\sigma_{x,y},\sigma_{x+1,y% },\sigma_{x,y+1}},∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (10)

with the spin indices σx,ysubscript𝜎𝑥𝑦\sigma_{x,y}italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT at sites {x,y}𝑥𝑦\{x,y\}{ italic_x , italic_y }, the coupling constant g𝑔gitalic_g and the external field hhitalic_h. In our numerical studies, we set g=1𝑔1g=1italic_g = 1, h=00h=0italic_h = 0, and β=βc=12ln(1+2)𝛽subscript𝛽𝑐12ln12\beta=\beta_{c}=\frac{1}{2}\mathrm{ln}(1+\sqrt{2})italic_β = italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_ln ( 1 + square-root start_ARG 2 end_ARG ), which is the critical value Onsager (1944); Duminil-Copin (2022).

Initial tensor construction with shifted delta-functions.

The representation by the tensor K𝐾Kitalic_K is not a two-dimensional locally connected tensor network where the same index would only occur on two neighboring tensors. Thus, this formulation can not directly be used for the numerical evaluation of the partition function through coarse-graining algorithms. We can construct a suitable network by inserting a delta function,

Z=a=±1σ=±1x,y=1NKσx,y,σx+1,y,ax,y,ax,y+1(delta),𝑍subscript𝑎plus-or-minus1subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥𝑦1𝑁subscriptsuperscript𝐾deltasubscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝑎𝑥𝑦subscript𝑎𝑥𝑦1Z=\sum_{a=\pm 1}\sum_{\sigma=\pm 1}\prod_{x,y=1}^{N}K^{(\mathrm{delta})}_{% \sigma_{x,y},\sigma_{x+1,y},a_{x,y},a_{x,y+1}},italic_Z = ∑ start_POSTSUBSCRIPT italic_a = ± 1 end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (11)

where

Kσx,y,σx+1,y,ax,y,ax,y+1(delta)Kσx,y,σx+1,y,ax,y+1δσx,yax,y.subscriptsuperscript𝐾deltasubscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝑎𝑥𝑦subscript𝑎𝑥𝑦1subscript𝐾subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝑎𝑥𝑦1superscriptsubscript𝛿subscript𝜎𝑥𝑦subscript𝑎𝑥𝑦K^{(\mathrm{delta})}_{\sigma_{x,y},\sigma_{x+1,y},a_{x,y},a_{x,y+1}}\equiv K_{% \sigma_{x,y},\sigma_{x+1,y},a_{x,y+1}}\delta_{\sigma_{x,y}}^{a_{x,y}}.italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≡ italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (12)

Similarly, other matrix decompositions like SVD or QR could be used instead of inserting a delta function. Again, we made use of the translational invariance of the system and obtained a locally connected tensor network.

Initial tensor construction based on Taylor expansion.

Previously, a different form of the initial tensor with h=00h=0italic_h = 0 was derived as in Liu et al. (2013); Zhao et al. (2010); Xie et al. (2012). We explain it in the following as a reference to compare our method to. We consider the Taylor expansion of a two site interaction in the Boltzmann weight. Because the square of a spin variable is the identity, only two factors in the expansion arise, which can be rewritten as a matrix multiplication:

e(βg/2)σnσn+1superscript𝑒𝛽𝑔2subscript𝜎𝑛subscript𝜎𝑛1\displaystyle e^{(\beta g/2)\sigma_{n}\sigma_{n+1}}italic_e start_POSTSUPERSCRIPT ( italic_β italic_g / 2 ) italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
=\displaystyle== cosh(βg/2)+σnσn+1sinh(βg/2)cosh𝛽𝑔2subscript𝜎𝑛subscript𝜎𝑛1sinh𝛽𝑔2\displaystyle\mathrm{cosh}(\beta g/2)+\sigma_{n}\sigma_{n+1}\mathrm{sinh}(% \beta g/2)roman_cosh ( italic_β italic_g / 2 ) + italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT roman_sinh ( italic_β italic_g / 2 )
=\displaystyle== l=01(σnlcosh(βg/2)1lsinh(βg/2)l\displaystyle\sum_{l=0}^{1}\Bigg{(}\sigma_{n}^{l}\sqrt{\mathrm{cosh}(\beta g/2% )}^{1-l}\sqrt{\mathrm{sinh}(\beta g/2)}^{l}∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT square-root start_ARG roman_cosh ( italic_β italic_g / 2 ) end_ARG start_POSTSUPERSCRIPT 1 - italic_l end_POSTSUPERSCRIPT square-root start_ARG roman_sinh ( italic_β italic_g / 2 ) end_ARG start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT
×σn+1lcosh(βg/2)1lsinh(βg/2)l)\displaystyle\times\sigma_{n+1}^{l}\sqrt{\mathrm{cosh}(\beta g/2)}^{1-l}\sqrt{% \mathrm{sinh}(\beta g/2)}^{l}\Bigg{)}× italic_σ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT square-root start_ARG roman_cosh ( italic_β italic_g / 2 ) end_ARG start_POSTSUPERSCRIPT 1 - italic_l end_POSTSUPERSCRIPT square-root start_ARG roman_sinh ( italic_β italic_g / 2 ) end_ARG start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT )
=\displaystyle== l=01Wσn,lWσn+1,l.superscriptsubscript𝑙01subscript𝑊subscript𝜎𝑛𝑙subscript𝑊subscript𝜎𝑛1𝑙\displaystyle\sum_{l=0}^{1}W_{\sigma_{n},l}W_{\sigma_{n+1},l}.∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_l end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT , italic_l end_POSTSUBSCRIPT . (13)

The matrix W𝑊Witalic_W is defined as

W=(cosh(βg/2)sinh(βg/2)cosh(βg/2)sinh(βg/2)),𝑊matrixcosh𝛽𝑔2sinh𝛽𝑔2cosh𝛽𝑔2sinh𝛽𝑔2W=\begin{pmatrix}\sqrt{\mathrm{cosh}(\beta g/2)}&{\sqrt{\mathrm{sinh}(\beta g/% 2)}}\\ {\sqrt{\mathrm{cosh}(\beta g/2)}}&{-\sqrt{\mathrm{sinh}(\beta g/2)}}\end{% pmatrix},italic_W = ( start_ARG start_ROW start_CELL square-root start_ARG roman_cosh ( italic_β italic_g / 2 ) end_ARG end_CELL start_CELL square-root start_ARG roman_sinh ( italic_β italic_g / 2 ) end_ARG end_CELL end_ROW start_ROW start_CELL square-root start_ARG roman_cosh ( italic_β italic_g / 2 ) end_ARG end_CELL start_CELL - square-root start_ARG roman_sinh ( italic_β italic_g / 2 ) end_ARG end_CELL end_ROW end_ARG ) , (14)

where the first row corresponds to σ=1𝜎1\sigma=-1italic_σ = - 1, and the second to σ=+1𝜎1\sigma=+1italic_σ = + 1. We see that the exponential of the two-site interaction can be decomposed into two W𝑊Witalic_W matrices, introducing a new index l𝑙litalic_l. Including the interaction terms in the orthogonal spatial direction, we get the initial tensor

Klx,y,lx+1,y,mx,y,mx,y+1(exp)subscriptsuperscript𝐾expsubscript𝑙𝑥𝑦subscript𝑙𝑥1𝑦subscript𝑚𝑥𝑦subscript𝑚𝑥𝑦1\displaystyle K^{(\mathrm{exp})}_{l_{x,y},l_{x+1,y},m_{x,y},m_{x,y+1}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_l start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=αWα,lx,yWα,lx+1,yWα,mx,yWα,mx,y+1.absentsubscript𝛼subscript𝑊𝛼subscript𝑙𝑥𝑦subscript𝑊𝛼subscript𝑙𝑥1𝑦subscript𝑊𝛼subscript𝑚𝑥𝑦subscript𝑊𝛼subscript𝑚𝑥𝑦1\displaystyle=\sum_{\alpha}W_{\alpha,l_{x,y}}W_{\alpha,l_{x+1,y}}W_{\alpha,m_{% x,y}}W_{\alpha,m_{x,y+1}}.= ∑ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_α , italic_l start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_α , italic_l start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_α , italic_m start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_α , italic_m start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (15)

This tensor is symmetric under permutation of any indices.

Initial tensor dependence of TRG algorithms.

We test the dependence of the coarse-graining methods on the initial tensors by using K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT, K(exp)superscript𝐾expK^{(\mathrm{exp})}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT, and the symmetrized tensor K(sym)superscript𝐾symK^{(\mathrm{sym})}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT. The latter is obtained from K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT by a gauge transformation on the tensor indices, in order to make the tensor nearly symmetric under permutation of its indices. This symmetrization is explained in App. C. Each SVD in the coarse-graining step is truncated to a maximum bond size D𝐷Ditalic_D to prevent the exponential growth of the index sizes. We apply 𝒪(D6)𝒪superscript𝐷6{\mathcal{O}}(D^{6})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT ) TRG Levin and Nave (2007), 𝒪(D7)𝒪superscript𝐷7{\mathcal{O}}(D^{7})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT ) HOTRG Xie et al. (2012), 𝒪(D5)𝒪superscript𝐷5{\mathcal{O}}(D^{5})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ) ATRG Adachi et al. (2020), 𝒪(D5)𝒪superscript𝐷5{\mathcal{O}}(D^{5})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ) MDTRG without internal line oversampling Nakayama (2023), and 𝒪(D7)𝒪superscript𝐷7{\mathcal{O}}(D^{7})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT ) boundary TRG for HOTRG (b-HOTRG) Iino et al. (2019) for a system size of V=220𝑉superscript220V=2^{20}italic_V = 2 start_POSTSUPERSCRIPT 20 end_POSTSUPERSCRIPT.

In this section we discuss the initial tensor dependence of the TRG, HOTRG, and b-HOTRG. The details of the algorithms and further benchmarks for the other TRG methods can be found in App. E.

Refer to caption
Figure 1: Dependence of the TRG and HOTRG methods on the form of the initial tensors for different cutoff bond dimensions D𝐷Ditalic_D in the two-dimensional Ising model. Shown are the relative errors of the free energy for the asymmetric initial tensor K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT, the symmetric tensor K(exp)superscript𝐾expK^{(\mathrm{exp})}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT, and the symmetrized tensor K(sym)superscript𝐾symK^{\mathrm{(sym)}}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT. See main text for details.
Refer to caption
Figure 2: Dependence of the HOTRG and boundary-HOTRG methods on the form of the initial tensors for different cutoff bond dimensions D𝐷Ditalic_D in the two-dimensional Ising model. Shown are the relative errors of the free energy for the asymmetric initial tensor K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT and the symmetric tensor K(exp)superscript𝐾expK^{(\mathrm{exp})}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT. The initial tensor dependence can be removed by introducing squeezers in the boundary HOTRG method. See main text and App. D for details.

We calculate the free energy F(lnZ)/(βV)𝐹ln𝑍𝛽𝑉F\equiv-(\mathrm{ln}Z)/(\beta V)italic_F ≡ - ( roman_ln italic_Z ) / ( italic_β italic_V ) and compare it to the exact value Kaufman (1949). Figure 1 shows the error of the free energy for the TRG and HOTRG methods. We find that the accuracy of the original TRG method does not depend on the choice of the initial tensor. As in previous studies Xie et al. (2012), the HOTRG has a better accuracy than TRG if the symmetric tensor K(exp)superscript𝐾expK^{(\mathrm{exp})}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT is used. The same holds for the symmetrized tensor K(sym)superscript𝐾symK^{(\mathrm{sym})}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT. However, this is not true anymore for the asymmetric initial tensor K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT, where the accuracy is lowered significantly. We study the symmetry dependence in more detail in App. C and find that the original TRG does generally not depend on the symmetry of the initial tensors, while HOTRG becomes more and more unreliable the less symmetric the initial tensors are.

Removing the initial tensor dependence by boundary TRG techniques.

The HOTRG results for the asymmetric initial tensor can be improved by applying the boundary TRG method Iino et al. (2019). As shown in Fig. 2, this boundary HOTRG method produces results with the same accuracy as HOTRG for a symmetric initial tensor, but does so even if an asymmetric initial tensor K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT is used. The boundary HOTRG differs from the simple HOTRG by the details of the coarse graining steps: simple HOTRG uses an isometry U(HOTRG)superscript𝑈HOTRGU^{(\mathrm{HOTRG})}italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT, while the boundary HOTRG introduces squeezers P1(bHOTRG)superscriptsubscript𝑃1bHOTRGP_{1}^{(\mathrm{bHOTRG})}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT and P2(bHOTRG)superscriptsubscript𝑃2bHOTRGP_{2}^{(\mathrm{bHOTRG})}italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT for the coarse-graining. See App. D for details.

Similar results are found for ATRG and MDTRG as shown in App. E. We observe that ATRG and MDTRG with squeezers similar to the boundary TRG have no dependence on the form of the initial tensors, while coarse graining methods using isometries similar to the simple HOTRG strongly depend on it.

Overview of TRG methods and their initial tensor dependencies.

We give a summary of the different coarse graining methods, their costs and their dependence on the initial tensors in table 1. The different methods can be categorized into three classes. The first category uses no isometries but replaces tensors directly by their SVD representations, or by the projectors introduced in the boundary TRG method Iino et al. (2019) (TRG, b-HOTRG). We call these projectors squeezers as in Adachi et al. (2020) because they are not always projectors in the mathematical sense. We indicate this class of algorithms as sqz in table 1. The second category consist of methods which use the index of an isometry as a new index in the next coarse graining step (HOTRG-like). This is denoted as iso in table 1. Finally, the third class consists of methods which use isometries for intermediate approximate contractions, but the indices of the isometries are not used as new indices of the coarse-grained tensors. We denote these methods as iso*.

Costs Trun. Dep. |1F(D=30)/Fex|1𝐹𝐷30subscript𝐹ex|1-F(D=30)/F_{\mathrm{ex}}|| 1 - italic_F ( italic_D = 30 ) / italic_F start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT |
TRG Levin and Nave (2007) 𝒪(D6)𝒪superscript𝐷6{\mathcal{O}}(D^{6})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT ) sqz --- - 𝒪(106)similar-toabsent𝒪superscript106\sim{\mathcal{O}}(10^{-6})∼ caligraphic_O ( 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT )
HOTRG Xie et al. (2012) 𝒪(D4dim1)𝒪superscript𝐷4𝑑𝑖𝑚1{\mathcal{O}}(D^{4{dim}-1})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 4 italic_d italic_i italic_m - 1 end_POSTSUPERSCRIPT ) iso +++++ + 𝒪(105108)𝒪similar-tosuperscript105superscript108{\mathcal{O}}(10^{-5}\sim 10^{-8})caligraphic_O ( 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT ∼ 10 start_POSTSUPERSCRIPT - 8 end_POSTSUPERSCRIPT )
b-HOTRG Iino et al. (2019) sqz --- - 𝒪(108)similar-toabsent𝒪superscript108\sim{\mathcal{O}}(10^{-8})∼ caligraphic_O ( 10 start_POSTSUPERSCRIPT - 8 end_POSTSUPERSCRIPT )
ATRG Adachi et al. (2020) 𝒪(D2dim+1)𝒪superscript𝐷2𝑑𝑖𝑚1{\mathcal{O}}(D^{2{dim}+1})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 2 italic_d italic_i italic_m + 1 end_POSTSUPERSCRIPT ) sqz --- - 𝒪(107)similar-toabsent𝒪superscript107\sim{\mathcal{O}}(10^{-7})∼ caligraphic_O ( 10 start_POSTSUPERSCRIPT - 7 end_POSTSUPERSCRIPT )
Iso-ATRG Adachi et al. (2020) iso +++++ + 𝒪(105106)𝒪similar-tosuperscript105superscript106{\mathcal{O}}(10^{-5}\sim 10^{-6})caligraphic_O ( 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT ∼ 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT )
sh-ATRG sqz -- 𝒪(107)similar-toabsent𝒪superscript107\sim{\mathcal{O}}(10^{-7})∼ caligraphic_O ( 10 start_POSTSUPERSCRIPT - 7 end_POSTSUPERSCRIPT )
sh-Iso-ATRG iso* -- 𝒪(106)similar-toabsent𝒪superscript106\sim{\mathcal{O}}(10^{-6})∼ caligraphic_O ( 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT )
MDTRG Nakayama (2023) 𝒪(Ddim+3)𝒪superscript𝐷𝑑𝑖𝑚3{\mathcal{O}}(D^{{dim}+3})caligraphic_O ( italic_D start_POSTSUPERSCRIPT italic_d italic_i italic_m + 3 end_POSTSUPERSCRIPT ) iso +++++ + 𝒪(105107)𝒪similar-tosuperscript105superscript107{\mathcal{O}}(10^{-5}\sim 10^{-7})caligraphic_O ( 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT ∼ 10 start_POSTSUPERSCRIPT - 7 end_POSTSUPERSCRIPT )
sh-MDTRG iso* -- 𝒪(106)similar-toabsent𝒪superscript106\sim{\mathcal{O}}(10^{-6})∼ caligraphic_O ( 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT )
b-MDTRG sqz --- - 𝒪(107)similar-toabsent𝒪superscript107\sim{\mathcal{O}}(10^{-7})∼ caligraphic_O ( 10 start_POSTSUPERSCRIPT - 7 end_POSTSUPERSCRIPT )
Table 1: Properties of different TRG coarse graining methods. 2nd column: numerical costs; D𝐷Ditalic_D is the bond dimension and dim𝑑𝑖𝑚{dim}italic_d italic_i italic_m the spacetime-dimension. 3rd column: truncation method; iso stands for isometries which are used to create the coarse-grained indices; iso* means that isometries are used for intermediate approximate contractions, but they do not create the new indices of the coarse-grained tensors directly; sqz denotes all other methods, so either the squeezers from boundary TRG Iino et al. (2019) (see main text and App. D), or a simple contraction and singular value decomposition. 4th column: dependence on the initial tensors; --- - stands for no dependence, -- for a slight but not significant dependence, +++++ + for strong dependence; 5th column: relative error for a bond dimension of D=30𝐷30D=30italic_D = 30 for the two-dimensional critical Ising model compared to the exact energy; this gives an estimate of the accuracy, but note that different methods scale differently in the bond dimension. See App. E for more details on algorithms and benchmarks.

From our calculations in Figs. 1, 2 and E we conclude that coarse graining methods making use of isometries to create the new indices (iso), such as the simple HOTRG, depend strongly on their symmetry properties. This was also found for the massless Schwinger model with a different approach Butt et al. (2020). The isometries can always be replaced by squeezers as introduced for the boundary TRG. We suggest using these boundary TRG techniques, which can remove the dependence on the initial tensor symmetries and make the algorithm more robust (sqz). In that case, our tensor construction provides a simple and generic way to represent the partition function as a locally connected tensor network, without loss of accuracy in numerical calculations compared to other construction techniques.

Dependence on the index exchange type.

We also found a dependence of TRG algorithms on the way the index-directions are exchanged after each coarse graining step. From App. G we conclude that the exchange of directions should ideally allow the initial SVDs in a coarse graining step to split tensors along the contraction direction in the previous step. For the algorithms in this paper, this means that a rotation in clockwise or counterclockwise direction is better suited for shifted TRG methods. For non-shifted methods, a flip xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y (xysuperscript𝑥superscript𝑦x^{\prime}\leftrightarrow y^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ↔ italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT) leads to similar or better results. It replaces the x𝑥xitalic_x-index in negative (positive) x𝑥xitalic_x-direction with a corresponding y𝑦yitalic_y-index. We found that these flips lead to inaccurate results for the shifted methods and an accumulation of systematic errors. Therefore, the type of index exchange should be carefully checked for the TRG method used. In our benchmarks and numerical results we always apply the optimal exchange between directions, which is a rotation for shifted and a flip for non-shifted methods.

IV 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory

The three-dimensional 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory was studied in Liu et al. (2013); Kuramashi and Yoshimura (2019) using HOTRG and TRG. The partition function can be written as

Z=23Vσ=±1n,μ>νeβσn,μσn+μ^,νσn+ν^,μσn,ν,𝑍superscript23𝑉subscript𝜎plus-or-minus1subscriptproduct𝑛𝜇𝜈superscript𝑒𝛽subscript𝜎𝑛𝜇subscript𝜎𝑛^𝜇𝜈subscript𝜎𝑛^𝜈𝜇subscript𝜎𝑛𝜈Z=2^{-3V}\sum_{\sigma=\pm 1}\prod_{n,\mu>\nu}e^{-\beta\sigma_{n,\mu}\sigma_{n+% \hat{\mu},\nu}\sigma_{n+\hat{\nu},\mu}\sigma_{n,\nu}},italic_Z = 2 start_POSTSUPERSCRIPT - 3 italic_V end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_n , italic_μ > italic_ν end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_β italic_σ start_POSTSUBSCRIPT italic_n , italic_μ end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_μ end_ARG , italic_ν end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_ν end_ARG , italic_μ end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n , italic_ν end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , (16)

where we introduce the link variables σn,μsubscript𝜎𝑛𝜇\sigma_{n,\mu}italic_σ start_POSTSUBSCRIPT italic_n , italic_μ end_POSTSUBSCRIPT at site n𝑛nitalic_n with direction μ𝜇\muitalic_μ. The unit vector in μ𝜇\muitalic_μ direction is represented by μ^^𝜇\hat{\mu}over^ start_ARG italic_μ end_ARG. The interaction corresponds to a spin system where each spin interacts with its nearest and next-nearest neighbors in a four-site interaction, known as plaquette-term. A schematic picture of the three plaquette terms in the three directions can be seen as black lines in Fig. 3.

Initial tensor construction based on Taylor expansion.

In Kuramashi and Yoshimura (2019), the authors used a representation based on the Taylor expansion similar to Eq. 13:

eβσn,μσn+μ^,νσn+ν^,μσn,νsuperscript𝑒𝛽subscript𝜎𝑛𝜇subscript𝜎𝑛^𝜇𝜈subscript𝜎𝑛^𝜈𝜇subscript𝜎𝑛𝜈\displaystyle e^{\beta\sigma_{n,\mu}\sigma_{n+\hat{\mu},\nu}\sigma_{n+\hat{\nu% },\mu}\sigma_{n,\nu}}italic_e start_POSTSUPERSCRIPT italic_β italic_σ start_POSTSUBSCRIPT italic_n , italic_μ end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_μ end_ARG , italic_ν end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_ν end_ARG , italic_μ end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n , italic_ν end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
=coshβp=01(tanhβ)p(σn,μσn+μ^,νσn+ν^,μσn,ν)p.absentcosh𝛽superscriptsubscript𝑝01superscripttanh𝛽𝑝superscriptsubscript𝜎𝑛𝜇subscript𝜎𝑛^𝜇𝜈subscript𝜎𝑛^𝜈𝜇subscript𝜎𝑛𝜈𝑝\displaystyle=\mathrm{cosh}\beta\sum_{p=0}^{1}\left(\mathrm{tanh}\beta\right)^% {p}\left(\sigma_{n,\mu}\sigma_{n+\hat{\mu},\nu}\sigma_{n+\hat{\nu},\mu}\sigma_% {n,\nu}\right)^{p}.= roman_cosh italic_β ∑ start_POSTSUBSCRIPT italic_p = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( roman_tanh italic_β ) start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_n , italic_μ end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_μ end_ARG , italic_ν end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_ν end_ARG , italic_μ end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_n , italic_ν end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT . (17)

In this previous work, a gauge-fixing was applied to simplify the tensor network representation. However, for gauge theories on the lattice in general, numerical calculations with gauge-fixing can suffer from ambiguity of Gribov copies Gribov (1978). Therefore, we do not fix the gauge in our initial tensor constructions and in our numerical calculations.

Following the derivation in Kuramashi and Yoshimura (2019); Liu et al. (2013) but without gauge-fixing, we define the tensors A𝐴Aitalic_A and B𝐵Bitalic_B as

Apqrs=mod(1+p+q+r+s,2)subscript𝐴𝑝𝑞𝑟𝑠mod1𝑝𝑞𝑟𝑠2A_{pqrs}=\mathrm{mod}(1+p+q+r+s,2)italic_A start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT = roman_mod ( 1 + italic_p + italic_q + italic_r + italic_s , 2 ) (18)
Bpqrs=(tanhβ)(p+q+r+s)/4δpqδqrδrs.subscript𝐵𝑝𝑞𝑟𝑠superscripttanh𝛽𝑝𝑞𝑟𝑠4subscript𝛿𝑝𝑞subscript𝛿𝑞𝑟subscript𝛿𝑟𝑠B_{pqrs}=(\mathrm{tanh}\beta)^{(p+q+r+s)/4}\delta_{pq}\delta_{qr}\delta_{rs}.italic_B start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT = ( roman_tanh italic_β ) start_POSTSUPERSCRIPT ( italic_p + italic_q + italic_r + italic_s ) / 4 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_q italic_r end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT . (19)

A combination of six tensors leads to a unit cell tensor T(exp)superscript𝑇expT^{(\mathrm{exp})}italic_T start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT which defines a locally connected tensor network that reproduces the partition function:

T[xX][xX][yY][yY][zZ][zZ](exp)/(coshβ)3subscriptsuperscript𝑇expdelimited-[]𝑥𝑋delimited-[]superscript𝑥superscript𝑋delimited-[]𝑦𝑌delimited-[]superscript𝑦superscript𝑌delimited-[]𝑧𝑍delimited-[]superscript𝑧superscript𝑍superscriptcosh𝛽3absent\displaystyle T^{(\mathrm{exp})}_{[xX][x^{\prime}X^{\prime}][yY][y^{\prime}Y^{% \prime}][zZ][z^{\prime}Z^{\prime}]}/(\mathrm{cosh}\beta)^{3}\equivitalic_T start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_x italic_X ] [ italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] [ italic_y italic_Y ] [ italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] [ italic_z italic_Z ] [ italic_z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] end_POSTSUBSCRIPT / ( roman_cosh italic_β ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ≡
a,b,c,d,e,fAcyZeAfzxbAdYXaBbxycBaXZeBfzYd.subscript𝑎𝑏𝑐𝑑𝑒𝑓subscript𝐴𝑐𝑦𝑍𝑒subscript𝐴𝑓𝑧𝑥𝑏subscript𝐴𝑑𝑌𝑋𝑎subscript𝐵𝑏superscript𝑥superscript𝑦𝑐subscript𝐵𝑎superscript𝑋superscript𝑍𝑒subscript𝐵𝑓superscript𝑧superscript𝑌𝑑\displaystyle\sum_{a,b,c,d,e,f}A_{cyZe}A_{fzxb}A_{dYXa}B_{bx^{\prime}y^{\prime% }c}B_{aX^{\prime}Z^{\prime}e}B_{fz^{\prime}Y^{\prime}d}.∑ start_POSTSUBSCRIPT italic_a , italic_b , italic_c , italic_d , italic_e , italic_f end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_c italic_y italic_Z italic_e end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_f italic_z italic_x italic_b end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_d italic_Y italic_X italic_a end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_b italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_c end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_a italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_e end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_f italic_z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_d end_POSTSUBSCRIPT . (20)

The combination of two indices like [xX]xXdelimited-[]𝑥𝑋tensor-product𝑥𝑋[xX]\equiv x\otimes X[ italic_x italic_X ] ≡ italic_x ⊗ italic_X introduces new spin-3/2 indices for the unit cell tensor.

Note that T(exp)superscript𝑇expT^{\mathrm{(exp)}}italic_T start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT is not symmetric, even if A𝐴Aitalic_A and B𝐵Bitalic_B are completely symmetric in all indices. This differs from the Ising model, where the initial tensors obtained using a Taylor expansion were symmetric. Therefore, the expansion method does not produce better symmetry properties than our method for the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT model. For the Ising model, we found in sections III and E that HOTRG is not well suited for non-symmetric initial tensors, while ATRG does not depend on the symmetry properties. This suggests that ATRG is a better choice for the initial tensors T(delta)superscript𝑇deltaT^{\mathrm{(delta)}}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT and T(exp)superscript𝑇expT^{\mathrm{(exp)}}italic_T start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPTof the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT model. However, the initial tensor dependence was not explicitly checked for the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT model and we use ATRG in all our simulations.

The previous study Kuramashi and Yoshimura (2019) applied further constraints on the tensors A and B to implement a gauge-fixing condition. With this, they could precisely reproduce Monte-Carlo results.

Initial tensor construction with shifted delta-functions.

In the following, we construct another tensor network for the same model using the method introduced in Sec. II. We do not need a Taylor expansion, do not make use of the spin property σ2=1superscript𝜎21\sigma^{2}=1italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 1, and we keep the gauge unfixed. For a simpler notation, we define the indices

xk^subscript𝑥^𝑘absent\displaystyle x_{\hat{k}}\equivitalic_x start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ≡ σn+k^,μ=0subscript𝜎𝑛^𝑘𝜇0\displaystyle\sigma_{n+\hat{k},\mu=0}italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_k end_ARG , italic_μ = 0 end_POSTSUBSCRIPT
yk^subscript𝑦^𝑘absent\displaystyle y_{\hat{k}}\equivitalic_y start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ≡ σn+k^,μ=1subscript𝜎𝑛^𝑘𝜇1\displaystyle\sigma_{n+\hat{k},\mu=1}italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_k end_ARG , italic_μ = 1 end_POSTSUBSCRIPT (21)
zk^subscript𝑧^𝑘absent\displaystyle z_{\hat{k}}\equivitalic_z start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ≡ σn+k^,μ=2.subscript𝜎𝑛^𝑘𝜇2\displaystyle\sigma_{n+\hat{k},\mu=2}.italic_σ start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_k end_ARG , italic_μ = 2 end_POSTSUBSCRIPT .

Moreover, we define xx0^𝑥subscript𝑥^0x\equiv x_{\hat{0}}italic_x ≡ italic_x start_POSTSUBSCRIPT over^ start_ARG 0 end_ARG end_POSTSUBSCRIPT, yy0^𝑦subscript𝑦^0y\equiv y_{\hat{0}}italic_y ≡ italic_y start_POSTSUBSCRIPT over^ start_ARG 0 end_ARG end_POSTSUBSCRIPT, zz0^𝑧subscript𝑧^0z\equiv z_{\hat{0}}italic_z ≡ italic_z start_POSTSUBSCRIPT over^ start_ARG 0 end_ARG end_POSTSUBSCRIPT. The index n𝑛nitalic_n is not written explicitly here for brevity. Figure 3 shows a graphical representation of this index convention, where we locate the degrees of freedom σn,μsubscript𝜎𝑛𝜇\sigma_{n,\mu}italic_σ start_POSTSUBSCRIPT italic_n , italic_μ end_POSTSUBSCRIPT on the links between sites n𝑛nitalic_n and n+μ^𝑛^𝜇n+\hat{\mu}italic_n + over^ start_ARG italic_μ end_ARG.

Refer to caption
Figure 3: Schematic picture of the three plaquette terms at a lattice point n𝑛nitalic_n in different planes for the three-dimensional 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory. Index notation as introduced in Sec. IV.

The Boltzmann weight at site n𝑛nitalic_n is

Tx,xy^,xz^,y,yx^,yz^,z,zx^,zy^subscript𝑇𝑥subscript𝑥^𝑦subscript𝑥^𝑧𝑦subscript𝑦^𝑥subscript𝑦^𝑧𝑧subscript𝑧^𝑥subscript𝑧^𝑦\displaystyle T_{x,x_{\hat{y}},x_{\hat{z}},y,y_{\hat{x}},y_{\hat{z}},z,z_{\hat% {x}},z_{\hat{y}}}italic_T start_POSTSUBSCRIPT italic_x , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_z , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT
\displaystyle\equiv eβ(xxy^yyx^+xxz^zzx^+yyz^zzy^)/8.superscript𝑒𝛽𝑥subscript𝑥^𝑦𝑦subscript𝑦^𝑥𝑥subscript𝑥^𝑧𝑧subscript𝑧^𝑥𝑦subscript𝑦^𝑧𝑧subscript𝑧^𝑦8\displaystyle e^{-\beta\left(xx_{\hat{y}}yy_{\hat{x}}+xx_{\hat{z}}zz_{\hat{x}}% +yy_{\hat{z}}zz_{\hat{y}}\right)}/8.italic_e start_POSTSUPERSCRIPT - italic_β ( italic_x italic_x start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT italic_y italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT + italic_x italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT italic_z italic_z start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT + italic_y italic_y start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT italic_z italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 8 . (22)

We can translate this weight to a tensor network. For example, we can split the index xy^subscript𝑥^𝑦x_{\hat{y}}italic_x start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT from the tensor:

Tx,xy^,xz^,y,yx^,yz^,z,zx^,zy^subscript𝑇𝑥subscript𝑥^𝑦subscript𝑥^𝑧𝑦subscript𝑦^𝑥subscript𝑦^𝑧𝑧subscript𝑧^𝑥subscript𝑧^𝑦\displaystyle T_{x,x_{\hat{y}},x_{\hat{z}},y,y_{\hat{x}},y_{\hat{z}},z,z_{\hat% {x}},z_{\hat{y}}}italic_T start_POSTSUBSCRIPT italic_x , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_z , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=\displaystyle== ay^=±1Ax,xz^,y,yx^,yz^,z,zx^,zy^ay^Bxy^ay^subscriptsubscript𝑎^𝑦plus-or-minus1superscriptsubscript𝐴𝑥subscript𝑥^𝑧𝑦subscript𝑦^𝑥subscript𝑦^𝑧𝑧subscript𝑧^𝑥subscript𝑧^𝑦subscript𝑎^𝑦superscriptsubscript𝐵subscript𝑥^𝑦subscript𝑎^𝑦\displaystyle\sum_{{a_{\hat{y}}}=\pm 1}A_{x,x_{\hat{z}},y,y_{\hat{x}},y_{\hat{% z}},z,z_{\hat{x}},z_{\hat{y}}}^{a_{\hat{y}}}B_{x_{\hat{y}}}^{a_{\hat{y}}}∑ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT = ± 1 end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_x , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_z , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_B start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT (23)

One of the simplest choices for this decomposition is Bxy^ay^=δay^,xy^superscriptsubscript𝐵subscript𝑥^𝑦subscript𝑎^𝑦subscript𝛿subscript𝑎^𝑦subscript𝑥^𝑦B_{x_{\hat{y}}}^{a_{\hat{y}}}=\delta_{a_{\hat{y}},x_{\hat{y}}}italic_B start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = italic_δ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT. We define a new tensor without summation of the index x𝑥xitalic_x,

Cx,xz^,a,ay^,y,yx^,yz^,z,zx^,zy^subscript𝐶𝑥subscript𝑥^𝑧𝑎subscript𝑎^𝑦𝑦subscript𝑦^𝑥subscript𝑦^𝑧𝑧subscript𝑧^𝑥subscript𝑧^𝑦\displaystyle C_{x,x_{\hat{z}},a,a_{\hat{y}},y,y_{\hat{x}},y_{\hat{z}},z,z_{% \hat{x}},z_{\hat{y}}}italic_C start_POSTSUBSCRIPT italic_x , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_a , italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_z , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT
\displaystyle\equiv Tx,ay^,xz^,y,yx^,yz^,z,zx^,zy^δxa,subscript𝑇𝑥subscript𝑎^𝑦subscript𝑥^𝑧𝑦subscript𝑦^𝑥subscript𝑦^𝑧𝑧subscript𝑧^𝑥subscript𝑧^𝑦superscriptsubscript𝛿𝑥𝑎\displaystyle T_{x,a_{\hat{y}},x_{\hat{z}},y,y_{\hat{x}},y_{\hat{z}},z,z_{\hat% {x}},z_{\hat{y}}}\delta_{x}^{a},italic_T start_POSTSUBSCRIPT italic_x , italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_z , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT , (24)

where aa0^𝑎subscript𝑎^0a\equiv a_{\hat{0}}italic_a ≡ italic_a start_POSTSUBSCRIPT over^ start_ARG 0 end_ARG end_POSTSUBSCRIPT. Similarly, we can split the indices yz^subscript𝑦^𝑧y_{\hat{z}}italic_y start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT and zx^subscript𝑧^𝑥z_{\hat{x}}italic_z start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT from the tensor and shift the indices. This way, we obtain the initial tensor

Tx,xz^,a,ay^,y,yx^,b,bz^,z,zy^,c,cx^.(delta)subscriptsuperscript𝑇delta𝑥subscript𝑥^𝑧𝑎subscript𝑎^𝑦𝑦subscript𝑦^𝑥𝑏subscript𝑏^𝑧𝑧subscript𝑧^𝑦𝑐subscript𝑐^𝑥\displaystyle T^{(\mathrm{delta})}_{x,x_{\hat{z}},a,a_{\hat{y}},y,y_{\hat{x}},% b,b_{\hat{z}},z,z_{\hat{y}},c,c_{\hat{x}}.}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_a , italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_b , italic_b start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_z , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_c , italic_c start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT . end_POSTSUBSCRIPT
\displaystyle\equiv Tx,xz^,ay^,y,yx^,bz^,z,zy^,cx^δxaδybδzc.subscript𝑇𝑥subscript𝑥^𝑧subscript𝑎^𝑦𝑦subscript𝑦^𝑥subscript𝑏^𝑧𝑧subscript𝑧^𝑦subscript𝑐^𝑥superscriptsubscript𝛿𝑥𝑎superscriptsubscript𝛿𝑦𝑏superscriptsubscript𝛿𝑧𝑐\displaystyle T_{x,x_{\hat{z}},a_{\hat{y}},y,y_{\hat{x}},b_{\hat{z}},z,z_{\hat% {y}},c_{\hat{x}}}\delta_{x}^{a}\delta_{y}^{b}\delta_{z}^{c}.italic_T start_POSTSUBSCRIPT italic_x , italic_x start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT , italic_z , italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT . (25)

We define new spin-3/2 indices, [az]n+y^ay^zy^subscriptdelimited-[]𝑎𝑧𝑛^𝑦tensor-productsubscript𝑎^𝑦subscript𝑧^𝑦[az]_{n+\hat{y}}\equiv a_{\hat{y}}\otimes z_{\hat{y}}[ italic_a italic_z ] start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT ≡ italic_a start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT ⊗ italic_z start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT and finally obtain the partition function

Z=[az],[bx],[cy]=14nT[az]n[az]n+y^[bx]n[bx]n+z^[cy]n[cy]n+x^(delta).𝑍superscriptsubscriptdelimited-[]𝑎𝑧delimited-[]𝑏𝑥delimited-[]𝑐𝑦14subscriptproduct𝑛subscriptsuperscript𝑇deltasubscriptdelimited-[]𝑎𝑧𝑛subscriptdelimited-[]𝑎𝑧𝑛^𝑦subscriptdelimited-[]𝑏𝑥𝑛subscriptdelimited-[]𝑏𝑥𝑛^𝑧subscriptdelimited-[]𝑐𝑦𝑛subscriptdelimited-[]𝑐𝑦𝑛^𝑥Z=\sum_{[az],[bx],[cy]=1}^{4}\prod_{n}T^{(\mathrm{delta})}_{[az]_{n}[az]_{n+% \hat{y}}[bx]_{n}[bx]_{n+\hat{z}}[cy]_{n}[cy]_{n+\hat{x}}}.italic_Z = ∑ start_POSTSUBSCRIPT [ italic_a italic_z ] , [ italic_b italic_x ] , [ italic_c italic_y ] = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_a italic_z ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_a italic_z ] start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_y end_ARG end_POSTSUBSCRIPT [ italic_b italic_x ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_b italic_x ] start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_z end_ARG end_POSTSUBSCRIPT [ italic_c italic_y ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_c italic_y ] start_POSTSUBSCRIPT italic_n + over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (26)

This is a locally connected tensor network representation.

Numerical results for the free energy.

We test this representation with the initial tensor T(delta)superscript𝑇deltaT^{(\mathrm{delta})}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT without gauge-fixing by evaluating the partition function numerically. We set the system sizes in x𝑥xitalic_x, y𝑦yitalic_y, z𝑧zitalic_z direction to Nx=2subscript𝑁𝑥2N_{x}=2italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT = 2, Ny=Nz=215subscript𝑁𝑦subscript𝑁𝑧superscript215N_{y}=N_{z}=2^{15}italic_N start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT = italic_N start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT = 2 start_POSTSUPERSCRIPT 15 end_POSTSUPERSCRIPT. The first dimension is chosen small, similarly to Kuramashi and Yoshimura (2019). First, the three-dimensional system is reduced to a two-dimensional one by an HOTRG step without truncation. Then, we apply ATRG to perform the coarse-graining contractions with a truncation at a given bond dimension D𝐷Ditalic_D.

Refer to caption
Figure 4: Relative error of the free energy F𝐹Fitalic_F for the three-dimensional 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory for different bond dimensions D𝐷Ditalic_D. Shown are the results for ATRG and the shifted ATRG (see App. E) for the initial tensors T(exp)superscript𝑇expT^{(\mathrm{exp})}italic_T start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT from Eq. 20 and T(delta)superscript𝑇deltaT^{(\mathrm{delta})}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT from Eq. 25.

The free energy

F1βVlnZ,𝐹1𝛽𝑉ln𝑍F\equiv-\frac{1}{\beta V}\mathrm{ln}Z,italic_F ≡ - divide start_ARG 1 end_ARG start_ARG italic_β italic_V end_ARG roman_ln italic_Z , (27)

is calculated from the partition function. The relative error in dependence on the cutoff parameter D𝐷Ditalic_D is estimated by |1F(D)/F(D=128)|1𝐹𝐷𝐹𝐷128|1-F(D)/F(D=128)|| 1 - italic_F ( italic_D ) / italic_F ( italic_D = 128 ) |, where D=128𝐷128D=128italic_D = 128 is the largest bond dimension in our simulations.

Figure 4 shows the error for the ATRG coarse graining method at β=0.6561𝛽0.6561\beta=0.6561italic_β = 0.6561 with oversampling parameter r=2𝑟2r=2italic_r = 2. Additionally, we show the results for the shifted ATRG algorithm, which is explained in App. E. We observe no significant dependence on the initial tensor for both methods. The initial tensor T(delta)superscript𝑇deltaT^{\mathrm{(delta)}}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT, which is constructed without a Taylor expansion, leads to accurate results and the accuracy is comparable to calculations with the initial tensor T(exp)superscript𝑇expT^{\mathrm{(exp)}}italic_T start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT. The relative error between the ATRG and shifted ATRG methods is |1Fsh,ATRG(D=128)FATRG(D=128)|=𝒪(107)1subscript𝐹shATRG𝐷128subscript𝐹ATRG𝐷128𝒪superscript107\left|1-\frac{F_{\mathrm{sh,ATRG}}(D=128)}{F_{\mathrm{ATRG}}(D=128)}\right|={% \mathcal{O}}(10^{-7})| 1 - divide start_ARG italic_F start_POSTSUBSCRIPT roman_sh , roman_ATRG end_POSTSUBSCRIPT ( italic_D = 128 ) end_ARG start_ARG italic_F start_POSTSUBSCRIPT roman_ATRG end_POSTSUBSCRIPT ( italic_D = 128 ) end_ARG | = caligraphic_O ( 10 start_POSTSUPERSCRIPT - 7 end_POSTSUPERSCRIPT ), indicating that both methods converge to the same value. The error from randomized SVDs is sufficiently reduced by an r=2D𝑟2𝐷r=2Ditalic_r = 2 italic_D oversampling. Since the shifted ATRG is better suited for the impurity tensor method, as discussed in App. F, we use the shifted ATRG in calculations of the specific heat of the system.

The calculation of the free energy for the three-dimensional 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT model demonstrates that our initial tensor construction T(delta)superscript𝑇deltaT^{\mathrm{(delta)}}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT without expansion and gauge-fixing leads to results as accurate as those with the initial tensor constructions T(exp)superscript𝑇expT^{\mathrm{(exp)}}italic_T start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT.

Refer to caption

(a)

Refer to caption

(b)

Figure 5: Specific heat of the three-dimensional 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory for different inverse temperatures β𝛽\betaitalic_β. The dotted line marks the critical temperature βc=0.6561subscript𝛽𝑐0.6561\beta_{c}=0.6561italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = 0.6561 calculated in Kuramashi and Yoshimura (2019). Shown are the results obtained with the shifted ATRG method (see App. E) and the initial tensors T(delta)superscript𝑇deltaT^{(\mathrm{delta})}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT from Eq. 25. See main text for details of the calculation. (a) wide range of β𝛽\betaitalic_β. (b) zoom-in around the maximum value. Neighboring data points fluctuate due to the statistical errors from the randomized SVD. The grey band is taken as an estimate of the uncertainty of βcsubscript𝛽𝑐\beta_{c}italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT. See main text for details.

Numerical results for the specific heat.

We further calculate the specific heat

Cβ21V2lnZβ2.𝐶superscript𝛽21𝑉superscript2ln𝑍superscript𝛽2C\equiv\beta^{2}\frac{1}{V}\frac{\partial^{2}\mathrm{ln}Z}{\partial\beta^{2}}.italic_C ≡ italic_β start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_V end_ARG divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_ln italic_Z end_ARG start_ARG ∂ italic_β start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (28)

First, we obtain the first order derivative βlnZsubscript𝛽ln𝑍\partial_{\beta}\mathrm{ln}Z∂ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT roman_ln italic_Z by the impurity tensor method as explained in App. F. Then, the second order derivative and therefore C𝐶Citalic_C is derived from this with a numerical forth-order approximation of the differentials. For calculations not too close to the critical temperature, we choose a step size of δβ=0.002𝛿𝛽0.002\delta\beta=0.002italic_δ italic_β = 0.002 and a bond dimension D=64𝐷64D=64italic_D = 64. Closer to the critical value of β𝛽\betaitalic_β we set δβ=0.00025𝛿𝛽0.00025\delta\beta=0.00025italic_δ italic_β = 0.00025 and the bond dimension to D=112𝐷112D=112italic_D = 112. The error of the approximation for the second order derivative is 𝒪(δβ4)𝒪𝛿superscript𝛽4{\mathcal{O}}(\delta\beta^{4})caligraphic_O ( italic_δ italic_β start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ), becoming small for smaller δβ𝛿𝛽\delta\betaitalic_δ italic_β. On the other hand, any kind of error of the first order derivative δ(βlnZ)𝛿subscript𝛽ln𝑍\delta(\partial_{\beta}\mathrm{ln}Z)italic_δ ( ∂ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT roman_ln italic_Z ) propagates as 𝒪(δ(βlnZ)/(δβ))𝒪𝛿subscript𝛽ln𝑍𝛿𝛽{\mathcal{O}}(\delta(\partial_{\beta}\mathrm{ln}Z)/(\delta\beta))caligraphic_O ( italic_δ ( ∂ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT roman_ln italic_Z ) / ( italic_δ italic_β ) ), growing for small δβ𝛿𝛽\delta\betaitalic_δ italic_β. If one aims for high precision, the step size δβ𝛿𝛽\delta\betaitalic_δ italic_β should therefore be carefully chosen and optimized.

Figure 5 shows the specific heat of the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory with the initial tensor T(delta)superscript𝑇deltaT^{\mathrm{(delta)}}italic_T start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT. The critical temperature is found to be βc=0.6560(3)subscript𝛽𝑐0.65603\beta_{c}=0.6560(3)italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = 0.6560 ( 3 ). The uncertainty is estimated by the spread of results due to the randomized SVD. We choose the uncertainty of βcsubscript𝛽𝑐\beta_{c}italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT such that the largest ten data points lie in the error band, see Fig. 4(b). A more careful study of error sources would be needed if one aims for higher precision. Further methods to improve the accuracy can be found in Kuramashi and Yoshimura (2019). Our result βc=0.6560(3)subscript𝛽𝑐0.65603\beta_{c}=0.6560(3)italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = 0.6560 ( 3 ) is consistent to the TRG result βc=0.656097(1)subscript𝛽𝑐0.6560971\beta_{c}=0.656097(1)italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = 0.656097 ( 1 ) in Kuramashi and Yoshimura (2019) and the Monte-Carlo result βc=0.65608(5)subscript𝛽𝑐0.656085\beta_{c}=0.65608(5)italic_β start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = 0.65608 ( 5 ) in Svetitsky and Yaffe (1982).

The calculations show that our approach can successfully be applied to a wide range of systems including gauge theories, and can become a first candidate to investigate a system by means of TRG methods. The method can be applied to any translationally invariant spin-statistical system which has a finite number of spin degrees of freedom. We demonstrated this in this section in the case of the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory and discuss the generalization and scaling in Sec. V. Since we do not need a model-specific expansion of the original partition function or integrate out the original variables in our construction, this method can straightforwardly be used for a large class of systems, including gauge theories, to find the tensor network representation of physical quantities.

V General form of initial tensors

In this section, we consider the initial tensor construction method with delta functions for general models, including long-range and non-neighboring interactions. We derive the scaling d2[nint+ns1]superscript𝑑2delimited-[]subscript𝑛intsubscript𝑛s1{d}^{2[{n_{\mathrm{int}}}+{n_{\mathrm{s}}}-1]}italic_d start_POSTSUPERSCRIPT 2 [ italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT - 1 ] end_POSTSUPERSCRIPT for the number of elements of the initial tensors. Here, d𝑑{d}{}italic_d is the dimension of the local Hilbert space, nintsubscript𝑛int{n_{\mathrm{int}}}italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT is the number of lattice points of the original, not locally connected tensors representing the partition function. The number of Steiner points nssubscript𝑛s{n_{\mathrm{s}}}italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT corresponds to the number of lattice points needed to connect isolated regions, as explained later in this section.

Connected long range chain in 1d

As an example for longer range interactions, we consider a system where each lattice site is coupled to all sites up to a distance of k𝑘kitalic_k sites. The partition function can be written as

Z=σ=1di=1NKσi,σi+1,,σi+k.𝑍superscriptsubscript𝜎1𝑑superscriptsubscriptproduct𝑖1𝑁subscript𝐾subscript𝜎𝑖subscript𝜎𝑖1subscript𝜎𝑖𝑘Z=\sum_{\sigma=1}^{d}{}\prod_{i=1}^{N}K_{\sigma_{i},\sigma_{i+1},...,\sigma_{i% +k}}.italic_Z = ∑ start_POSTSUBSCRIPT italic_σ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , … , italic_σ start_POSTSUBSCRIPT italic_i + italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (29)

The local physical dimension is d𝑑{d}italic_d, and nint=k+1subscript𝑛int𝑘1{n_{\mathrm{int}}}=k+1italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT = italic_k + 1 is the number of indices of these initial tensors. See App. A for an example of this type.

We apply the decomposition with a delta matrix,

Kσi,σi+1,,σi+k=ai+1(1)=1dKσi,σi+1,,ai+1(1)δσi+k,ai+1(1).subscript𝐾subscript𝜎𝑖subscript𝜎𝑖1subscript𝜎𝑖𝑘superscriptsubscriptsubscriptsuperscript𝑎1𝑖11𝑑subscript𝐾subscript𝜎𝑖subscript𝜎𝑖1subscriptsuperscript𝑎1𝑖1subscript𝛿subscript𝜎𝑖𝑘subscriptsuperscript𝑎1𝑖1K_{\sigma_{i},\sigma_{i+1},...,\sigma_{i+k}}=\sum_{a^{(1)}_{i+1}=1}^{d}{}K_{% \sigma_{i},\sigma_{i+1},...,a^{(1)}_{i+1}}\delta_{\sigma_{i+k},a^{(1)}_{i+1}}.italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , … , italic_σ start_POSTSUBSCRIPT italic_i + italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , … , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i + italic_k end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (30)

Using translational invariance, we define the new tensor

Kσi,σi+1,,σi+k1,ai(1),ai+1(1)(1)subscriptsuperscript𝐾1subscript𝜎𝑖subscript𝜎𝑖1subscript𝜎𝑖𝑘1subscriptsuperscript𝑎1𝑖subscriptsuperscript𝑎1𝑖1\displaystyle K^{(1)}_{\sigma_{i},\sigma_{i+1},...,\sigma_{i+k-1},a^{(1)}_{i},% a^{(1)}_{i+1}}italic_K start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , … , italic_σ start_POSTSUBSCRIPT italic_i + italic_k - 1 end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
Kσi,σi+1,,σi+k1,ai+1(1)δσi+k1,ai(1),absentsubscript𝐾subscript𝜎𝑖subscript𝜎𝑖1subscript𝜎𝑖𝑘1subscriptsuperscript𝑎1𝑖1subscript𝛿subscript𝜎𝑖𝑘1subscriptsuperscript𝑎1𝑖\displaystyle\equiv K_{\sigma_{i},\sigma_{i+1},...,\sigma_{i+k-1},a^{(1)}_{i+1% }}\delta_{\sigma_{i+k-1},a^{(1)}_{i}},≡ italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , … , italic_σ start_POSTSUBSCRIPT italic_i + italic_k - 1 end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i + italic_k - 1 end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (31)

which leads to the same partition function as the original one if one takes the product of all tensors at different lattice sites and sums over all indices, similar to Eq. 29 but including the new indices a(1)superscript𝑎1a^{(1)}italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT.

We can repeat this procedure k1𝑘1k-1italic_k - 1 times to get the local representation

Kσi,σi+1,ai(1),ai+1(1),,ai(k1),ai+1(k1)(k1)subscriptsuperscript𝐾𝑘1subscript𝜎𝑖subscript𝜎𝑖1subscriptsuperscript𝑎1𝑖subscriptsuperscript𝑎1𝑖1subscriptsuperscript𝑎𝑘1𝑖subscriptsuperscript𝑎𝑘1𝑖1\displaystyle K^{(k-1)}_{\sigma_{i},\sigma_{i+1},a^{(1)}_{i},a^{(1)}_{i+1},...% ,a^{(k-1)}_{i},a^{(k-1)}_{i+1}}italic_K start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , … , italic_a start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
K[σiai(1)ai(k1)],[σi+1,ai+1(1),,ai+1(k1)](k1).absentsubscriptsuperscript𝐾𝑘1delimited-[]subscript𝜎𝑖subscriptsuperscript𝑎1𝑖subscriptsuperscript𝑎𝑘1𝑖subscript𝜎𝑖1subscriptsuperscript𝑎1𝑖1subscriptsuperscript𝑎𝑘1𝑖1\displaystyle\equiv K^{(k-1)}_{[\sigma_{i}a^{(1)}_{i}\dots a^{(k-1)}_{i}],[% \sigma_{i+1},a^{(1)}_{i+1},\dots,a^{(k-1)}_{i+1}]}.≡ italic_K start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT … italic_a start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] , [ italic_σ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT , … , italic_a start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ] end_POSTSUBSCRIPT . (32)

The index dimension of the combined indices (σa(1)a(k1))tensor-product𝜎superscript𝑎1superscript𝑎𝑘1(\sigma\otimes a^{(1)}\otimes...\otimes a^{(k-1)})( italic_σ ⊗ italic_a start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ⊗ … ⊗ italic_a start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ) between neighboring points is then dk{d}{}^{k}italic_d start_FLOATSUPERSCRIPT italic_k end_FLOATSUPERSCRIPT, and the initial tensor K(k1)superscript𝐾𝑘1K^{(k-1)}italic_K start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT has d=2kd2(nint1){d}{}^{2k}={d}{}^{2({n_{\mathrm{int}}}-1)}italic_d start_FLOATSUPERSCRIPT 2 italic_k end_FLOATSUPERSCRIPT = italic_d start_FLOATSUPERSCRIPT 2 ( italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT - 1 ) end_FLOATSUPERSCRIPT elements.

Refer to caption
Figure 6: Schematic picture for the construction of the initial tensor K[σab]x[σab]x+1(2)subscriptsuperscript𝐾2subscriptdelimited-[]𝜎𝑎𝑏𝑥subscriptdelimited-[]𝜎𝑎𝑏𝑥1K^{(2)}_{[\sigma ab]_{x}[\sigma ab]_{x+1}}italic_K start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_σ italic_a italic_b ] start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT [ italic_σ italic_a italic_b ] start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT from the original tensor Kσxσx+1σx+2σx+3subscript𝐾subscript𝜎𝑥subscript𝜎𝑥1subscript𝜎𝑥2subscript𝜎𝑥3K_{\sigma_{x}\sigma_{x+1}\sigma_{x+2}\sigma_{x+3}}italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT for a one-dimensional system with long range interaction. Green dots represent the indices of the original tensor K𝐾Kitalic_K. Each colored arrow stands for a decomposition with a delta function, which creates new indices in K(2)superscript𝐾2K^{(2)}italic_K start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT. See main text for details.

Figure 6 shows a schematic picture of our method for k=3𝑘3k=3italic_k = 3. The original tensor Kσxσx+1σx+2σx+3subscript𝐾subscript𝜎𝑥subscript𝜎𝑥1subscript𝜎𝑥2subscript𝜎𝑥3K_{\sigma_{x}\sigma_{x+1}\sigma_{x+2}\sigma_{x+3}}italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT has four spin variables as indices. These are represented by green dots, and their number is nint=4subscript𝑛int4{n_{\mathrm{int}}}=4italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT = 4. Each decomposition by a delta function creates two new indices and removes the dependence on one spin variable. We represent each such step by a colored arrow. Explicitly, the red arrow removes the dependence on σx+3subscript𝜎𝑥3\sigma_{x+3}italic_σ start_POSTSUBSCRIPT italic_x + 3 end_POSTSUBSCRIPT and creates new indices axsubscript𝑎𝑥a_{x}italic_a start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT and ax+1subscript𝑎𝑥1a_{x+1}italic_a start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT. The blue arrow similarly removes the dependence on σx+2subscript𝜎𝑥2\sigma_{x+2}italic_σ start_POSTSUBSCRIPT italic_x + 2 end_POSTSUBSCRIPT and creates new indices bxsubscript𝑏𝑥b_{x}italic_b start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT and bx+1subscript𝑏𝑥1b_{x+1}italic_b start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT. The black arrow connects nearest neighbors in the original spin indices, and does not correspond to a decomposition.

Disconnected long range interaction in 1d

The bond size of the tensor network representation in 1d depends only on the maximum interaction distance. For example, we consider a system where the interactions only connect sites at a distance k𝑘kitalic_k from each other. The partition function is

Z=σ=1dx=1NKσi,σi+k.𝑍superscriptsubscript𝜎1𝑑superscriptsubscriptproduct𝑥1𝑁subscript𝐾subscript𝜎𝑖subscript𝜎𝑖𝑘Z=\sum_{\sigma=1}^{d}{}\prod_{x=1}^{N}K_{\sigma_{i},\sigma_{i+k}}.italic_Z = ∑ start_POSTSUBSCRIPT italic_σ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_x = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_i + italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (33)

Our procedure to construct the initial tensors is similar to the previous example, and leads to the same form of the initial tensors as in Eq. 32. The index size is thus the same, the elements of the tensors differ though. Appendix B discusses an example of this type of long range interaction.

Refer to caption
Figure 7: Schematic picture for the construction of the initial tensor K[σab]x[σab]x+1(2)subscriptsuperscript𝐾2subscriptdelimited-[]𝜎𝑎𝑏𝑥subscriptdelimited-[]𝜎𝑎𝑏𝑥1K^{(2)}_{[\sigma ab]_{x}[\sigma ab]_{x+1}}italic_K start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_σ italic_a italic_b ] start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT [ italic_σ italic_a italic_b ] start_POSTSUBSCRIPT italic_x + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT from the original tensor Kσxσx+3subscript𝐾subscript𝜎𝑥subscript𝜎𝑥3K_{\sigma_{x}\sigma_{x+3}}italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT for a one-dimensional system with interaction only between sites with a distance of 3 lattice sites. Green dots represent the indices of the original tensor K𝐾Kitalic_K. Each colored arrow stands for a decomposition with a delta function, which creates new indices in K(2)superscript𝐾2K^{(2)}italic_K start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT. Red dots with green outlines are indices which did not exist in the original tensor K𝐾Kitalic_K, but have to be included in our procedure to connect disconnected regions (Steiner points Hanan (1966)).

Figure 7 shows a schematic picture of the procedure. The red arrow removes the dependence on σx+ksubscript𝜎𝑥𝑘\sigma_{x+k}italic_σ start_POSTSUBSCRIPT italic_x + italic_k end_POSTSUBSCRIPT but introduces a dependence on the site x+k1𝑥𝑘1x+k-1italic_x + italic_k - 1, which is denoted as a red dot in our graphical notation. This new dependence is removed by the blue arrow. The green outlines indicate that the original tensor did not depend on these sites. The number of arrows is the same as in the previous example, and thus the resulting tensor has the same dimensions.

We define the number of arrows, which corresponds to the number of decompositions in our method, as ndecsubscript𝑛dec{n_{\mathrm{dec}}}italic_n start_POSTSUBSCRIPT roman_dec end_POSTSUBSCRIPT. The initial tensor can then be represented as a (d×ndecd)ndec({d}{}^{{n_{\mathrm{dec}}}}\times{d}{}^{{n_{\mathrm{dec}}}})( italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_dec end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT × italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_dec end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT ) matrix K(k1)superscript𝐾𝑘1K^{(k-1)}italic_K start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT. This can also be expressed in terms of the number of original spin values (green dots) nintsubscript𝑛int{n_{\mathrm{int}}}italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT and the number of generated Steiner points (red dots) nssubscript𝑛s{n_{\mathrm{s}}}italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT. The latter are needed to connect disconnected regions of the lattice, and are the points with green outlines in Fig. 7. In the case discussed here, the new tensor has size (d×nint1+nsd)nint1+ns({d}{}^{{n_{\mathrm{int}}}-1+{n_{\mathrm{s}}}}\times{d}{}^{{n_{\mathrm{int}}}-% 1+{n_{\mathrm{s}}}})( italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT - 1 + italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT × italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT - 1 + italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT ), and thus has d2(nint1+ns){d}{}^{2({n_{\mathrm{int}}}-1+{n_{\mathrm{s}}})}italic_d start_FLOATSUPERSCRIPT 2 ( italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT - 1 + italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT ) end_FLOATSUPERSCRIPT elements.

Higher dimensions

Refer to caption
Figure 8: Schematic picture of the decompositions and initial tensor indices for the J1J2subscript𝐽1subscript𝐽2J_{1}-J_{2}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT Ising model (see App. A) or a plaquette term in two dimensions. See Fig. 6 for symbology.
Refer to caption
Figure 9: Alternative to Fig. 8 for constructing the initial tensor. See Fig. 6 for symbology. The number of arrows and therefore the number of tensor elements remains the same, but the shape of the initial tensor differs.

The same scaling d2(nint1+ns){d}{}^{2({n_{\mathrm{int}}}-1+{n_{\mathrm{s}}})}italic_d start_FLOATSUPERSCRIPT 2 ( italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT - 1 + italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT ) end_FLOATSUPERSCRIPT holds in higher dimensions as well. However, nintsubscript𝑛int{n_{\mathrm{int}}}italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT typically grows in higher dimensions because interactions happen in more directions. We can use the graphical notation again, as shown for example in Fig. 8. Arrows are introduced such that a path arises from all sites that take part in the interaction to the origin. Each arrow in a given spatial direction in the lattice contributes a factor d𝑑ditalic_d in the bond size of the index for this direction in the constructed tensor. Note, however, that the choice of arrows is not unique anymore in more than one dimension. For example, Fig. 9 shows an alternative way to connect the tensors compared to Fig. 8. The constructed tensor has the same number of elements in this case, but the dimensions of the individual indices differ.

Refer to caption
Figure 10: Schematic picture of the decompositions for constructing the initial tensors for a more complicated interaction structure. See Fig. 7 for symbology. Green outlined dots are indices that are not present in the original tensor. However, the red dot with green outline needs to be included to connect disconnected regions. The model is explained in the main text.

Finally, we discuss the example in Fig. 10 where isolated regions arise. The nearest neighbors of the lower left site do not take part in the interaction, which is symbolized by dashed red outlines of these sites. To form a connected graph, at least one isolated point has to be included. Finding the minimum number of arrows in our graphical representation is a well known problem in graph theory, known as the rectilinear Steiner tree problem Hanan (1966). The graph in Fig. 10 has nx=2subscript𝑛𝑥2n_{x}=2italic_n start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT = 2 arrows in x-direction, ny=2subscript𝑛𝑦2n_{y}=2italic_n start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT = 2 arrows in y-direction and one isolated point. Thus, the constructed tensor has the dimensions (d×nxd×nxd×nyd)ny({d}{}^{n_{x}}\times{d}{}^{n_{x}}\times{d}{}^{n_{y}}\times{d}{}^{n_{y}})( italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT × italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT × italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT × italic_d start_FLOATSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_FLOATSUPERSCRIPT ) and d=2[nint+ns1]d=2[4+11]d8{d}{}^{2[{n_{\mathrm{int}}}+{n_{\mathrm{s}}}-1]}={d}{}^{2[4+1-1]}={d}{}^{8}italic_d start_FLOATSUPERSCRIPT 2 [ italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT - 1 ] end_FLOATSUPERSCRIPT = italic_d start_FLOATSUPERSCRIPT 2 [ 4 + 1 - 1 ] end_FLOATSUPERSCRIPT = italic_d start_FLOATSUPERSCRIPT 8 end_FLOATSUPERSCRIPT elements.

The connectivity of Fig. 10 allows for various types of interactions. It can express nearest neighbor interactions in positive and negative x- and y-directions, next-to-nearest neighbor interactions (diagonal), and next-to-nearest neighbor interactions (one site up, two sites in y-direction, or two sites up and one site in y-direction). Moreover, three- and four-site interactions are possible. The most generic form of a spin model of this type has 12 parameters. Even such an involved model can be expressed with an initial tensor of moderate dimensions (4×4×4×4)4444(4\times 4\times 4\times 4)( 4 × 4 × 4 × 4 ) for d=2𝑑2{d}=2italic_d = 2. The explicit form of possible interactions for the graph in Fig. 10 is:

Kσx,yσx+1,y+1σx+1,y+2σx+2,y+1subscript𝐾subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥1𝑦2subscript𝜎𝑥2𝑦1\displaystyle K_{\sigma_{x,y}\sigma_{x+1,y+1}\sigma_{x+1,y+2}\sigma_{x+2,y+1}}italic_K start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT (34)
=\displaystyle== eh(σx,y+σx+1,y+1+σx+1,y+2+σx+2,y+1)superscript𝑒subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥1𝑦2subscript𝜎𝑥2𝑦1\displaystyle e^{h(\sigma_{x,y}+\sigma_{x+1,y+1}+\sigma_{x+1,y+2}+\sigma_{x+2,% y+1})}italic_e start_POSTSUPERSCRIPT italic_h ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×\displaystyle\times× eJ1(x)σx+1,y+1σx+2,y+1+J1(y)σx+1,y+1σx+1,y+2superscript𝑒superscriptsubscript𝐽1𝑥subscript𝜎𝑥1𝑦1subscript𝜎𝑥2𝑦1superscriptsubscript𝐽1𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥1𝑦2\displaystyle e^{J_{1}^{(x)}\sigma_{x+1,y+1}\sigma_{x+2,y+1}+J_{1}^{(y)}\sigma% _{x+1,y+1}\sigma_{x+1,y+2}}italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_y ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
×\displaystyle\times× eJ2(1)σx,yσx+1,y+1+J2(2)σx+1,y+2σx+2,y+1superscript𝑒superscriptsubscript𝐽21subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1superscriptsubscript𝐽22subscript𝜎𝑥1𝑦2subscript𝜎𝑥2𝑦1\displaystyle e^{J_{2}^{(1)}\sigma_{x,y}\sigma_{x+1,y+1}+J_{2}^{(2)}\sigma_{x+% 1,y+2}\sigma_{x+2,y+1}}italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
×\displaystyle\times× eg3(1)σx,yσx+2,y+1+g3(2)σx,yσx+1,y+2superscript𝑒superscriptsubscript𝑔31subscript𝜎𝑥𝑦subscript𝜎𝑥2𝑦1superscriptsubscript𝑔32subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦2\displaystyle e^{g_{3}^{(1)}\sigma_{x,y}\sigma_{x+2,y+1}+g_{3}^{(2)}\sigma_{x,% y}\sigma_{x+1,y+2}}italic_e start_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT + italic_g start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
×\displaystyle\times× et8σx,yσx+2,y+1σx+1,y+2superscript𝑒subscript𝑡8subscript𝜎𝑥𝑦subscript𝜎𝑥2𝑦1subscript𝜎𝑥1𝑦2\displaystyle e^{t_{8}\sigma_{x,y}\sigma_{x+2,y+1}\sigma_{x+1,y+2}}italic_e start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
×\displaystyle\times× et6(1)σx,yσx+1,y+1σx+2,y+1+t6(2)σx,yσx+1,y+1σx+1,y+2superscript𝑒superscriptsubscript𝑡61subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥2𝑦1superscriptsubscript𝑡62subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥1𝑦2\displaystyle e^{t_{6}^{(1)}\sigma_{x,y}\sigma_{x+1,y+1}\sigma_{x+2,y+1}+t_{6}% ^{(2)}\sigma_{x,y}\sigma_{x+1,y+1}\sigma_{x+1,y+2}}italic_e start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT + italic_t start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
×\displaystyle\times× et4σx+1,y+1σx+2,y+1σx+1,y+2superscript𝑒subscript𝑡4subscript𝜎𝑥1𝑦1subscript𝜎𝑥2𝑦1subscript𝜎𝑥1𝑦2\displaystyle e^{t_{4}\sigma_{x+1,y+1}\sigma_{x+2,y+1}\sigma_{x+1,y+2}}italic_e start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
×\displaystyle\times× efσx,yσx+1,y+1σx+2,y+1σx+1,y+2.superscript𝑒𝑓subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥2𝑦1subscript𝜎𝑥1𝑦2\displaystyle e^{f\sigma_{x,y}\sigma_{x+1,y+1}\sigma_{x+2,y+1}\sigma_{x+1,y+2}}.italic_e start_POSTSUPERSCRIPT italic_f italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT .

Multi-flavour systems

So far we only considered one-flavour systems, but the ideas can be generalized easily to multi-flavour systems. For example, the degrees of freedom of the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT model can be located on the links pointing from r^^𝑟\hat{r}over^ start_ARG italic_r end_ARG to r^+μ^^𝑟^𝜇\hat{r}+\hat{\mu}over^ start_ARG italic_r end_ARG + over^ start_ARG italic_μ end_ARG, where r^^𝑟\hat{r}over^ start_ARG italic_r end_ARG is a coordinate and μ^{x^,y^,z^}^𝜇^𝑥^𝑦^𝑧\hat{\mu}\in\{\hat{x},\hat{y},\hat{z}\}over^ start_ARG italic_μ end_ARG ∈ { over^ start_ARG italic_x end_ARG , over^ start_ARG italic_y end_ARG , over^ start_ARG italic_z end_ARG } is a unit vector in one of the three directions. This is indicated in Fig. 3. We can also localize each such gauge degrees of freedom at the node with position r^^𝑟\hat{r}over^ start_ARG italic_r end_ARG. Then, at each node, an additional degree of freedom arises for the three cases μ^=x^^𝜇^𝑥\hat{\mu}=\hat{x}over^ start_ARG italic_μ end_ARG = over^ start_ARG italic_x end_ARG, μ^=y^^𝜇^𝑦\hat{\mu}=\hat{y}over^ start_ARG italic_μ end_ARG = over^ start_ARG italic_y end_ARG, μ^=z^^𝜇^𝑧\hat{\mu}=\hat{z}over^ start_ARG italic_μ end_ARG = over^ start_ARG italic_z end_ARG. The connectivity is then the same as for the J1J2subscript𝐽1subscript𝐽2J_{1}-J_{2}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT model, but with three distinct flavours. There is no Steiner point, ns=0subscript𝑛s0{n_{\mathrm{s}}}=0italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT = 0, and the degrees of freedom in each direction is two, such that the number of elements of the initial tensor is (23)2(31)=212=(43)2superscriptsuperscript23231superscript212superscriptsuperscript432\left(2^{3}\right)^{2(3-1)}=2^{12}=(4^{3})^{2}( 2 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 ( 3 - 1 ) end_POSTSUPERSCRIPT = 2 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT = ( 4 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The initial tensors can be formed by a (4×4×4×4×4×4)444444(4\times 4\times 4\times 4\times 4\times 4)( 4 × 4 × 4 × 4 × 4 × 4 ) tensor as shown in main text in Sec. IV.

VI Conclusion

In this paper we introduced a simple construction of a tensor network representing a partition function. By inserting a delta function and redefining the tensors, we can construct a locally connected tensor network for any translational invariant theory in any dimension. This network can then be coarse-grained with TRG methods to calculate the partition function and observables.

In a general case, a partition function can be represented by an initial tensor with d2(nint1+ns)superscript𝑑2subscript𝑛int1subscript𝑛s{d}^{2({n_{\mathrm{int}}}-1+{n_{\mathrm{s}}})}italic_d start_POSTSUPERSCRIPT 2 ( italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT - 1 + italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT elements (see Sec. V). Here, d𝑑{d}{}italic_d is the dimension of the local degrees of freedom, and nintsubscript𝑛int{n_{\mathrm{int}}}italic_n start_POSTSUBSCRIPT roman_int end_POSTSUBSCRIPT is the number of indices of the original tensor, which did not form a locally connected tensor network. If disconnected regions exist in the interactions, nssubscript𝑛s{n_{\mathrm{s}}}italic_n start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT corresponds to the Steiner points Hanan (1966) that are needed to connect these regions.

We demonstrated the applicability of our method in a one-dimensional spin system with multiple interaction terms as a simple example. We extended the method to two-dimensions and investigated the initial tensor dependence of the TRG method. The accuracy of these methods highly depends on the initial tensors and on the details of the TRG method. A high sensitivity was found in the original HOTRG. Our results suggest that one should use symmetric initial tensors for this method. We conclude that the initial tensor influences the numerical accuracy significantly depending on the TRG method, and should be chosen carefully for reliable calculations. We found that symmetric initial tensors lead to better results for many coarse-graining methods, and we calculated a symmetric representation for the two-dimensional Ising model based on our initial tensor construction.

Moreover, we showed that the initial tensor dependence can be eliminated by applying the ideas of the boundary TRG method to HOTRG. In general, any TRG method, such as ATRG and MDTRG, that makes use of isometries to form the new indices of the coarse grained tensors, has a strong initial tensor dependence. We showed, however, how these methods can be modified slightly to use squeezers instead of isometries, as introduced in the boundary TRG Iino et al. (2019). This way, the dependence on initial tensors and their symmetries can be removed, which makes the algorithms more resilient against systematic errors coming from an interplay between the choice of initial tensors and the coarse graining method.

The precision of TRG algorithms also depends on the type of index-exchange between coarse-graining steps. There are several possibilities to alternate between coarse-graining in xlimit-from𝑥x-italic_x - and ylimit-from𝑦y-italic_y - direction. We showed that systematic errors can accumulate with the wrong type of index exchange and discussed the optimal choice for different coarse-graining methods.

We further applied our tensor construction to the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory in three-dimensions without gauge-fixing. We neither need to consider any expansion nor do we have to integrate out original variables. The results for the free energy and the specific heat with our simple tensor construction were consistent with TRG calculations using expansions and gauge-fixing, and with Monte-Carlo simulations. For the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory, our construction resulted in an accuracy of the free energy comparable to that of the usual construction by an expansion.

Summarizing, the initial tensor construction presented in this work is a way to translate the partition function to a locally connected tensor network. The approach is simple and can be applied to various systems, without relying on model-specific expansions. Moreover, we found worrisome dependence of HOTRG-like methods (isometric ATRG, MDTRG, HOTRG) on the choice of initial tensors. Even if different choices are mathematically equivalent, the truncation procedures of the coarse graining steps introduces systematic errors. The previously mentioned methods should therefore only be used in their original form for symmetric initial tensors. However, we found that the methods can be made resilient against errors from the choice of initial tensors by using the ideas of the boundary TRG. With this, or by choosing alternative coarse graining algorithms like ATRG, our initial tensor construction leads to a similar accuracy as other construction methods, making it a simple and powerful tool for TRG calculations.

Acknowledgments

We would like to thank Shinji Takeda and Daisuke Kadoh for discussions. This work was supported by JSPS KAKENHI Grant Number 24K17059.

Appendix A J1-J2 Ising model

In this appendix, we discuss how our method can be used to construct the tensor network representation of the J1J2subscript𝐽1subscript𝐽2J_{1}-J_{2}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT Ising model. The system with N𝑁Nitalic_N sites can be described by the partition function

Z=𝑍absent\displaystyle Z=italic_Z = σ=±1x,y=1NeJ12(σx,yσx+1,y+σx,yσx,y+1)subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥𝑦1𝑁superscript𝑒subscript𝐽12subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦subscript𝜎𝑥𝑦1\displaystyle\sum_{\sigma=\pm 1}\prod_{x,y=1}^{N}e^{\frac{J_{1}}{2}(\sigma_{x,% y}\sigma_{x+1,y}+\sigma_{x,y}\sigma_{x,y+1})}∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT divide start_ARG italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×eJ12(σx,y+1σx+1,y+1+σx+1,yσx+1,y+1)absentsuperscript𝑒subscript𝐽12subscript𝜎𝑥𝑦1subscript𝜎𝑥1𝑦1subscript𝜎𝑥1𝑦subscript𝜎𝑥1𝑦1\displaystyle\times e^{\frac{J_{1}}{2}(\sigma_{x,y+1}\sigma_{x+1,y+1}+\sigma_{% x+1,y}\sigma_{x+1,y+1})}× italic_e start_POSTSUPERSCRIPT divide start_ARG italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×eJ2(σx,yσx+1,y+1+σx,y+1σx+1,y)absentsuperscript𝑒subscript𝐽2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥𝑦1subscript𝜎𝑥1𝑦\displaystyle\times e^{J_{2}(\sigma_{x,y}\sigma_{x+1,y+1}+\sigma_{x,y+1}\sigma% _{x+1,y})}× italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT (35)
=\displaystyle== σ=±1x,y=1NKσx,y,σx+1,y,σx,y+1,σx+1,y+1(J1J2),subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥𝑦1𝑁subscriptsuperscript𝐾subscript𝐽1subscript𝐽2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦1subscript𝜎𝑥1𝑦1\displaystyle\sum_{\sigma=\pm 1}\prod_{x,y=1}^{N}K^{(J_{1}J_{2})}_{\sigma_{x,y% },\sigma_{x+1,y},\sigma_{x,y+1},\sigma_{x+1,y+1}},∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (36)

with the spin indices σx,ysubscript𝜎𝑥𝑦\sigma_{x,y}italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT at sites {x,y}𝑥𝑦\{x,y\}{ italic_x , italic_y } and coupling constants J1subscript𝐽1J_{1}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and J2subscript𝐽2J_{2}italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. By setting the coupling J1<0subscript𝐽10J_{1}<0italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < 0 and J2>0subscript𝐽20J_{2}>0italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0, frustrated systems can be studied in this model.

The representation through the tensor K(J1J2)superscript𝐾subscript𝐽1subscript𝐽2K^{(J_{1}J_{2})}italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT is not a two-dimensional locally connected tensor network. We can construct such a network by inserting delta functions. First, we split the next-nearest neighbor spin σx+1,y+1subscript𝜎𝑥1𝑦1\sigma_{x+1,y+1}italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT from the tensor:

Kσx,y,σx+1,y,σx,y+1,σx+1,y+1(J1J2)subscriptsuperscript𝐾subscript𝐽1subscript𝐽2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦1subscript𝜎𝑥1𝑦1\displaystyle K^{(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_{x,y+1},% \sigma_{x+1,y+1}}italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=a=±1Kσx,y,σx+1,y,σx,y+1,ax+1,y(J1J2)δσx+1,y+1ax+1,y.absentsubscript𝑎plus-or-minus1subscriptsuperscript𝐾subscript𝐽1subscript𝐽2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦1subscript𝑎𝑥1𝑦subscriptsuperscript𝛿subscript𝑎𝑥1𝑦subscript𝜎𝑥1𝑦1\displaystyle=\sum_{a=\pm 1}K^{(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},% \sigma_{x,y+1},a_{x+1,y}}\delta^{a_{x+1,y}}_{\sigma_{x+1,y+1}}.= ∑ start_POSTSUBSCRIPT italic_a = ± 1 end_POSTSUBSCRIPT italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (37)

With this, we define a new tensor K(J1J2)K^{{}^{\prime}(J_{1}J_{2})}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT,

Kσx,y,σx+1,y,σx,y+1,ax,y,ax+1,y(J1J2)\displaystyle K^{{}^{\prime}(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_% {x,y+1},a_{x,y},a_{x+1,y}}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT
Kσx,y,σx+1,y,σx,y+1,ax+1,y(J1J2)δσx,y+1ax,y.absentsubscriptsuperscript𝐾subscript𝐽1subscript𝐽2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦1subscript𝑎𝑥1𝑦subscriptsuperscript𝛿subscript𝑎𝑥𝑦subscript𝜎𝑥𝑦1\displaystyle\equiv K^{(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_{x,y+% 1},a_{x+1,y}}\delta^{a_{x,y}}_{\sigma_{x,y+1}}.≡ italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (38)

As a next step, we split the index σx,y+1subscript𝜎𝑥𝑦1\sigma_{x,y+1}italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT from the tensor:

Kσx,y,σx+1,y,σx,y+1,ax,y,ax+1,y(J1J2)\displaystyle K^{{}^{\prime}(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_% {x,y+1},a_{x,y},a_{x+1,y}}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=b=±1Kσx,y,σx+1,y,bx,y+1,ax,y,ax+1,y(J1J2)δσx,y+1bx,y+1,\displaystyle=\sum_{b=\pm 1}K^{{}^{\prime}(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{% x+1,y},b_{x,y+1},a_{x,y},a_{x+1,y}}\delta^{b_{x,y+1}}_{\sigma_{x,y+1}},= ∑ start_POSTSUBSCRIPT italic_b = ± 1 end_POSTSUBSCRIPT italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (39)

and define the new tensor K(J1J2)′′K^{{}^{\prime\prime}(J_{1}J_{2})}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT:

Kσx,y,σx+1,y,ax,y,ax+1,y,bx,y,bx,y+1(J1J2)′′\displaystyle K^{{}^{\prime\prime}(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},a% _{x,y},a_{x+1,y},b_{x,y},b_{x,y+1}}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
Kσx,y,σx+1,y,bx,y+1,ax,y,ax+1,y(J1J2)δσx,ybx,y\displaystyle\equiv K^{{}^{\prime}(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},b% _{x,y+1},a_{x,y},a_{x+1,y}}\delta^{b_{x,y}}_{\sigma_{x,y}}≡ italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT (40)
=Kσx,y,σx+1,y,bx,y+1,ax+1,y(J1J2)δbx,y+1ax,yδσx,ybx,yabsentsubscriptsuperscript𝐾subscript𝐽1subscript𝐽2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝑏𝑥𝑦1subscript𝑎𝑥1𝑦subscriptsuperscript𝛿subscript𝑎𝑥𝑦subscript𝑏𝑥𝑦1subscriptsuperscript𝛿subscript𝑏𝑥𝑦subscript𝜎𝑥𝑦\displaystyle=K^{(J_{1}J_{2})}_{\sigma_{x,y},\sigma_{x+1,y},b_{x,y+1},a_{x+1,y% }}\delta^{a_{x,y}}_{b_{x,y+1}}\delta^{b_{x,y}}_{\sigma_{x,y}}= italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT (41)
=eJ12(σx,yσx+1,y+σx,ybx,y+1)absentsuperscript𝑒subscript𝐽12subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦subscript𝑏𝑥𝑦1\displaystyle=e^{\frac{J_{1}}{2}(\sigma_{x,y}\sigma_{x+1,y}+\sigma_{x,y}b_{x,y% +1})}= italic_e start_POSTSUPERSCRIPT divide start_ARG italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×eJ12(bx,y+1ax+1,y+σx+1,yax+1,y)absentsuperscript𝑒subscript𝐽12subscript𝑏𝑥𝑦1subscript𝑎𝑥1𝑦subscript𝜎𝑥1𝑦subscript𝑎𝑥1𝑦\displaystyle\times e^{\frac{J_{1}}{2}(b_{x,y+1}a_{x+1,y}+\sigma_{x+1,y}a_{x+1% ,y})}× italic_e start_POSTSUPERSCRIPT divide start_ARG italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ( italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×eJ2(σx,yax+1,y+bx,y+1σx+1,y)absentsuperscript𝑒subscript𝐽2subscript𝜎𝑥𝑦subscript𝑎𝑥1𝑦subscript𝑏𝑥𝑦1subscript𝜎𝑥1𝑦\displaystyle\times e^{J_{2}(\sigma_{x,y}a_{x+1,y}+b_{x,y+1}\sigma_{x+1,y})}× italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT + italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×δbx,y+1ax,yδσx,ybx,y.absentsubscriptsuperscript𝛿subscript𝑎𝑥𝑦subscript𝑏𝑥𝑦1subscriptsuperscript𝛿subscript𝑏𝑥𝑦subscript𝜎𝑥𝑦\displaystyle\times\delta^{a_{x,y}}_{b_{x,y+1}}\delta^{b_{x,y}}_{\sigma_{x,y}}.× italic_δ start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (42)

We combine the σ𝜎\sigmaitalic_σ and a𝑎aitalic_a indices to form new bonds in x𝑥xitalic_x-direction: at position x,y𝑥𝑦x,yitalic_x , italic_y, the new index is [σa]x,yσx,yax,y=(σx,y,ax,y)subscriptdelimited-[]𝜎𝑎𝑥𝑦tensor-productsubscript𝜎𝑥𝑦subscript𝑎𝑥𝑦subscript𝜎𝑥𝑦subscript𝑎𝑥𝑦[\sigma a]_{x,y}\equiv\sigma_{x,y}\otimes a_{x,y}=(\sigma_{x,y},a_{x,y})[ italic_σ italic_a ] start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT ≡ italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT ⊗ italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT = ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT ). Finally, the local tensor representation in terms of K′′superscript𝐾′′K^{\prime\prime}italic_K start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT is

Z=[σa]b=±1x,y=1NK[σa]x,y,[σa]x+1,y,bx,y,bx,y+1(J1J2)′′.Z=\sum_{[\sigma a]}\sum_{b=\pm 1}\prod_{x,y=1}^{N}K^{{}^{\prime\prime}(J_{1}J_% {2})}_{[\sigma a]_{x,y},[\sigma a]_{x+1,y},b_{x,y},b_{x,y+1}}.italic_Z = ∑ start_POSTSUBSCRIPT [ italic_σ italic_a ] end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_b = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_σ italic_a ] start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , [ italic_σ italic_a ] start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (43)

The indices of this representation are independent of each other, and this initial tensor can be used for TRG coarse-graining.

We note that other representations can also be starting points for our procedure, as long as the contraction of the initial tensor network corresponds to the same partition function. For example, we can substitute K𝐾Kitalic_K in Eq. 42 by K(0)superscript𝐾0K^{(0)}italic_K start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT,

Kσx,y,σx+1,y,σx,y+1,σx+1,y+1(0)subscriptsuperscript𝐾0subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦1subscript𝜎𝑥1𝑦1\displaystyle K^{(0)}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_{x,y+1},\sigma_{x+1,% y+1}}italic_K start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=eJ1(σx,yσx+1,y+σx,yσx,y+1)absentsuperscript𝑒subscript𝐽1subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦subscript𝜎𝑥𝑦1\displaystyle=e^{{J_{1}}(\sigma_{x,y}\sigma_{x+1,y}+\sigma_{x,y}\sigma_{x,y+1})}= italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×eJ2(σx,yσx+1,y+1+σx,y+1σx+1,y).absentsuperscript𝑒subscript𝐽2subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦1subscript𝜎𝑥𝑦1subscript𝜎𝑥1𝑦\displaystyle\times e^{J_{2}(\sigma_{x,y}\sigma_{x+1,y+1}+\sigma_{x,y+1}\sigma% _{x+1,y})}.× italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y + 1 end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT . (44)

In any case, the tensor construction reproduces the original partition function if all indices of the network are contracted.

Our procedure results in an alternative representation of the partition function to those studied in Li and Yang (2021); Yoshiyama and Hukushima (2023). The authors of Yoshiyama and Hukushima (2023) state that physical quantities depend strongly on the choice of initial tensors and, for finite lattices, on the boundary conditions implemented by the tensor network representation. Additional candidates for initial tensors can therefore be helpful to find the most accurate representation for a given algorithm and system size.

Appendix B J1-J3 Ising model

As a kind of third-nearest neighbor Ising model, we discuss the J1J3subscript𝐽1subscript𝐽3J_{1}-J_{3}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT Ising model, which is also called biaxial next-nearest neighbor Ising model. The partition function is

Z=𝑍absent\displaystyle Z=italic_Z = σ=±1x,y=1NeJ1(σx,yσx+1,y+σx,yσx,y+1)subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥𝑦1𝑁superscript𝑒subscript𝐽1subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥𝑦subscript𝜎𝑥𝑦1\displaystyle\sum_{\sigma=\pm 1}\prod_{x,y=1}^{N}e^{J_{1}(\sigma_{x,y}\sigma_{% x+1,y}+\sigma_{x,y}\sigma_{x,y+1})}∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
×eJ3(σx,yσx+2,y+σx,yσx,y+2)absentsuperscript𝑒subscript𝐽3subscript𝜎𝑥𝑦subscript𝜎𝑥2𝑦subscript𝜎𝑥𝑦subscript𝜎𝑥𝑦2\displaystyle\times e^{J_{3}(\sigma_{x,y}\sigma_{x+2,y}+\sigma_{x,y}\sigma_{x,% y+2})}× italic_e start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT (45)
=\displaystyle== σ=±1x,y=1NKσx,y,σx+1,y,σx+2,y,σx,y+1,σx,y+2(J1J3)subscript𝜎plus-or-minus1superscriptsubscriptproduct𝑥𝑦1𝑁subscriptsuperscript𝐾subscript𝐽1subscript𝐽3subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥2𝑦subscript𝜎𝑥𝑦1subscript𝜎𝑥𝑦2\displaystyle\sum_{\sigma=\pm 1}\prod_{x,y=1}^{N}K^{(J_{1}J_{3})}_{\sigma_{x,y% },\sigma_{x+1,y},\sigma_{x+2,y},\sigma_{x,y+1},\sigma_{x,y+2}}∑ start_POSTSUBSCRIPT italic_σ = ± 1 end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT (46)

We split the next-next-nearest spins σx+2,ysubscript𝜎𝑥2𝑦\sigma_{x+2,y}italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y end_POSTSUBSCRIPT and σx,y+2subscript𝜎𝑥𝑦2\sigma_{x,y+2}italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 2 end_POSTSUBSCRIPT from the tensor,

Kσx,y,σx+1,y,σx+2,y,σx,y+1,σx,y+2(J1J3)subscriptsuperscript𝐾subscript𝐽1subscript𝐽3subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝜎𝑥2𝑦subscript𝜎𝑥𝑦1subscript𝜎𝑥𝑦2\displaystyle K^{(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_{x+2,y},% \sigma_{x,y+1},\sigma_{x,y+2}}italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=a,b=±1Kσx,y,σx+1,y,ax+1,y,σx,y+1,bx,y+1(J1J3)absentsubscript𝑎𝑏plus-or-minus1subscriptsuperscript𝐾subscript𝐽1subscript𝐽3subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝑎𝑥1𝑦subscript𝜎𝑥𝑦1subscript𝑏𝑥𝑦1\displaystyle=\sum_{a,b=\pm 1}K^{(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{x+1,y},a_% {x+1,y},\sigma_{x,y+1},b_{x,y+1}}= ∑ start_POSTSUBSCRIPT italic_a , italic_b = ± 1 end_POSTSUBSCRIPT italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
×δax+1,y,σx+2,yδbx,y+1,σx,y+2,absentsubscript𝛿subscript𝑎𝑥1𝑦subscript𝜎𝑥2𝑦subscript𝛿subscript𝑏𝑥𝑦1subscript𝜎𝑥𝑦2\displaystyle\times\delta_{a_{x+1,y},\sigma_{x+2,y}}\delta_{b_{x,y+1},\sigma_{% x,y+2}},× italic_δ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 2 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (47)

and define the tensor K(J1J3)K^{{}^{\prime}(J_{1}J_{3})}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT with shifted delta functions:

Kσx,y,σx+1,y,σx,y+1,ax,y,ax+1,y,bx,y,bx,y+1(J1J3)\displaystyle K^{{}^{\prime}(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_% {x,y+1},a_{x,y},a_{x+1,y},b_{x,y},b_{x,y+1}}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=Kσx,y,σx+1,y,ax+1,y,σx,y+1,bx,y+1(J1J3)absentsubscriptsuperscript𝐾subscript𝐽1subscript𝐽3subscript𝜎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝑎𝑥1𝑦subscript𝜎𝑥𝑦1subscript𝑏𝑥𝑦1\displaystyle=K^{(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{x+1,y},a_{x+1,y},\sigma_{% x,y+1},b_{x,y+1}}= italic_K start_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
×δax,y,σx+1,yδbx,y,σx,y+1.absentsubscript𝛿subscript𝑎𝑥𝑦subscript𝜎𝑥1𝑦subscript𝛿subscript𝑏𝑥𝑦subscript𝜎𝑥𝑦1\displaystyle\times\delta_{a_{x,y},\sigma_{x+1,y}}\delta_{b_{x,y},\sigma_{x,y+% 1}}.× italic_δ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (48)

Similarly, we split σx,y+1subscript𝜎𝑥𝑦1\sigma_{x,y+1}italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT,

Kσx,y,σx+1,y,σx,y+1,ax,y,ax+1,y,bx,y,bx,y+1(J1J3)\displaystyle K^{{}^{\prime}(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{x+1,y},\sigma_% {x,y+1},a_{x,y},a_{x+1,y},b_{x,y},b_{x,y+1}}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=c=±1Kσx,y,σx+1,y,cx,y+1,ax,y,ax+1,y,bx,y,bx,y+1(J1J3)\displaystyle=\sum_{c=\pm 1}K^{{}^{\prime}(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{% x+1,y},c_{x,y+1},a_{x,y},a_{x+1,y},b_{x,y},b_{x,y+1}}= ∑ start_POSTSUBSCRIPT italic_c = ± 1 end_POSTSUBSCRIPT italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
×δcx,y+1,σx,y+1,absentsubscript𝛿subscript𝑐𝑥𝑦1subscript𝜎𝑥𝑦1\displaystyle\times\delta_{c_{x,y+1},\sigma_{x,y+1}},× italic_δ start_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (49)

and define the tensor K(J1J3)′′K^{{}^{\prime\prime}(J_{1}J_{3})}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT:

Kσx,y,σx+1,y,cx,y,cx,y+1,ax,y,ax+1,y,bx,y,bx,y+1(J1J3)′′\displaystyle K^{{}^{\prime\prime}(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{x+1,y},c% _{x,y},c_{x,y+1},a_{x,y},a_{x+1,y},b_{x,y},b_{x,y+1}}italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=Kσx,y,σx+1,y,cx,y+1,ax,y,ax+1,y,bx,y,bx,y+1(J1J3)\displaystyle=K^{{}^{\prime}(J_{1}J_{3})}_{\sigma_{x,y},\sigma_{x+1,y},c_{x,y+% 1},a_{x,y},a_{x+1,y},b_{x,y},b_{x,y+1}}= italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
×δcx,y,σx,y.absentsubscript𝛿subscript𝑐𝑥𝑦subscript𝜎𝑥𝑦\displaystyle\times\delta_{c_{x,y},\sigma_{x,y}}.× italic_δ start_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (50)

We define the new indices in x𝑥xitalic_x-direction at position x,y𝑥𝑦x,yitalic_x , italic_y as [σa]x,y=(σx,y,ax,y)subscriptdelimited-[]𝜎𝑎𝑥𝑦subscript𝜎𝑥𝑦subscript𝑎𝑥𝑦[\sigma a]_{x,y}=(\sigma_{x,y},a_{x,y})[ italic_σ italic_a ] start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT = ( italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT ), and [cb]x,y=(cx,y,bx,y)subscriptdelimited-[]𝑐𝑏𝑥𝑦subscript𝑐𝑥𝑦subscript𝑏𝑥𝑦[cb]_{x,y}=(c_{x,y},b_{x,y})[ italic_c italic_b ] start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT = ( italic_c start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT ) for the y𝑦yitalic_y-direction. We finally obtain the locally connected tensor network representaion

Z=[σa],[c,b]x,y=1NK[σa]x,y[σa]x+1,y[cb]x,y[cb]x,y+1(J1J3)′′.Z=\sum_{[\sigma a],[c,b]}\prod_{x,y=1}^{N}K^{{}^{\prime\prime}(J_{1}J_{3})}_{[% \sigma a]_{x,y}[\sigma a]_{x+1,y}[cb]_{x,y}[cb]_{x,y+1}}.italic_Z = ∑ start_POSTSUBSCRIPT [ italic_σ italic_a ] , [ italic_c , italic_b ] end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_x , italic_y = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_σ italic_a ] start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT [ italic_σ italic_a ] start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT [ italic_c italic_b ] start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT [ italic_c italic_b ] start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (51)

Compared to the nearest-neighbor Ising model (see Eq. 12) and the J1J2subscript𝐽1subscript𝐽2J_{1}-J_{2}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT Ising model (see Eq. 43), the J1J3subscript𝐽1subscript𝐽3J_{1}-J_{3}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT Ising model is represented by an initial tensor with a larger bond dimension of the combined indices. This is a typical property for models with longer range interactions: they require a larger number of the decompositions, and thus create additional new indices in the locally connected tensor network representation. When these indices are combined, the new bonds have a larger bond dimension. See Sec. V for a general discussion of the scaling behavior.

Appendix C Symmetry of the initial tensor

In order to investigate the initial tensor dependence of various TRG algorithms for the two-dimensional Ising model in Sec. II, we consider the symmetrized tensor K(sym)superscript𝐾symK^{\mathrm{(sym)}}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT as a variant of K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT. The partition function in Eq. 11 does not change if we redefine the initial tensor as

KXXYY(sym)kkllAXkAXkKkkll(delta)AlY1AlY1.subscriptsuperscript𝐾sym𝑋superscript𝑋𝑌superscript𝑌subscript𝑘superscript𝑘𝑙superscript𝑙subscript𝐴𝑋𝑘subscript𝐴superscript𝑋superscript𝑘subscriptsuperscript𝐾delta𝑘superscript𝑘𝑙superscript𝑙superscriptsubscript𝐴𝑙𝑌1superscriptsubscript𝐴superscript𝑙superscript𝑌1K^{\mathrm{(sym)}}_{XX^{\prime}YY^{\prime}}\equiv\sum_{kk^{\prime}ll^{\prime}}% A_{Xk}A_{X^{\prime}k^{\prime}}K^{(\mathrm{delta})}_{kk^{\prime}ll^{\prime}}A_{% lY}^{-1}A_{l^{\prime}Y^{\prime}}^{-1}.italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≡ ∑ start_POSTSUBSCRIPT italic_k italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_l italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_X italic_k end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_l italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_l italic_Y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (52)

This reconstructed tensor can be made symmetric under swapping of the two indices Kabcd=K{abcd}subscript𝐾𝑎𝑏𝑐𝑑subscript𝐾𝑎𝑏𝑐𝑑K_{abcd}=K_{\{abcd\}}italic_K start_POSTSUBSCRIPT italic_a italic_b italic_c italic_d end_POSTSUBSCRIPT = italic_K start_POSTSUBSCRIPT { italic_a italic_b italic_c italic_d } end_POSTSUBSCRIPT if we choose A𝐴Aitalic_A in the right way.

Several methods are possible to find a suitable A𝐴Aitalic_A to make KXXYY(sym)subscriptsuperscript𝐾sym𝑋superscript𝑋𝑌superscript𝑌K^{\mathrm{(sym)}}_{XX^{\prime}YY^{\prime}}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT a symmetric tensor. We apply a numerical optimization starting from a random matrix. This matrix is optimized element-wise to minimize the cost function

c(sym)xxyy|KXXYYK{xxyy}|.superscript𝑐symsubscript𝑥superscript𝑥𝑦superscript𝑦subscript𝐾𝑋superscript𝑋𝑌superscript𝑌subscript𝐾𝑥superscript𝑥𝑦superscript𝑦c^{\mathrm{(sym)}}\equiv\sum_{xx^{\prime}yy^{\prime}}|K_{XX^{\prime}YY^{\prime% }}-K_{\{xx^{\prime}yy^{\prime}\}}|.italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT ≡ ∑ start_POSTSUBSCRIPT italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | italic_K start_POSTSUBSCRIPT italic_X italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_K start_POSTSUBSCRIPT { italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } end_POSTSUBSCRIPT | . (53)

In each optimization step we change a matrix element by the step size Δ1009similar-toΔsuperscript10similar-to09\Delta\sim 10^{0\sim-9}roman_Δ ∼ 10 start_POSTSUPERSCRIPT 0 ∼ - 9 end_POSTSUPERSCRIPT and consider Akl=Akl±Δsuperscriptsubscript𝐴𝑘𝑙plus-or-minussubscript𝐴𝑘𝑙ΔA_{kl}^{\prime}=A_{kl}\pm\Deltaitalic_A start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_A start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ± roman_Δ. We choose either Asuperscript𝐴A^{\prime}italic_A start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT or A𝐴Aitalic_A for the next step and accept or reject the change, depending on which of the two has the lower cost function. We sweep several times through all matrix elements and decrease ΔΔ\Deltaroman_Δ if all Asuperscript𝐴A^{\prime}italic_A start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT get rejected. Because the optimization can be stuck in local minima, we repeat the optimization with different randomly initialized matrices A𝐴Aitalic_A, until we find 1000 matrices with c(sym)2superscript𝑐sym2c^{\mathrm{(sym)}}\leq 2italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT ≤ 2. The partition function is calculated with HOTRG and the results are shown in Fig. 11 for all outcomes of this optimization. We found a tensor KXXYY(sym)subscriptsuperscript𝐾sym𝑋superscript𝑋𝑌superscript𝑌K^{\mathrm{(sym)}}_{XX^{\prime}YY^{\prime}}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT with a cost function c(sym)superscript𝑐symc^{\mathrm{(sym)}}italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT smaller than 101superscript10110^{-1}10 start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. The explicit form of K(sym)superscript𝐾symK^{\mathrm{(sym)}}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT for h=00h=0italic_h = 0, g=1𝑔1g=1italic_g = 1 is given in Eq. 54. Note that K(sym)K(exp)superscript𝐾symsuperscript𝐾expK^{\mathrm{(sym)}}\neq K^{\mathrm{(\mathrm{exp})}}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT ≠ italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT, although K(exp)superscript𝐾expK^{\mathrm{(\mathrm{exp})}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT is also a symmetric tensor.

Refer to caption
Refer to caption
Figure 11: Dependence of TRG methods on the symmetry c(sym)superscript𝑐symc^{\mathrm{(sym)}}italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT of the initial tensors (see Eq. 53) for the two-dimensional critical Isinig model. Shown is the relative error of the free energy for 1000 different initial tensors, which are obtained from an incomplete symmetrization process of the asymmetric tensor K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT. See main text for details. The dotted lines are results with the symmetric tensor K(exp)superscript𝐾expK^{(\mathrm{exp})}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT with c(sym)=0superscript𝑐sym0c^{\mathrm{(sym)}}=0italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT = 0. Dots are mean values in a bin of width 0.1. (Upper panel): HOTRG with strong symmetry dependence; (Lower panel:) TRG with no significant dependence on the initial tensors.

The free energy calculated with the symmetrized tensor K(sym)superscript𝐾symK^{\mathrm{(sym)}}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT is shown in Fig. 1 for HOTRG, and the accuracy is similar to a calculation with K(exp)superscript𝐾expK^{\mathrm{(\mathrm{exp})}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT. This shows that a symmetrization of the initial tensor K(delta)superscript𝐾deltaK^{(\mathrm{delta})}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT can improve the results for symmetry-dependent TRG methods like HOTRG.

Furthermore, we study the c(sym)superscript𝑐symc^{\mathrm{(sym)}}italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT dependence of the TRG and HOTRG methods in Fig. 11. The results clearly show that the HOTRG method becomes less precise and accurate when the initial tensors are less symmetric and c(sym)superscript𝑐symc^{\mathrm{(sym)}}italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT is large. In contrast to this, TRG shows almost no dependence on the symmetry behavior of the initial tensors.

We list the explicit representation of the symmetrized initial tensor for the two-dimensional Ising model in the following. For each {σ,a}𝜎𝑎\{\sigma,a\}{ italic_σ , italic_a } we define the combined index [σa]delimited-[]𝜎𝑎[\sigma a][ italic_σ italic_a ]. The indices are ordered as []=0,[+]=1,[+]=2,[++]=3[--]=0,[+-]=1,[-+]=2,[++]=3[ - - ] = 0 , [ + - ] = 1 , [ - + ] = 2 , [ + + ] = 3. Then, the symmetrized tensor K[σx,yax,y][σx+1,yax,y+1](sym)subscriptsuperscript𝐾symdelimited-[]subscript𝜎𝑥𝑦subscript𝑎𝑥𝑦delimited-[]subscript𝜎𝑥1𝑦subscript𝑎𝑥𝑦1K^{\mathrm{(sym)}}_{[\sigma_{x,y}a_{x,y}][\sigma_{x+1,y}a_{x,y+1}]}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT [ italic_σ start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x , italic_y end_POSTSUBSCRIPT ] [ italic_σ start_POSTSUBSCRIPT italic_x + 1 , italic_y end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_x , italic_y + 1 end_POSTSUBSCRIPT ] end_POSTSUBSCRIPT is

K00(sym)=subscriptsuperscript𝐾sym00absent\displaystyle K^{\mathrm{(sym)}}_{00}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT = 2.48037458878,2.48037458878\displaystyle 2.48037458878,2.48037458878 ,
K01(sym)=K02(sym)=subscriptsuperscript𝐾sym01subscriptsuperscript𝐾sym02absent\displaystyle K^{\mathrm{(sym)}}_{01}=K^{\mathrm{(sym)}}_{02}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT = 0.167834510235,0.167834510235\displaystyle 0.167834510235,0.167834510235 ,
K10(sym)=K20(sym)=subscriptsuperscript𝐾sym10subscriptsuperscript𝐾sym20absent\displaystyle K^{\mathrm{(sym)}}_{10}=K^{\mathrm{(sym)}}_{20}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT = 0.166746023749,0.166746023749\displaystyle 0.166746023749,0.166746023749 ,
K11(sym)=K12(sym)=K21(sym)=K22(sym)=subscriptsuperscript𝐾sym11subscriptsuperscript𝐾sym12subscriptsuperscript𝐾sym21subscriptsuperscript𝐾sym22absent\displaystyle K^{\mathrm{(sym)}}_{11}=K^{\mathrm{(sym)}}_{12}=K^{\mathrm{(sym)% }}_{21}=K^{\mathrm{(sym)}}_{22}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT = 0.334196191574,0.334196191574\displaystyle 0.334196191574,0.334196191574 ,
K13(sym)=K23(sym)=subscriptsuperscript𝐾sym13subscriptsuperscript𝐾sym23absent\displaystyle K^{\mathrm{(sym)}}_{13}=K^{\mathrm{(sym)}}_{23}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT = 0.749091024240,0.749091024240\displaystyle 0.749091024240,0.749091024240 ,
K31(sym)=K32(sym)=subscriptsuperscript𝐾sym31subscriptsuperscript𝐾sym32absent\displaystyle K^{\mathrm{(sym)}}_{31}=K^{\mathrm{(sym)}}_{32}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 31 end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 32 end_POSTSUBSCRIPT = 0.749047098416,0.749047098416\displaystyle 0.749047098416,0.749047098416 ,
K03(sym)=subscriptsuperscript𝐾sym03absent\displaystyle K^{\mathrm{(sym)}}_{03}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 03 end_POSTSUBSCRIPT = 0.334224186621,0.334224186621\displaystyle 0.334224186621,0.334224186621 ,
K30(sym)=subscriptsuperscript𝐾sym30absent\displaystyle K^{\mathrm{(sym)}}_{30}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 30 end_POSTSUBSCRIPT = 0.334168680654,0.334168680654\displaystyle 0.334168680654,0.334168680654 ,
K33(sym)=subscriptsuperscript𝐾sym33absent\displaystyle K^{\mathrm{(sym)}}_{33}=italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 33 end_POSTSUBSCRIPT = 1.67966015282.1.67966015282\displaystyle 1.67966015282.1.67966015282 . (54)

Note that this initial tensor is not exactly symmetric: to achieve c(sym)=0superscript𝑐sym0c^{\mathrm{(sym)}}=0italic_c start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT = 0, the relation Kab(sym)=Kba(sym)subscriptsuperscript𝐾sym𝑎𝑏subscriptsuperscript𝐾sym𝑏𝑎K^{\mathrm{(sym)}}_{ab}=K^{\mathrm{(sym)}}_{ba}italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT = italic_K start_POSTSUPERSCRIPT ( roman_sym ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b italic_a end_POSTSUBSCRIPT must hold for any index. However, the symmetry is sufficient for a reliable coarse graining with sufficient accuracy as can be seen in Fig. 1.

Appendix D Boundary TRG method

The boundary TRG method was originally introduced for open boundary systems to take into account the boundary effect in the coarse graining step. In this appendix, we present a generalization of the original HOTRG method Xie et al. (2012) using the boundary TRG technique Iino et al. (2019), which removes the dependence on the symmetry properties of the initial tensors. The idea can be generalized to other tensor renormalization methods.

CU=subscript𝐶𝑈absent\displaystyle C_{U}=italic_C start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT = Refer to caption
CV=subscript𝐶𝑉absent\displaystyle C_{V}=italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT = Refer to caption
Figure 12: Cost functions of the isometries U(HOTRG)superscript𝑈HOTRGU^{\mathrm{(HOTRG)}}italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT and V(HOTRG)superscript𝑉HOTRGV^{\mathrm{(HOTRG)}}italic_V start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT for HOTRG.

The difference between common TRG methods like HOTRG and boundary TRG is the truncation method in the coarse-graining step. In the original HOTRG, the isometries U(HOTRG)superscript𝑈HOTRGU^{\mathrm{(HOTRG)}}italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT and V(HOTRG)superscript𝑉HOTRGV^{\mathrm{(HOTRG)}}italic_V start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT, which minimize the cost function in Fig. 12, are both calculated. The isometries are found by truncated SVDs with singular values λ(U)superscript𝜆𝑈\lambda^{(U)}italic_λ start_POSTSUPERSCRIPT ( italic_U ) end_POSTSUPERSCRIPT and λ(V)superscript𝜆𝑉\lambda^{(V)}italic_λ start_POSTSUPERSCRIPT ( italic_V ) end_POSTSUPERSCRIPT. For example, for λ(U)superscript𝜆𝑈\lambda^{(U)}italic_λ start_POSTSUPERSCRIPT ( italic_U ) end_POSTSUPERSCRIPT:

x1,x2,y,yt,y1,y2subscriptsubscript𝑥1subscript𝑥2𝑦superscript𝑦𝑡superscriptsubscript𝑦1subscript𝑦2\displaystyle\sum_{x_{1},x_{2},y,y^{t},y_{1}^{\prime},y_{2}}∑ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y , italic_y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT Kx1ytx1ty1Kx2y2x2tytKx1yx1y1Kx2y2x2ysuperscriptsubscript𝐾subscript𝑥1superscript𝑦𝑡superscriptsuperscriptsubscript𝑥1𝑡superscriptsubscript𝑦1superscriptsubscript𝐾subscript𝑥2subscript𝑦2superscriptsuperscriptsubscript𝑥2𝑡superscript𝑦𝑡subscript𝐾subscript𝑥1𝑦superscriptsubscript𝑥1superscriptsubscript𝑦1subscript𝐾subscript𝑥2subscript𝑦2superscriptsubscript𝑥2𝑦\displaystyle K_{x_{1}y^{t}{x_{1}^{\prime}}^{t}y_{1}^{\prime}}^{*}K_{x_{2}y_{2% }{x_{2}^{\prime}}^{t}y^{t}}^{*}K_{x_{1}yx_{1}^{\prime}y_{1}^{\prime}}K_{x_{2}y% _{2}x_{2}^{\prime}y}italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y end_POSTSUBSCRIPT
a,bDsimilar-to-or-equalsabsentsuperscriptsubscript𝑎𝑏𝐷\displaystyle\simeq\sum_{a,b}^{D}≃ ∑ start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT Uax1tx2t(HOTRG)(λ(U))ab2Ubx1x2(HOTRG).subscriptsuperscript𝑈absentHOTRG𝑎superscriptsuperscriptsubscript𝑥1𝑡superscriptsuperscriptsubscript𝑥2𝑡subscriptsuperscriptsuperscript𝜆𝑈2𝑎𝑏subscriptsuperscript𝑈HOTRG𝑏superscriptsubscript𝑥1superscriptsubscript𝑥2\displaystyle U^{*(\mathrm{HOTRG})}_{a{x_{1}^{\prime}}^{t}{x_{2}^{\prime}}^{t}% }\left(\lambda^{(U)}\right)^{2}_{ab}U^{(\mathrm{HOTRG})}_{bx_{1}^{\prime}x_{2}% ^{\prime}}.italic_U start_POSTSUPERSCRIPT ∗ ( roman_HOTRG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_λ start_POSTSUPERSCRIPT ( italic_U ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT . (55)

Here, xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (xisubscriptsuperscript𝑥𝑖x^{\prime}_{i}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) are the indices that connect the tensor K𝐾Kitalic_K to its nearest neighbor to the left (right). Accordingly, yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (yisubscriptsuperscript𝑦𝑖y^{\prime}_{i}italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) connects to the next tensor below (above). Upper labels t𝑡titalic_t as in xtsuperscript𝑥𝑡x^{t}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT indicate that these bonds connect conjugate tensors. For brevity, we drop the indices in the following and use a shorthand notation like KKKKU(HOTRG)(λ(U))2U(HOTRG)similar-to-or-equalssuperscript𝐾superscript𝐾𝐾𝐾superscript𝑈absentHOTRGsuperscriptsuperscript𝜆𝑈2superscript𝑈HOTRGK^{\dagger}K^{\dagger}KK\simeq U^{\mathrm{\dagger(HOTRG)}}\left(\lambda^{(U)}% \right)^{2}U^{\mathrm{(HOTRG)}}italic_K start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_K italic_K ≃ italic_U start_POSTSUPERSCRIPT † ( roman_HOTRG ) end_POSTSUPERSCRIPT ( italic_λ start_POSTSUPERSCRIPT ( italic_U ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT.111On the notation for the SVD used here: a Hermitian matrix M𝑀Mitalic_M can be written as M=AA𝑀superscript𝐴𝐴M=A^{\dagger}Aitalic_M = italic_A start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_A. With the SVD A=UλV𝐴𝑈𝜆𝑉A=U\lambda Vitalic_A = italic_U italic_λ italic_V, we can decompose M𝑀Mitalic_M as M=VλUUλV=Vλ2V𝑀superscript𝑉𝜆superscript𝑈𝑈𝜆𝑉superscript𝑉superscript𝜆2𝑉M=V^{\dagger}\lambda U^{\dagger}U\lambda V=V^{\dagger}\lambda^{2}Vitalic_M = italic_V start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_λ italic_U start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_U italic_λ italic_V = italic_V start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_V. In actual calculations, we decompose M𝑀Mitalic_M in an SVD as M=UMλMVM𝑀subscript𝑈𝑀subscript𝜆𝑀subscript𝑉𝑀M=U_{M}\lambda_{M}V_{M}italic_M = italic_U start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT and identify V=VM=UM𝑉subscript𝑉𝑀superscriptsubscript𝑈𝑀V=V_{M}=U_{M}^{\dagger}italic_V = italic_V start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = italic_U start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT, λ2=λMsuperscript𝜆2subscript𝜆𝑀\lambda^{2}=\lambda_{M}italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_λ start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT. We use the names U𝑈Uitalic_U and V𝑉Vitalic_V interchangeably for isometries. Typically, we label isometries as U𝑈Uitalic_U, and call them V𝑉Vitalic_V whenever they have to be distinguished from a given U𝑈Uitalic_U because they act on different indices of a tensor. Furthermore, we do not put any daggers \dagger on tensors in SVDs. With this convention, isometries are always applied in the form Usuperscript𝑈U^{\dagger}italic_U start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT or Vsuperscript𝑉V^{\dagger}italic_V start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT to the tensors when indices shall be combined and truncated. The indices can be reconstructed from the corresponding diagrams.

In the cost function CUsubscript𝐶𝑈C_{U}italic_C start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT in Fig. 12, U(HOTRG)superscript𝑈absentHOTRGU^{\dagger(\mathrm{HOTRG})}italic_U start_POSTSUPERSCRIPT † ( roman_HOTRG ) end_POSTSUPERSCRIPT is applied to the right indices of the K𝐾Kitalic_K tensors. Instead, one can also apply an isometry to the left indices. The corresponding cost function CVsubscript𝐶𝑉C_{V}italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT is minimized by V(HOTRG)superscript𝑉absentHOTRGV^{\dagger(\mathrm{HOTRG})}italic_V start_POSTSUPERSCRIPT † ( roman_HOTRG ) end_POSTSUPERSCRIPT as an isometry. In general, the isometries U(HOTRG)superscript𝑈HOTRGU^{(\mathrm{HOTRG})}italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT and V(HOTRG)superscript𝑉HOTRGV^{(\mathrm{HOTRG})}italic_V start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT are different. In the usual HOTRG algorithm, the cost functions CUsubscript𝐶𝑈C_{U}italic_C start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT and CVsubscript𝐶𝑉C_{V}italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT are computed by summing the squared truncated singular values (λ>D(U))2superscriptsuperscriptsubscript𝜆absent𝐷𝑈2\left(\lambda_{>D}^{(U)}\right)^{2}( italic_λ start_POSTSUBSCRIPT > italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and (λ>D(V))2superscriptsuperscriptsubscript𝜆absent𝐷𝑉2\left(\lambda_{>D}^{(V)}\right)^{2}( italic_λ start_POSTSUBSCRIPT > italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_V ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT in both cases. Then, the isometry which corresponds to the smaller cost function is chosen for the truncation step. This introduces a systematic error, which favors one direction (left or right in Fig. 12) in the truncation. In the case of symmetric initial tensors, CUsubscript𝐶𝑈C_{U}italic_C start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT and CVsubscript𝐶𝑉C_{V}italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT are the same in each step and thus no choice is needed. Since no direction is favored in this case, the algorithm is more suited for symmetric initial tensors than for non-symmetric, in agreement with our numerical observations.

CP1,P2=Refer to captionsubscript𝐶subscript𝑃1subscript𝑃2Refer to captionC_{P_{1},P_{2}}=\parbox{359.90538pt}{ \includegraphics[width=433.62pt]{./Figure/append_fig/00_Iso_of_bHOTRG.pdf} }italic_C start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT =

Figure 13: Cost function of the squeezers P1(bHOTRG)superscriptsubscript𝑃1bHOTRGP_{1}^{\mathrm{(bHOTRG)}}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT and P2(bHOTRG)superscriptsubscript𝑃2bHOTRGP_{2}^{\mathrm{(bHOTRG)}}italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT for boundary HOTRG.

In the boundary TRG method this decision is not applied. Instead, squeezers are created from a combination of U(HOTRG)superscript𝑈HOTRGU^{\mathrm{(HOTRG)}}italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT and V(HOTRG)superscript𝑉HOTRGV^{\mathrm{(HOTRG)}}italic_V start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT. These squeezers are used for the truncation in the coarse graining step. The procedure minimizes the cost function in Fig. 13. First, the isometries are calculated without truncation as in Eq. 55 and similarly for V(HOTRG)superscript𝑉HOTRGV^{\mathrm{(HOTRG)}}italic_V start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT. Then, a truncated SVD is performed:

λ(U)U(HOTRG)V(HOTRG)λ(V)UΛV.similar-to-or-equalssuperscript𝜆𝑈superscript𝑈HOTRGsuperscript𝑉HOTRGsuperscript𝜆𝑉𝑈Λ𝑉{\lambda^{(U)}}U^{\mathrm{(HOTRG)}}V^{\mathrm{(HOTRG)}}{\lambda^{(V)}}\simeq U% \Lambda V.italic_λ start_POSTSUPERSCRIPT ( italic_U ) end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT italic_V start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT italic_λ start_POSTSUPERSCRIPT ( italic_V ) end_POSTSUPERSCRIPT ≃ italic_U roman_Λ italic_V . (56)

The squeezers can be constructed from these tensors and the previous isometries:

P1(bHOTRG)superscriptsubscript𝑃1bHOTRG\displaystyle P_{1}^{\mathrm{(bHOTRG)}}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT \displaystyle\equiv V(HOTRG)λ(V)V/Λsuperscript𝑉HOTRGsuperscript𝜆𝑉superscript𝑉Λ\displaystyle V^{\mathrm{(HOTRG)}}{\lambda^{(V)}}V^{\dagger}/\sqrt{\Lambda}italic_V start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT italic_λ start_POSTSUPERSCRIPT ( italic_V ) end_POSTSUPERSCRIPT italic_V start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT / square-root start_ARG roman_Λ end_ARG (57)
P2(bHOTRG)superscriptsubscript𝑃2bHOTRG\displaystyle P_{2}^{\mathrm{(bHOTRG)}}italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT \displaystyle\equiv (1/Λ)Uλ(U)U(HOTRG).1Λsuperscript𝑈superscript𝜆𝑈superscript𝑈HOTRG\displaystyle(1/\sqrt{\Lambda})U^{\dagger}{\lambda^{(U)}}U^{\mathrm{(HOTRG)}}.( 1 / square-root start_ARG roman_Λ end_ARG ) italic_U start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_λ start_POSTSUPERSCRIPT ( italic_U ) end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT . (58)

The total computational cost is of the same order as the original HOTRG, and the calculation of P1subscript𝑃1P_{1}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and P2subscript𝑃2P_{2}italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is not the dominant cost in the renormalization step. The results of this boundary HOTRG method are much less dependent on the symmetry properties of the initial tensors as discussed in Sec. III. Therefore, the method creates more reliable results. In addition, the cost function of the boundary HOTRG in Fig. 13 approximates four tensors instead of two for the usual HOTRG as in Fig. 12. The approximation takes into account a larger region and can thus improve the accuracy of the approximation. We note that the bond-weighted TRG method for HOTRG is also based on the boundary TRG truncation Adachi et al. (2022).

The ideas presented here can generally be used in any TRG method with isometries. Replacing U(HOTRG)P1(bHOTRG),P2(bHOTRG)superscript𝑈HOTRGsuperscriptsubscript𝑃1bHOTRGsuperscriptsubscript𝑃2bHOTRGU^{\mathrm{(HOTRG)}}\rightarrow P_{1}^{\mathrm{(bHOTRG)}},P_{2}^{\mathrm{(% bHOTRG)}}italic_U start_POSTSUPERSCRIPT ( roman_HOTRG ) end_POSTSUPERSCRIPT → italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_bHOTRG ) end_POSTSUPERSCRIPT does not require significant additional computational costs but can strongly reduce the initial tensor dependence.

Appendix E ATRG, MDTRG and variants

We explain the coarse graining steps with ATRG and MDTRG in this appendix. We also introduce variants of the established algorithms and benchmark the different methods for the two-dimensional Ising model.

The accuracy of the free energy depends on the method used in the coarse-graining step. Particularly, we observe that algorithms which use isometries to create the indices of the next coarse-grained tensors K(next)superscript𝐾nextK^{\mathrm{(next)}}italic_K start_POSTSUPERSCRIPT ( roman_next ) end_POSTSUPERSCRIPT are highly dependent on the initial tensor properties.

We start from the partition function Z=tr(iKxiyixiyi)𝑍trsubscriptproduct𝑖subscript𝐾subscript𝑥𝑖subscript𝑦𝑖subscriptsuperscript𝑥𝑖subscriptsuperscript𝑦𝑖Z=\mathrm{tr}\left(\prod_{i}K_{x_{i}y_{i}x^{\prime}_{i}y^{\prime}_{i}}\right)italic_Z = roman_tr ( ∏ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), where xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (xisubscriptsuperscript𝑥𝑖x^{\prime}_{i}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) are the indices that connect a lattice point at site i𝑖iitalic_i to its nearest neighbor in negative (positive) x𝑥xitalic_x-direction. Accordingly, yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (yisubscriptsuperscript𝑦𝑖y^{\prime}_{i}italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) connects to the next tensor in negative (positive) y𝑦yitalic_y-direction. Note that xi+1=xisubscript𝑥𝑖1subscriptsuperscript𝑥𝑖x_{i+1}=x^{\prime}_{i}italic_x start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and yi+1=yisubscript𝑦𝑖1subscriptsuperscript𝑦𝑖y_{i+1}=y^{\prime}_{i}italic_y start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The trace trtr\mathrm{tr}roman_tr implies a summation over all indices. K𝐾Kitalic_K can, for example, be K(delta)superscript𝐾deltaK^{\mathrm{(delta)}}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT or K(exp)superscript𝐾expK^{\mathrm{(exp)}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT as defined in Sec. III.

Tensor renormalization group algorithms provide a way to coarse-grain a given tensor network to a new network with fewer tensors. This step is approximate to avoid an exponential growth of the numerical costs, and the algorithms differ in the way they truncate the tensors. Typically, two tensors of an initial lattice are replaced by one tensor on a coarse-grained lattice. We restrict ourselves to square lattices in two dimensions but note that most algorithms discussed here can be generalized to higher dimensions. In short, the goal of a tensor renormalization group algorithm is to find the coarse-grained tensor K(next)superscript𝐾nextK^{\mathrm{(next)}}italic_K start_POSTSUPERSCRIPT ( roman_next ) end_POSTSUPERSCRIPT from the initial tensor K𝐾Kitalic_K,

KxyxyKXYXY(next).subscript𝐾𝑥𝑦superscript𝑥superscript𝑦subscriptsuperscript𝐾next𝑋𝑌superscript𝑋superscript𝑌K_{xyx^{\prime}y^{\prime}}\rightarrow K^{\mathrm{(next)}}_{XYX^{\prime}Y^{% \prime}}.italic_K start_POSTSUBSCRIPT italic_x italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT → italic_K start_POSTSUPERSCRIPT ( roman_next ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_Y italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_Y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT . (59)
Refer to caption
Figure 14: Initial steps of a coarse-graining iteration in ATRG-like algorithms. From (a) to (b): the upper (lower) initial tensor is split by an SVD into tensors E𝐸Eitalic_E and F𝐹Fitalic_F (G𝐺Gitalic_G and H𝐻Hitalic_H). From (b) to (c): the tensors F𝐹Fitalic_F and G𝐺Gitalic_G are contracted and approximated by an SVD in order to change the index direction. See main text for details.

For ATRG and MDTRG, we consider two nearest neighbor tensors yKx1yx1y1Kx2y2x2ysubscript𝑦subscript𝐾subscript𝑥1𝑦subscriptsuperscript𝑥1subscriptsuperscript𝑦1subscript𝐾subscript𝑥2subscript𝑦2subscriptsuperscript𝑥2𝑦\sum_{y}K_{x_{1}yx^{\prime}_{1}y^{\prime}_{1}}K_{x_{2}y_{2}x^{\prime}_{2}y}∑ start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT in the coarse-graining step. The tensors are first decomposed into triads, as shown in Fig. 14(a) to (b). For this, the initial tensors of the translational invariant network are split using an SVD:

Kxyxyb,cDHxybλbcExyc.similar-to-or-equalssubscript𝐾𝑥𝑦superscript𝑥superscript𝑦superscriptsubscript𝑏𝑐𝐷subscript𝐻𝑥𝑦𝑏subscript𝜆𝑏𝑐subscript𝐸superscript𝑥superscript𝑦𝑐K_{xyx^{\prime}y^{\prime}}\simeq\sum_{b,c}^{D}H_{xyb}\lambda_{bc}E_{x^{\prime}% y^{\prime}c}.italic_K start_POSTSUBSCRIPT italic_x italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≃ ∑ start_POSTSUBSCRIPT italic_b , italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT italic_H start_POSTSUBSCRIPT italic_x italic_y italic_b end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_b italic_c end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_c end_POSTSUBSCRIPT . (60)

Here, H𝐻Hitalic_H and E𝐸Eitalic_E are truncated unitary matrices or isometries, and λ𝜆\lambdaitalic_λ is a diagonal matrix with non-negative entries. The smallest singular values are dropped in order not to exceed a maximum bond dimension D𝐷Ditalic_D in the algorithm. Note that we do not use internal line oversampling in this paper, so we truncate the singular values in intermediate steps to the bond dimension D𝐷Ditalic_D everywhere. We define the triad tensors

Fxyesubscript𝐹𝑥𝑦𝑒absent\displaystyle F_{xye}\equivitalic_F start_POSTSUBSCRIPT italic_x italic_y italic_e end_POSTSUBSCRIPT ≡ bHxybλbesubscript𝑏subscript𝐻𝑥𝑦𝑏subscript𝜆𝑏𝑒\displaystyle\sum_{b}H_{xyb}\lambda_{be}∑ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_x italic_y italic_b end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_b italic_e end_POSTSUBSCRIPT (61)
Gxygsubscript𝐺superscript𝑥superscript𝑦𝑔absent\displaystyle G_{x^{\prime}y^{\prime}g}\equivitalic_G start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_g end_POSTSUBSCRIPT ≡ cExycλcg.subscript𝑐subscript𝐸superscript𝑥superscript𝑦𝑐subscript𝜆𝑐𝑔\displaystyle\sum_{c}E_{x^{\prime}y^{\prime}c}\lambda_{cg}.∑ start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_c end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_c italic_g end_POSTSUBSCRIPT . (62)

The contraction of two neighboring tensors in the initial network can then be written as

yKx1yx1y1Kx2y2x2yy,e,gEx1y1eFx1yeGx2ygHx2y2g,similar-to-or-equalssubscript𝑦subscript𝐾subscript𝑥1𝑦subscriptsuperscript𝑥1subscriptsuperscript𝑦1subscript𝐾subscript𝑥2subscript𝑦2subscriptsuperscript𝑥2𝑦subscript𝑦𝑒𝑔subscript𝐸subscriptsuperscript𝑥1subscriptsuperscript𝑦1𝑒subscript𝐹subscript𝑥1𝑦𝑒subscript𝐺subscriptsuperscript𝑥2𝑦𝑔subscript𝐻subscript𝑥2subscript𝑦2𝑔\sum_{y}K_{x_{1}yx^{\prime}_{1}y^{\prime}_{1}}K_{x_{2}y_{2}x^{\prime}_{2}y}% \simeq\sum_{y,e,g}E_{x^{\prime}_{1}y^{\prime}_{1}e}F_{x_{1}ye}G_{x^{\prime}_{2% }yg}H_{x_{2}y_{2}g},∑ start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ≃ ∑ start_POSTSUBSCRIPT italic_y , italic_e , italic_g end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y italic_e end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y italic_g end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT , (63)

corresponding to Fig. 14(b).

E.1 ATRG and variants

In the ATRG method, an additional SVD is applied to swap the indices in x𝑥xitalic_x-direction as shown in Fig. 14(c):

yFx1yeGx2ygsimilar-to-or-equalssubscript𝑦subscript𝐹subscript𝑥1𝑦𝑒subscript𝐺subscriptsuperscript𝑥2𝑦𝑔absent\displaystyle\sum_{y}F_{x_{1}ye}G_{x^{\prime}_{2}yg}\simeq∑ start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y italic_e end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y italic_g end_POSTSUBSCRIPT ≃ f,hDF~x2feλfhG~x1hgsuperscriptsubscript𝑓𝐷subscript~superscript𝐹subscriptsuperscript𝑥2𝑓𝑒subscriptsuperscript𝜆𝑓subscript~superscript𝐺subscript𝑥1𝑔\displaystyle\sum_{f,h}^{D}\tilde{F^{\prime}}_{x^{\prime}_{2}fe}\lambda^{% \prime}_{fh}\tilde{G^{\prime}}_{x_{1}hg}∑ start_POSTSUBSCRIPT italic_f , italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT over~ start_ARG italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT italic_λ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f italic_h end_POSTSUBSCRIPT over~ start_ARG italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_h italic_g end_POSTSUBSCRIPT (64)
=\displaystyle== fFx2feGx1fg.subscript𝑓subscriptsuperscript𝐹subscriptsuperscript𝑥2𝑓𝑒subscriptsuperscript𝐺subscript𝑥1𝑓𝑔\displaystyle\sum_{f}F^{\prime}_{x^{\prime}_{2}fe}G^{\prime}_{x_{1}fg}.∑ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f italic_g end_POSTSUBSCRIPT . (65)

The singular values λsuperscript𝜆\lambda^{\prime}italic_λ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are included in Fsuperscript𝐹F^{\prime}italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with square root λsuperscript𝜆\sqrt{\lambda^{\prime}}square-root start_ARG italic_λ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG factors.

CATRG=Refer to captionsuperscript𝐶ATRGRefer to captionC^{\mathrm{ATRG}}=\parbox{359.90538pt}{ \includegraphics[width=170.71652pt]{./Figure/append_fig/00_Iso_of_ATRG.pdf} }italic_C start_POSTSUPERSCRIPT roman_ATRG end_POSTSUPERSCRIPT =
Figure 15: Cost function to be minimized by the isometry U(ATRG)superscript𝑈ATRGU^{\mathrm{(ATRG)}}italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT in the isometric ATRG algorithm.
Refer to caption
Figure 16: Final step to obtain the coarse grained tensor K(Iso,ATRG)superscript𝐾IsoATRGK^{\mathrm{(Iso,ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_Iso , roman_ATRG ) end_POSTSUPERSCRIPT in the isometric ATRG algorithm. E,F,G,H𝐸superscript𝐹superscript𝐺𝐻E,F^{\prime},G^{\prime},Hitalic_E , italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_H as in Fig. 14, U(ATRG)superscript𝑈ATRGU^{\mathrm{(ATRG)}}italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT from Fig. 15.

Isometric ATRG.

In the isometric ATRG, two indices x1subscript𝑥1x_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and x2subscript𝑥2x_{2}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are combined by applying an isometry U(ATRG)superscript𝑈ATRGU^{\mathrm{(ATRG)}}italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT. This tensor is obtained by an SVD of a combination of triads: EFGHHGFE=U(ATRG)(λ(ATRG))2U(ATRG)𝐸superscript𝐹superscript𝐺𝐻superscript𝐻superscriptsuperscript𝐺superscriptsuperscript𝐹superscript𝐸superscript𝑈ATRGsuperscriptsuperscript𝜆ATRG2superscript𝑈absentATRGEF^{\prime}G^{\prime}HH^{\dagger}{G^{\prime}}^{\dagger}{F^{\prime}}^{\dagger}E% ^{\dagger}=U^{\mathrm{(ATRG)}}\left(\lambda^{\mathrm{(ATRG)}}\right)^{2}U^{% \mathrm{\dagger(ATRG)}}italic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT ( italic_λ start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT † ( roman_ATRG ) end_POSTSUPERSCRIPT.1 This minimizes the cost function in Fig. 15.

We finally calculate the coarse-grained tensor K(Iso,ATRG)superscript𝐾IsoATRGK^{\mathrm{(Iso,ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_Iso , roman_ATRG ) end_POSTSUPERSCRIPT, as shown in Fig. 16, K(Iso,ATRG)=UEFGHUsuperscript𝐾IsoATRGsuperscript𝑈𝐸superscript𝐹superscript𝐺𝐻𝑈K^{\mathrm{(Iso,ATRG)}}=U^{\dagger}EF^{\prime}G^{\prime}HUitalic_K start_POSTSUPERSCRIPT ( roman_Iso , roman_ATRG ) end_POSTSUPERSCRIPT = italic_U start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H italic_U.

ATRG without isometries, and shifted ATRG.

Refer to caption
Figure 17: Final coarse graining steps of the ATRG and shifted ATRG algorithms. (a) Initial tensor decomposition of four tensors, see Fig. 14 for details. From (a) to (b): the SVD of EFGH𝐸superscript𝐹superscript𝐺𝐻EF^{\prime}G^{\prime}Hitalic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H leads to the tensors M𝑀Mitalic_M and L𝐿Litalic_L. From (b) to (c): the tensors M𝑀Mitalic_M and L𝐿Litalic_L are contracted to form the tensor K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT which is the new coarse grained tensor in the ATRG algorithm. From (b) to (d): the tensors M𝑀Mitalic_M and L𝐿Litalic_L are contracted to form the tensor J𝐽Jitalic_J which is the new coarse grained tensor K(sh,ATRG)superscript𝐾shATRGK^{\mathrm{(sh,ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_sh , roman_ATRG ) end_POSTSUPERSCRIPT in the shifted ATRG algorithm.

We discuss variants of the ATRG algorithm which do not rely on the applications of isometries as before. Instead, we use further contractions and SVDs. See Fig. 17 for a graphical representation of the individual steps.

First, we take the SVD of the tensor composition EFGH𝐸superscript𝐹superscript𝐺𝐻EF^{\prime}G^{\prime}Hitalic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H from Fig. 14(b) as

x1,x2,e,gEx1yeFx2feGx1fgHx2ygsubscriptsubscriptsuperscript𝑥1subscriptsuperscript𝑥2𝑒𝑔subscript𝐸subscriptsuperscript𝑥1superscript𝑦𝑒subscriptsuperscript𝐹subscriptsuperscript𝑥2𝑓𝑒subscriptsuperscript𝐺subscriptsuperscript𝑥1superscript𝑓𝑔subscript𝐻subscriptsuperscript𝑥2𝑦𝑔\displaystyle\sum_{x^{\prime}_{1},x^{\prime}_{2},e,g}E_{x^{\prime}_{1}y^{% \prime}e}F^{\prime}_{x^{\prime}_{2}fe}G^{\prime}_{x^{\prime}_{1}f^{\prime}g}H_% {x^{\prime}_{2}yg}∑ start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_e , italic_g end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_e end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_g end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y italic_g end_POSTSUBSCRIPT
X,XM~fyXλXX(LM)L~fyX=XMfyXLfyXsimilar-to-or-equalsabsentsubscript𝑋superscript𝑋subscript~𝑀𝑓superscript𝑦superscript𝑋subscriptsuperscript𝜆𝐿𝑀superscript𝑋𝑋subscript~𝐿superscript𝑓𝑦𝑋subscript𝑋subscript𝑀𝑓superscript𝑦𝑋subscript𝐿superscript𝑓𝑦𝑋\displaystyle\simeq\sum_{X,X^{\prime}}\tilde{M}_{fy^{\prime}X^{\prime}}\lambda% ^{(LM)}_{X^{\prime}X}\tilde{L}_{f^{\prime}yX}=\sum_{X}{M}_{fy^{\prime}X}{L}_{f% ^{\prime}yX}≃ ∑ start_POSTSUBSCRIPT italic_X , italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT over~ start_ARG italic_M end_ARG start_POSTSUBSCRIPT italic_f italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_λ start_POSTSUPERSCRIPT ( italic_L italic_M ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_X end_POSTSUBSCRIPT over~ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y italic_X end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_f italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_X end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y italic_X end_POSTSUBSCRIPT
=Jfyfy.absentsubscript𝐽𝑓𝑦superscript𝑓superscript𝑦\displaystyle=J_{fyf^{\prime}y^{\prime}}.= italic_J start_POSTSUBSCRIPT italic_f italic_y italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT . (66)

We define the shifted ATRG, which takes these tensors J𝐽Jitalic_J as the coarse grained tensors:

KXyXy(sh,ATRG)=JXyXy.subscriptsuperscript𝐾shATRG𝑋𝑦superscript𝑋superscript𝑦subscript𝐽𝑋𝑦superscript𝑋superscript𝑦K^{\mathrm{(sh,ATRG)}}_{XyX^{\prime}y^{\prime}}=J_{XyX^{\prime}y^{\prime}}.italic_K start_POSTSUPERSCRIPT ( roman_sh , roman_ATRG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_y italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_J start_POSTSUBSCRIPT italic_X italic_y italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT . (67)

Alternatively, another contraction defines the coarse grained tensor of ATRG without isometry,

KXyXy(ATRG)=fMfyXLfyX.subscriptsuperscript𝐾ATRG𝑋𝑦superscript𝑋superscript𝑦subscript𝑓subscript𝑀𝑓superscript𝑦superscript𝑋subscript𝐿𝑓𝑦𝑋K^{\mathrm{(ATRG)}}_{XyX^{\prime}y^{\prime}}=\sum_{f}M_{fy^{\prime}X^{\prime}}% L_{fyX}.italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_y italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_f italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_f italic_y italic_X end_POSTSUBSCRIPT . (68)

The SVD which leads to M𝑀Mitalic_M and L𝐿Litalic_L requires 𝒪(D6)𝒪superscript𝐷6{\mathcal{O}}(D^{6})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT ) operations if we do not apply a truncated SVD method. If we apply the ideas of the randomized SVD instead, the costs can be reduced to 𝒪(D5)𝒪superscript𝐷5{\mathcal{O}}(D^{5})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ). See Morita et al. (2018); Kadoh and Nakayama (2019); Nakayama (2023) for more details.

The method to create the coarse-grained tensors K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT is equivalent to the original introduction of ATRG in Adachi et al. (2020). The original ATRG method can be understood as a replacement of the isometries U(ATRG)superscript𝑈ATRGU^{(\mathrm{ATRG})}italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT in the isometric ATRG as in Fig. 16 by squeezers. These originate from the truncated SVD in Eq. 66. Explicitly, the squeezers are:

P1(ATRG)=superscriptsubscript𝑃1ATRGabsent\displaystyle P_{1}^{(\mathrm{ATRG})}=italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT = [GH]L~/λ(LM)delimited-[]superscript𝐺𝐻superscript~𝐿superscript𝜆𝐿𝑀\displaystyle[G^{\prime}H]\tilde{L}^{\dagger}/\sqrt{\lambda^{(LM)}}[ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H ] over~ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT / square-root start_ARG italic_λ start_POSTSUPERSCRIPT ( italic_L italic_M ) end_POSTSUPERSCRIPT end_ARG (69)
P2(ATRG)=superscriptsubscript𝑃2ATRGabsent\displaystyle P_{2}^{(\mathrm{ATRG})}=italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT = (1/λ(LM))M~[EF].1superscript𝜆𝐿𝑀superscript~𝑀delimited-[]𝐸superscript𝐹\displaystyle(1/\sqrt{\lambda^{(LM)}})\tilde{M}^{\dagger}[EF^{\prime}].( 1 / square-root start_ARG italic_λ start_POSTSUPERSCRIPT ( italic_L italic_M ) end_POSTSUPERSCRIPT end_ARG ) over~ start_ARG italic_M end_ARG start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] . (70)

The algorithms which create K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT and K(Iso,ATRG)superscript𝐾IsoATRGK^{\mathrm{(Iso,ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_Iso , roman_ATRG ) end_POSTSUPERSCRIPT differ in the regions that are approximated in the truncation step, and in the way how the coarse-grained tensors are constructed. The shifted ATRG also creates a different approximation compared to K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT. This method can, however, only be used to coarse-grain the indices in one direction. For example, the tensor J𝐽Jitalic_J would have additional indices for the z𝑧zitalic_z-direction in three dimensions. JXyz1z2Xyz1z2subscript𝐽𝑋𝑦subscript𝑧1subscript𝑧2superscript𝑋superscript𝑦superscriptsubscript𝑧1superscriptsubscript𝑧2J_{Xyz_{1}z_{2}X^{\prime}y^{\prime}z_{1}^{\prime}z_{2}^{\prime}}italic_J start_POSTSUBSCRIPT italic_X italic_y italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT has D8superscript𝐷8D^{8}italic_D start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT elements, and creating it directly is not possible within the leading costs of 𝒪(D7)𝒪superscript𝐷7{\mathcal{O}}(D^{7})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT ) for the ATRG methods in three dimensions. Shifted ATRG is thus only applicable for two-dimensional systems or in combination with other methods which coarse-grain the additional directions beforehand. The other two ATRG methods (isometric ATRG and ATRG) can be directly generalized to higher dimensions Adachi et al. (2020).

Shifted isometric ATRG.

Refer to caption
Figure 18: Final coarse-graining steps of the shifted isometric ATRG. (a) The initial tensor decomposition of four tensors (see Fig. 14 for details) is approximated by inserting the isometries from Fig. 15. (b) This leads to the new coarse-grained tensors K(shIso,ATRG)superscript𝐾shIsoATRGK^{\mathrm{(shIso,ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_shIso , roman_ATRG ) end_POSTSUPERSCRIPT.

Instead of using randomized techniques for the contraction or SVD of J𝐽Jitalic_J, we can approximate the contraction using the isometry U(ATRG)superscript𝑈ATRGU^{\mathrm{(ATRG)}}italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT that was introduced for the isometric ATRG: J=EFGHEFU(ATRG)U(ATRG)GH𝐽𝐸superscript𝐹superscript𝐺𝐻similar-to-or-equals𝐸superscript𝐹superscript𝑈ATRGsuperscript𝑈absentATRGsuperscript𝐺𝐻J=EF^{\prime}G^{\prime}H\simeq EF^{\prime}U^{\mathrm{(ATRG)}}U^{\mathrm{% \dagger(ATRG)}}G^{\prime}Hitalic_J = italic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H ≃ italic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT † ( roman_ATRG ) end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H. We call this method shifted isometric ATRG. It is shown in Fig. 18. Note that the isometry does not create the indices of the coarse-grained tensors directly, since all indices of U(ATRG)superscript𝑈ATRGU^{\mathrm{(ATRG)}}italic_U start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT are contracted. This approximation of J𝐽Jitalic_J may not be optimal, because the isometry is not calculated from the same subregion of the tensor network as J𝐽Jitalic_J itself: U𝑈Uitalic_U optimizes (HG)(FE)UU𝐻superscript𝐺superscript𝐹𝐸𝑈superscript𝑈(HG^{\prime})(F^{\prime}E)UU^{\dagger}( italic_H italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ( italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_E ) italic_U italic_U start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT, not (FE)UU(HG)superscript𝐹𝐸𝑈superscript𝑈𝐻superscript𝐺(F^{\prime}E)UU^{\dagger}(HG^{\prime})( italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_E ) italic_U italic_U start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_H italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). We include this method in our benchmark, however, to test the accuracy of a method that uses isometries for the contractions.

CMDTRG=Refer to captionsuperscript𝐶MDTRGRefer to captionC^{\mathrm{MDTRG}}=\parbox{359.90538pt}{ \includegraphics[width=170.71652pt]{./Figure/append_fig/00_Iso_of_MDTRG.pdf} }italic_C start_POSTSUPERSCRIPT roman_MDTRG end_POSTSUPERSCRIPT =

Figure 19: Cost function of the isometry U(MDTRG)superscript𝑈MDTRGU^{\mathrm{(MDTRG)}}italic_U start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT for MDTRG.
Refer to caption
Figure 20: Coarse-graining iteration of the MDTRG algorithm. From (a) to (b): the upper (lower) initial tensor is split by an SVD into tensors E𝐸Eitalic_E and F𝐹Fitalic_F (G𝐺Gitalic_G and H𝐻Hitalic_H), similar to ATRG (see Fig. 14). From (b) to (c): isometries are applied, which minimize the cost function in Fig. 19. From (c) to (d): an approximate contraction leads to the coarse-grained tensors K(MDTRG)superscript𝐾MDTRGK^{\mathrm{(MDTRG)}}italic_K start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT.

E.2 MDTRG and variants

In the following we explain the MDTRG method and also introduce a variation of it. The method is similar to the TTRG Kadoh and Nakayama (2019) but with a different approximation in the contraction step. Compared to ATRG, the index swapping (from (b) to (c) in Fig. 14) is omitted and the tensors EFGH𝐸𝐹𝐺𝐻EFGHitalic_E italic_F italic_G italic_H are directly used instead of EFGH𝐸superscript𝐹superscript𝐺𝐻EF^{\prime}G^{\prime}Hitalic_E italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H.

For MDTRG, we calculate the isometry U(MDTRG)superscript𝑈MDTRGU^{\mathrm{(MDTRG)}}italic_U start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT from the tensors EFGH𝐸𝐹𝐺𝐻EFGHitalic_E italic_F italic_G italic_H. The cost function is shown in Fig. 19. Namely, we use the decomposition EFGHHGFE=U(MDTRG)(λ(MDTRG))2U(MDTRG)𝐸𝐹𝐺𝐻superscript𝐻superscript𝐺superscript𝐹superscript𝐸superscript𝑈MDTRGsuperscriptsuperscript𝜆MDTRG2superscript𝑈absentMDTRGEFGHH^{\dagger}G^{\dagger}F^{\dagger}E^{\dagger}=U^{\mathrm{(MDTRG)}}\left(% \lambda^{\mathrm{(MDTRG)}}\right)^{2}U^{\mathrm{\dagger(MDTRG)}}italic_E italic_F italic_G italic_H italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_F start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = italic_U start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT ( italic_λ start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT † ( roman_MDTRG ) end_POSTSUPERSCRIPT. Note that the lefthand-side of this equation is Hermitian, and thus the left- and right-singular vectors are equal on the righthand-side.1 The coarse grained tensor is then obtained, as shown in Fig. 20, by a contraction with the isometries:

KXyXy(MDTRG)=subscriptsuperscript𝐾MDTRG𝑋𝑦superscript𝑋superscript𝑦absent\displaystyle K^{\mathrm{(MDTRG)}}_{XyX^{\prime}y^{\prime}}=italic_K start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_y italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = x1,x2,x1,x2,y,e,gUx1x2XMDTRG)Ux1x2X(MDTRG)\displaystyle\sum_{x_{1},x_{2},x^{\prime}_{1},x^{\prime}_{2},y,e,g}U_{x^{% \prime}_{1}x^{\prime}_{2}X^{\prime}}^{\mathrm{*MDTRG)}}U_{x_{1}x_{2}X}^{% \mathrm{(MDTRG)}}∑ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y , italic_e , italic_g end_POSTSUBSCRIPT italic_U start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ roman_MDTRG ) end_POSTSUPERSCRIPT italic_U start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT
×Ex1yeFx1yeGx2ygHx2yg.absentsubscript𝐸subscriptsuperscript𝑥1superscript𝑦𝑒subscript𝐹subscript𝑥1𝑦𝑒subscript𝐺subscriptsuperscript𝑥2𝑦𝑔subscript𝐻subscript𝑥2𝑦𝑔\displaystyle\times E_{x^{\prime}_{1}y^{\prime}e}F_{x_{1}ye}G_{x^{\prime}_{2}% yg}H_{x_{2}yg}.× italic_E start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_e end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y italic_e end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y italic_g end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_y italic_g end_POSTSUBSCRIPT . (71)

This contraction requires a truncated SVD method to reduce the costs, in two dimensions to 𝒪(D5)𝒪superscript𝐷5{\mathcal{O}}(D^{5})caligraphic_O ( italic_D start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ). We use the randomized SVD, as in Nakayama (2023).

Using the approximation, we get the triad representation of the K(MDTRG)superscript𝐾MDTRGK^{\mathrm{(MDTRG)}}italic_K start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT as the SVD of K(MDTRG)superscript𝐾MDTRGK^{\mathrm{(MDTRG)}}italic_K start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT with square root weight,

KAyAy(MDTRG)nNAynOAyn.similar-to-or-equalssubscriptsuperscript𝐾MDTRG𝐴𝑦superscript𝐴superscript𝑦subscript𝑛subscript𝑁𝐴𝑦𝑛subscript𝑂superscript𝐴superscript𝑦𝑛K^{\mathrm{(MDTRG)}}_{AyA^{\prime}y^{\prime}}\simeq\sum_{n}N_{Ayn}O_{A^{\prime% }y^{\prime}n}.italic_K start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A italic_y italic_A start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≃ ∑ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_A italic_y italic_n end_POSTSUBSCRIPT italic_O start_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_n end_POSTSUBSCRIPT . (72)

Replacing the isometries by squeezers and applying the ideas of the boundary TRG (see App. D) to MDTRG is straightforward. For this boundary MDTRG method, we calculate U(MDTRG)superscript𝑈MDTRGU^{\mathrm{(MDTRG)}}italic_U start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT and V(MDTRG)superscript𝑉MDTRGV^{\mathrm{(MDTRG)}}italic_V start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT by a randomized SVD with oversampling size rD𝑟𝐷rDitalic_r italic_D, and compute the squeezers P1(MDTRG)superscriptsubscript𝑃1MDTRGP_{1}^{\mathrm{(MDTRG)}}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT and P2(MDTRG)superscriptsubscript𝑃2MDTRGP_{2}^{\mathrm{(MDTRG)}}italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_MDTRG ) end_POSTSUPERSCRIPT from this.

Refer to caption
Figure 21: Coarse-graining iteration of the MDTRG algorithm. From (a) to (b): the upper (lower) initial tensor is split by an SVD into tensors E𝐸Eitalic_E and F𝐹Fitalic_F (G𝐺Gitalic_G and H𝐻Hitalic_H), similar to ATRG (see Fig. 14). From (b) to (c): isometries are constructed, which minimize the cost function in Fig. 19. From (c) to (d): an approximate SVD of the tensor network in (c) leads to the the decomposition into O𝑂Oitalic_O and N𝑁Nitalic_N, with square root weighted singular values included. From (d) to (e): the indices are shifted, making use of translational invariance. From (e) to (f): contraction of O𝑂Oitalic_O and N𝑁Nitalic_N leads to the coarse-grained tensors K(sh,MDTRG)superscript𝐾shMDTRGK^{\mathrm{(sh,MDTRG)}}italic_K start_POSTSUPERSCRIPT ( roman_sh , roman_MDTRG ) end_POSTSUPERSCRIPT.

Furthermore, we define the shifted MDTRG as depicted in Fig. 21. In the previous MDTRG algorithm, the tensors EFGH𝐸𝐹𝐺𝐻EFGHitalic_E italic_F italic_G italic_H and the isometries were contracted to form the new coarse-grained tensors. Instead, the shifted MDTRG replaces this contraction by an approximate SVD, which can be applied efficiently to the tensor network. From this tensor decomposition we obtain truncated unitaries, which are combined with the square roots of the singular values to form new tensors O𝑂Oitalic_O and N𝑁Nitalic_N. Their contraction leads to the coarse-grained tensors:

KXyXy(sh,MDTRG)ANAyXOAyX.similar-to-or-equalssubscriptsuperscript𝐾shMDTRG𝑋𝑦superscript𝑋superscript𝑦subscript𝐴subscript𝑁𝐴𝑦superscript𝑋subscript𝑂𝐴superscript𝑦𝑋K^{\mathrm{(sh,MDTRG)}}_{XyX^{\prime}y^{\prime}}\simeq\sum_{A}N_{AyX^{\prime}}% O_{Ay^{\prime}X}.italic_K start_POSTSUPERSCRIPT ( roman_sh , roman_MDTRG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X italic_y italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≃ ∑ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_A italic_y italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_O start_POSTSUBSCRIPT italic_A italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_X end_POSTSUBSCRIPT . (73)

Note that the index that was created in the SVD forms one of the indices of the coarse grained tensor. Figure 21 shows the contraction for shifted MDTRG as (e) to (f). New indices of the shifted MDTRG is dotted purple line which comes from the truncated SVD of Eq. 71.

E.3 Comparison of coarse-graining methods

We benchmark the different TRG algorithms for the two-dimensional critical Ising model. The results are summarized in table 1 and discussed in the main text. As mentioned there, we divide the algorithm into three classes. Algorithms denoted as iso in table 1 apply isometries to the tensors to create the coarse grained indices. The methods marked as iso* use isometries as well, but only for intermediate contraction steps, and the final coarse-grained indices are not directly the truncated indices of the isometries. Finally, all other algorithms are marked as sqz.

When isometries are introduced in a tensor network to combine bonds and to compress the bond dimension, there is an ambiguity in choosing these tensors. They can be optimized for either direction of the bond that shall be compressed. An example can be seen in Fig. 12, where the isometries U𝑈Uitalic_U and V𝑉Vitalic_V minimize the error with respect to different contraction directions. Only one of the two is chosen in isometric algorithms for the coarse-graining, and this can lead to a significant decrease of the accuracy if the tensor is not symmetric. Otherwise, U𝑈Uitalic_U and V𝑉Vitalic_V are identical and the problem does not arise. The squeezers introduced in the boundary TRG algorithm and discussed in App. D take into account both isometries. Thus, these methods do not suffer from the errors introduced by omitting the other isometry.

Even though the original TRG algorithm uses an SVD as well for the coarse-graining, both isometries are used in this case. This makes it equivalent to the squeezer algorithms and we group it as sqz.

Refer to caption
Figure 22: Relative error of the free energy of the two-dimensional Ising model at the critical temperature for different bond dimensions D𝐷Ditalic_D. Comparison between ATRG and isometric ATRG for the symmetric initial tensor K(exp)superscript𝐾expK^{\mathrm{(exp)}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT and the non-symmetric initial tensor K(delta)superscript𝐾deltaK^{\mathrm{(delta)}}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT.
Refer to caption
Figure 23: Relative error of the free energy of the two-dimensional Ising model at the critical temperature for different bond dimensions D𝐷Ditalic_D. Comparison between shifted ATRG and shifted isometric ATRG for the symmetric initial tensor K(exp)superscript𝐾expK^{\mathrm{(exp)}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT and the non-symmetric initial tensor K(delta)superscript𝐾deltaK^{\mathrm{(delta)}}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT.
Refer to caption
Figure 24: Relative error of the free energy of the two-dimensional Ising model at the critical temperature for different bond dimensions D𝐷Ditalic_D. Comparison between MDTRG, shifted MDTRG and boundary MDTRG for the symmetric initial tensor K(exp)superscript𝐾expK^{\mathrm{(exp)}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT and the non-symmetric initial tensor K(delta)superscript𝐾deltaK^{\mathrm{(delta)}}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT.

Our benchmark results for the ATRG and MDTRG methods are shown in Figs. 22, 23 and 24. For the truncated SVD, we use the randomized SVD with an oversampling parameter r=4𝑟4r=4italic_r = 4, such that the SVD is performed in an rD𝑟𝐷rDitalic_r italic_D dimensional subspace. We test all methods with two initial tensors, a symmetric tensor K(exp)superscript𝐾expK^{\mathrm{(exp)}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT (see Eq. 15) and a non-symmetric one K(delta)superscript𝐾deltaK^{\mathrm{(delta)}}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT from our initial tensor construction (see Eq. 12).

Figure 22 shows that the ATRG does not only produce more accurate results for large bond dimensions compared to the isometric ATRG. Also, ATRG (type sqz) shows no dependence on the initial tensors, while isometric ATRG (type iso) has a strong dependence and is much less accurate for the non-symmetric initial tensor.

Both shifted ATRG methods (shifted ATRG, type sqz and isometric shifted ATRG, type iso*) show only a very mild dependence on the initial tensors as can be seen in Fig. 23. The shifted ATRG has a similar accuracy compared to the common ATRG. Combined with the technical advantages discussed in App. F, this method makes a good candidate for the impurity tensor method to calculate observables.

For the MDTRG methods shown in Fig. 24, we find that the MDTRG (type iso) produces much less accurate results if a non-symmetric initial tensor is chosen. If the boundary TRG method is applied (type sqz), the results coincide with those of the usual MDTRG method and symmetric initial tensors. The boundary MDTRG obtains similar results, however, for non-symmetric tensors as well. This shows again how the squeezers can make observables more resilient against the choice of initial tensors. The shifted MDTRG (type iso*) shows only a mild dependence on the initial tensors but has slightly larger errors than boundary MDTRG for large bond dimensions.

From our numerical calculations with the variants of the ATRG and MDTRG, TRG, and HOTRG, we find that the TRG methods with coarse-grained tensors K(next)superscript𝐾nextK^{\mathrm{(next)}}italic_K start_POSTSUPERSCRIPT ( roman_next ) end_POSTSUPERSCRIPT, whose indices are directly created from isometries, have large initial tensor dependencies. This dependence is eliminated if we apply the boundary TRG technique as discussed in App. D. We therefore recommend the truncation method with squeezers based on the boundary TRG method, which does not increase the numerical costs significantly but leads to more reliable results.

Appendix F Impurity tensor method for ATRG

Impurity tensors can be used to calculate physical observables with TRG methods. We give a brief introduction and overview and discuss the differences that arise for the ATRG and the shifted ATRG method. The latter was introduced in App. E. The impurity tensor method was first suggested in Gu et al. (2008). It is elsewhere discussed in much detail for TRG Nakamoto and Takeda (2016) and also for HOTRG (with isometries) Morita and Kawashima (2019).

In tensor renormalization group methods, the partition function Z𝑍Zitalic_Z is represented by a translational invariant repetition of a tensor Tabcd(β)subscript𝑇𝑎𝑏𝑐𝑑𝛽T_{abcd}(\beta)italic_T start_POSTSUBSCRIPT italic_a italic_b italic_c italic_d end_POSTSUBSCRIPT ( italic_β ) in a volume V𝑉Vitalic_V as

Z=tri=1VTaibicidi(β).𝑍trsuperscriptsubscriptproduct𝑖1𝑉subscript𝑇subscript𝑎𝑖subscript𝑏𝑖subscript𝑐𝑖subscript𝑑𝑖𝛽Z=\mathrm{tr}\prod_{i=1}^{V}T_{a_{i}b_{i}c_{i}d_{i}}(\beta).italic_Z = roman_tr ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_V end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_β ) . (74)

We assume the tensor T(β)𝑇𝛽T(\beta)italic_T ( italic_β ) is a function of a parameter β𝛽\betaitalic_β, which could, for example, be the inverse temperature. Using the product rule and exploiting the translational invariance of the network, the derivative of Z with respect to β𝛽\betaitalic_β is

1VZβ=tr(Ta1b1c1d1β)i=2VTaibicidi(β).1𝑉𝑍𝛽trsubscript𝑇subscript𝑎1subscript𝑏1subscript𝑐1subscript𝑑1𝛽superscriptsubscriptproduct𝑖2𝑉subscript𝑇subscript𝑎𝑖subscript𝑏𝑖subscript𝑐𝑖subscript𝑑𝑖𝛽\frac{1}{V}\frac{\partial Z}{\partial\beta}=\mathrm{tr}\left(\frac{\partial T_% {a_{1}b_{1}c_{1}d_{1}}}{\partial\beta}\right)\prod_{i=2}^{V}T_{a_{i}b_{i}c_{i}% d_{i}}(\beta).divide start_ARG 1 end_ARG start_ARG italic_V end_ARG divide start_ARG ∂ italic_Z end_ARG start_ARG ∂ italic_β end_ARG = roman_tr ( divide start_ARG ∂ italic_T start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_β end_ARG ) ∏ start_POSTSUBSCRIPT italic_i = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_V end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_β ) . (75)

We call (Tβ)𝑇𝛽\left(\frac{\partial T}{\partial\beta}\right)( divide start_ARG ∂ italic_T end_ARG start_ARG ∂ italic_β end_ARG ) the impurity tensor.

In the impurity tensor method, we need to consider the propagation of the impurity tensor information in each coarse graining step. In order to keep the information of the impurity tensor at each step, we have to store sub tensor networks Nakamoto and Takeda (2016). For the simple TRG, we need to store four different tensors. We show how ATRG (Fig. 25) and its variation shifted ATRG (Fig. 26) can be used for the impurity tensor method.

Refer to caption
Figure 25: Propagation of the impurity tensor information for ATRG. The green dots represent tensors which include information of the impurity and differ from the other tensors. The spread of information can be understood from Fig. 17, going from (b) to (c). The information of a pair of tensors in vertical direction (see Fig. 14) is spread to three coarse-grained tensors in horizontal direction in Fig. 17 (c). We refer to the vertical and horizontal directions as shown in Fig. 17. In the figure above, the lattice orientation does not change. From (1) to (2): The first coarse-graining step leads to three affected tensors. From (2) to (3): After exchanging the xlimit-from𝑥x-italic_x - and ylimit-from𝑦y-italic_y -direction, two pairs in vertical direction are affected, and each pair creates three tensors with impurity information. The total number of impurity tensors is thus six. (4) and (5): Following the same logic, the information ultimately spreads to eight tensors for the ATRG algorithm.
Refer to caption
Figure 26: Propagation of the impurity tensor information for shifted ATRG. The green dots represent tensors which include information of the impurity and differ from the other tensors. From (1) to (2): Initially, the information is spread to two tensors. From (2) to (3): In Fig. 17, the impurity information is contained in the four central tensors in step (b). These are contracted to two coarse-grained tensors in (d), which leads to two impurity tensors after the coarse-graining. From (2) to (3): In the next steps, an exchange of xlimit-from𝑥x-italic_x - and ylimit-from𝑦y-italic_y -directions leads to a similar picture, where the information is again only contained in the four central tensors in (b). These are contracted to two coarse-grained impurity tensors. This keeps on repeating in subsequent steps, and only two impurity tensors have to be kept with the shifted ATRG.

With the original ATRG, the information of the initial impurity tensor propagates to eight different tensors in later coarse-graining steps, as is shown in Fig. 25. In contrast to this we only need to calculate and store two coarse-grained impurity tensors with the shifted ATRG, as is shown in Fig. 26. The difference arises from the contraction step in Fig. 17. There, the tensor network EFGH𝐸𝐹𝐺𝐻EFGHitalic_E italic_F italic_G italic_H contributes to three coarse-grained tensors K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT in original ATRG (from Fig. 17(b) to (c)). In contrast to this, the tensors EFGH𝐸𝐹𝐺𝐻EFGHitalic_E italic_F italic_G italic_H only affect two coarse-grained tensors K(sh,ATRG)superscript𝐾shATRGK^{\mathrm{(sh,ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_sh , roman_ATRG ) end_POSTSUPERSCRIPT for the shifted ATRG, see Fig. 17(b) to (d). We use the shifted ATRG method to calculate the free energy in the 2subscript2\mathbb{Z}_{2}blackboard_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT gauge theory (see Sec. IV) because of the lower memory footprint and computational costs.

Appendix G Index direction swapping

In a TRG coarse-graining step, two initial tensors K𝐾Kitalic_K are combined into a single new tensor K(next)superscript𝐾nextK^{\mathrm{(next)}}italic_K start_POSTSUPERSCRIPT ( roman_next ) end_POSTSUPERSCRIPT. This was explained in App. E for two tensors connected by a link in y𝑦yitalic_y-direction. For a two-dimensional lattice, this step is followed by a similar coarse-graining in x𝑥xitalic_x-direction and these directions are alternated. The same algorithm can be used if the indices of the initial tensors are permuted accordingly after each coarse-graining step. There are four different choices to exchange the x𝑥xitalic_x and y𝑦yitalic_y directions, which are also shown in Fig. 27:

Kxyxysubscript𝐾𝑥𝑦superscript𝑥superscript𝑦\displaystyle K_{xyx^{\prime}y^{\prime}}italic_K start_POSTSUBSCRIPT italic_x italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT Kyxyxabsentsubscript𝐾𝑦𝑥superscript𝑦superscript𝑥\displaystyle\leftrightarrow K_{yxy^{\prime}x^{\prime}}↔ italic_K start_POSTSUBSCRIPT italic_y italic_x italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (xy),xy\displaystyle\mathrm{\ \ \ (x\leftrightarrow y),}( roman_x ↔ roman_y ) , (76)
Kxyxysubscript𝐾𝑥𝑦superscript𝑥superscript𝑦\displaystyle K_{xyx^{\prime}y^{\prime}}italic_K start_POSTSUBSCRIPT italic_x italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT Kyxyxabsentsubscript𝐾superscript𝑦superscript𝑥𝑦𝑥\displaystyle\leftrightarrow K_{y^{\prime}x^{\prime}yx}↔ italic_K start_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y italic_x end_POSTSUBSCRIPT (xy),xsuperscripty\displaystyle\mathrm{\ \ \ (x\leftrightarrow y^{\prime}),}( roman_x ↔ roman_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , (77)
Kxyxysubscript𝐾𝑥𝑦superscript𝑥superscript𝑦\displaystyle K_{xyx^{\prime}y^{\prime}}italic_K start_POSTSUBSCRIPT italic_x italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT Kyxyxabsentsubscript𝐾𝑦superscript𝑥superscript𝑦𝑥\displaystyle\rightarrow K_{yx^{\prime}y^{\prime}x}→ italic_K start_POSTSUBSCRIPT italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x end_POSTSUBSCRIPT (),\displaystyle\mathrm{\ \ \ (\circlearrowleft),}( ↺ ) , (78)
Kxyxysubscript𝐾𝑥𝑦superscript𝑥superscript𝑦\displaystyle K_{xyx^{\prime}y^{\prime}}italic_K start_POSTSUBSCRIPT italic_x italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT Kyxyxabsentsubscript𝐾superscript𝑦𝑥𝑦superscript𝑥\displaystyle\rightarrow K_{y^{\prime}xyx^{\prime}}→ italic_K start_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x italic_y italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ().\displaystyle\mathrm{\ \ \ (\circlearrowright)}.( ↻ ) . (79)
Refer to caption
Figure 27: Exchange of indices after each coarse-graining step for a two-dimensional system: xy𝑥𝑦xyitalic_x italic_y flips and rotations.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 28: Dependence of TRG algorithms on the type of exchange between xlimit-from𝑥x-italic_x - and ylimit-from𝑦y-italic_y -directions between coarse-graining steps. Shown is the relative error of the free energy for the critical Ising model in two-dimensions for different bond dimensions D𝐷Ditalic_D and for a symmetric initial tensor K(exp)superscript𝐾expK^{\mathrm{(exp)}}italic_K start_POSTSUPERSCRIPT ( roman_exp ) end_POSTSUPERSCRIPT and a non-symmetric initial tensor K(delta)superscript𝐾deltaK^{\mathrm{(delta)}}italic_K start_POSTSUPERSCRIPT ( roman_delta ) end_POSTSUPERSCRIPT. Comparison between different ATRG and MDTRG algorithms as introduced in App. E.
xylimit-from𝑥𝑦xy-italic_x italic_y -swap dep. Trunc. K𝐾Kitalic_K dep.
ATRG Adachi et al. (2020) -- sqz --- -
Iso-ATRG Adachi et al. (2020) +++ iso +++++ +
sh-ATRG +++++ + sqz --
sh-Iso-ATRG +++++ + iso* --
MDTRG Kadoh and Nakayama (2019) -- iso +++++ +
sh-MDTRG +++++ + iso* --
b-MDTRG --- - sqz --- -
Table 2: Properties of different ATRG and MDTRG methods. 2nd column: dependence on the type of exchange between xlimit-from𝑥x-italic_x - and ylimit-from𝑦y-italic_y -direction between coarse-graining steps; +++++ +/+++/--/--- - stands for very strong/noticeable/slight but not significant/nearly no dependence; 3rd column: truncation method; iso stands for isometries which are used to create the coarse-grained indices; iso* means that isometries are used for intermediate approximate contractions, but they do not create the new indices of the coarse-grained tensors directly; sqz denotes all other methods, so either the squeezers from boundary TRG Iino et al. (2019) (see main text and App. D), or a simple contraction and singular value decomposition. 4th column: dependence on the initial tensors; --- - stands for no dependence, -- for a slight but not significant dependence, +++++ + for strong dependence;

We test the dependence of the TRG variants on the type of xy𝑥𝑦xyitalic_x italic_y-exchange, see Fig. 28. We only show data for xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y and \circlearrowleft because the results for xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y and \circlearrowleft coincide with xy𝑥superscript𝑦x\leftrightarrow y^{\prime}italic_x ↔ italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and \circlearrowright, respectively. We summarize our findings in table 2. The main observations from the numerical benchmarks are:

  1. 1.

    The shifted methods with a flip xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y do not converge to the correct results when the bond dimension is increased, and the errors remain large or even increase with the bond dimension (red and purple triangles in Fig. 28).

  2. 2.

    Non-shifted methods have a similar or better accuracy when a flip xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y is applied. The results are in particular better for the isometric ATRG (black and gray dots in Fig. 28(c)).

  3. 3.

    The boundary TRG methods do not significantly depend on the type of exchange, xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y or rotation \circlearrowleft.

  4. 4.

    Overall, the different types of exchange (rotation \circlearrowleft or flip xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y) lead to different accuracies, depending on the details of the coarse-graining algorithm. Therefore, the exchange type should be chosen accordingly.

When implementing a TRG algorithm, one has to carefully keep track of the index order and conventions. For example, we identified f𝑓fitalic_f in Eq. 66 as X𝑋Xitalic_X in Eq. 67, and fsuperscript𝑓f^{\prime}italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT as Xsuperscript𝑋X^{\prime}italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. If one would instead set X=fsuperscript𝑋𝑓X^{\prime}=fitalic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_f and X=f𝑋superscript𝑓X=f^{\prime}italic_X = italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, it would correspond to an exchange between the xy𝑥𝑦xyitalic_x italic_y flip and a rotation. These conventions should be explicitly checked when comparing flips and rotations between algorithms and implementations.

The observations can be understood from the interplay of the last step in obtaining a coarse grained tensor, the exchange of indices, and the initial tensor decomposition in a TRG algorithm. For example, the coarse grained tensor K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT in the ATRG algorithm is obtained by a contraction of two tensors M𝑀Mitalic_M and L𝐿Litalic_L in Eq. 68 and Fig. 17 from (b) to (c). A flip xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y (xy𝑥superscript𝑦x\leftrightarrow y^{\prime}italic_x ↔ italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT) exchanges two indices of M𝑀Mitalic_M and two indices of L𝐿Litalic_L, but does not move only one index to the other tensor. In the next coarse-graining iteration, the tensor K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT is initially split into E𝐸Eitalic_E and F𝐹Fitalic_F, which are exactly M𝑀Mitalic_M and L𝐿Litalic_L (L𝐿Litalic_L and M𝑀Mitalic_M) respectively. Therefore, this SVD does not introduce a further truncation. This is not be the case if a rotation of the indices is used. Similarly, the initial splitting of K(sh,ATRG)superscript𝐾shATRGK^{\mathrm{(sh,ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_sh , roman_ATRG ) end_POSTSUPERSCRIPT into E𝐸Eitalic_E and F𝐹Fitalic_F for the shifted ATRG reconstructs the tensors M𝑀Mitalic_M and L𝐿Litalic_L (L𝐿Litalic_L and M𝑀Mitalic_M) respectively if the exchange type \circlearrowleft (\circlearrowright) is used. This can be seen from Eqs. 66 and 67 or Fig. 17(b) to (c). The same arguments hold for the MDTRG algorithms.

The optimal choice for the index exchange can also be understood if the triad representation is used everywhere instead of coarse-graining to a square lattice Kadoh and Nakayama (2019); Nakayama (2023); Morita et al. (2018). For example, the tensor K(ATRG)superscript𝐾ATRGK^{\mathrm{(ATRG)}}italic_K start_POSTSUPERSCRIPT ( roman_ATRG ) end_POSTSUPERSCRIPT does not need to be constructed explicitly as a contraction between M𝑀Mitalic_M and L𝐿Litalic_L. Instead, these two tensors can be used in the next coarse graining step. In this formulation, the natural index exchange order is more apparent.

We benchmarked the two-dimensional Ising model at the critical temperature here. Since we discover a significant dependence on the type of xy𝑥𝑦xyitalic_x italic_y-swapping for some of the methods, we suggest to check this behavior for other models as well to find the optimal choice. This is particularly true since we found specific cases for the flip-index exchange xy𝑥𝑦x\leftrightarrow yitalic_x ↔ italic_y with a systematic accumulation of errors, which led to a decreased accuracy when the bond dimension is increased. Similarly, the type of index permutation after each coarse-graining step can be important for other TRG methods and in higher dimensions, where the number of variants becomes even larger.

References