Theoretical modelling of nano-scaled systems with heavy ions Dissertation zur Erlangung des Doktorgrades der Naturwissenschaften (Dr. rer. nat.) Kris Holtgrewe Fachbereich 07 Institut für theoretische Physik Gießen, Juli 2022 Dem Fachbereich 07 der Justus-Liebig-Universität Gießen als Dissertation vorgelegt. Tag der Disputation: 24. Oktober 2022 Erstgutachter: Prof. Dr. Simone Sanna Zweitgutachter: Prof. Dr. Wolf Gero Schmidt Equidem beatos puto, quibus deorum munere datum est aut facere scribenda aut scribere legenda, beatissimos vero quibus utrumque. — Gaius Plinius Caecilius Secundus (Plinius Minor), Epistulae VI.16 — Ich jedenfalls glaube, dass diejenigen glücklich sind, denen als Geschenk der Götter die Gabe zuteil geworden ist, entweder Dinge zu tun, die es Wert sind aufgeschrieben zu werden, oder Dinge zu schreiben, die es Wert sind gelesen zu werden. Die Glücklichsten sind wahrhaftig diejenigen, die beides erfüllen. — I, for one, believe that those are blissful who are gifted by the gods either to do things worth writing down, or to write things worth reading. The most blissful, truly, are those who fulfil both. Abstract Systems on the nanoscale are a hot topic in science and technology as they may be applied in future concepts for microprocessors, memory storage and sensors. This PhD thesis covers two nano-scaled systems which promise particularly advant- ageous properties: rare-earth silicide nanowires on silicon surfaces, a quasi-one- dimensional electronic system, and thin antimony layers on bismuth selenide, a two- dimensional system with topologically protected surface states. These systems have in common that their fascinating properties are due to heavy ions incorporated in the structures. This thesis theoretically investigates them by means of density func- tional theory (DFT). However, DFT encounters problems when describing such ma- terials since the high atomic numbers of the involved elements give rise to exotic phenomena, e.g. strongly correlated electronic subshells (the incomplete 4f shell of the lanthanoids), strong relativistic effects (topologically non-trivial insulators) and high contributions to the electronic long-range correlation (“van der Waals interac- tions”). These problems are solved by approaches beyond DFT, including LDA+U, spin-orbit coupling and dispersion corrections. Both systems investigated in this work have higher-dimensional prototype struc- tures, which are explored at first and then scaled down to the final nanostructures. Different structure models are set up and optimised regarding the ionic positions. Their stability is evaluated by means of ab initio thermodynamics and phase dia- grams are derived. For the most stable structure models, the electronic properties are calculated, including band structures, Fermi surfaces and simulated scanning tunnelling microscopy. All theoretical findings on the structural and electronic prop- erties are carefully compared with experimental reference. In this way, a conclusive ab initio framework is established for these systems, which permits a deep under- standing of the underlying physics. Furthermore, novel and fascinating phenomena are identified. The rare-earth silicide nanowires on Si(557) show a unique dimen- sional crossover, which gives rise to quasi-one-dimensional, metallic edge states. The thin antimony layers are proven to underlie a complex interplay with the topolo- gically protected surfaces states of the bismuth selenide surface, which involves an intricate series of topological phase transitions. iv Zusammenfassung Nanoskopische Systeme sind ein heißes Thema in Wissenschaft und Forschung, da sie Anwendung in künftigen Konzepten für Mikroprozessoren, Datenspeicher und Sen- soren finden könnten. Diese Dissertation behandelt zwei nanoskopische Systeme, die besonders vorteilhafte Eigenschaften versprechen: Seltenerdsilizidnanodrähte auf Siliziumoberflächen, ein quasi-eindimensionales elektronisches System, und dün- ne Antimonlagen auf Bismutselenid, ein zweidimensionales System mit topologisch geschützten Oberflächenzuständen. Diesen Systemen ist gemein, dass ihre faszinie- renden Eigenschaften auf die Einbindung schwerer Ionen in die Strukturen zurück- zuführen sind. Diese Dissertation untersucht sie mittels der Dichtefunktionaltheo- rie (DFT). Jedoch ist die DFT problematisch in der Beschreibung solcher Materiali- en, da die hohen Ordnungszahlen der beteiligten Elemente exotische Phänomene verursachen, z.B. stark korrelierte elektronische Unterschalen (die inkomplette 4f- Schale der Lanthanoide), starke relativistische Effekte (topologisch nicht-triviale Iso- latoren) und hohe Beiträge zur langreichweitigen elektronischen Korrelation („van- der-Waals-Wechselwirkungen“). Diese Probleme werden durch Methoden jenseits der DFT gelöst, u.a. LDA+U, Spin-Bahn-Kopplung und Dispersionskorrekturen. Für beide in dieser Arbeit untersuchten Systeme existieren höherdimensionale Strukturprototypen, die zunächst erforscht und danach auf die finalen Nanostruktu- ren herunter skaliert werden. Verschiedene Strukturmodelle werden aufgesetzt und bezüglich der ionischen Positionen optimiert. Ihre Stabilität wird mittels ab initio- Thermodynamik ausgewertet und Phasendiagramme werden aufgestellt. Für die sta- bilsten Strukturmodelle werden die elektronischen Eigenschaften berechnet, darun- ter Bandstrukturen, Fermiflächen und simulierte Rasterelektronenmikroskopie. Al- le theoretischen Ergebnisse für die strukturellen und elektronischen Eigenschaften werden sorgfältig mit experimenteller Referenz verglichen. Auf diese Weise wird für diese Systeme ein geschlossenes Rahmenwerk konstruiert, das ein tiefes Verständ- nis der zugrunde liegenden Physik ermöglicht. Des Weiteren werden neuartige und faszinierende Phänomene identifiziert. In den Seltenerdsilizidnanodrähten wird ein einzigartiger dimensionaler Übergang entdeckt, der quasi-eindimensionale metalli- sche Kantenzustände erzeugt. Den dünnen Antimonlagen wird nachgewiesen, dass sie einem komplexen Zusammenspiel mit den topologisch geschützten Oberflächen- zuständen der Bismutselenidoberfläche unterworfen sind, das eine komplizierte Rei- he von topologischen Phasenübergängen beinhaltet. v Contents 1 Introduction 1 1.1 The microprocessor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2 The structure of this work . . . . . . . . . . . . . . . . . . . . . . . . . 8 2 Methods 9 2.1 Density functional theory . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.1.1 The LDA+U method . . . . . . . . . . . . . . . . . . . . . . . . . 12 2.1.2 Spin-orbit coupling in DFT . . . . . . . . . . . . . . . . . . . . . 16 2.1.3 Van der Waals corrections . . . . . . . . . . . . . . . . . . . . . 17 2.2 The Bloch theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 2.2.1 Basis changes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 I Rare-earth silicide nanowires on silicon surfaces 25 3 Chemical background 28 3.1 The rare-earth elements . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 3.2 Silicon and silicides . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 4 The rare-earth silicide bulk phases 34 4.1 Structure prototypes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 4.2 Structure optimisation . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 4.2.1 Methodological details . . . . . . . . . . . . . . . . . . . . . . . 41 4.2.2 The stoichiometric RESi2 phases . . . . . . . . . . . . . . . . . . 45 4.2.3 The role of the 4f electrons in RESi2 . . . . . . . . . . . . . . . . 47 4.2.4 Vibrational contributions in RESi2 . . . . . . . . . . . . . . . . . 50 4.2.5 The vacancy-populated RESi2 – x phases . . . . . . . . . . . . . . 52 4.2.6 The CaSi2 phases . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 4.3 Electronic properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 4.4 Discussion and summary . . . . . . . . . . . . . . . . . . . . . . . . . . 69 5 Monolayer films on silicon(111) 71 5.1 Structure optimisation . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 5.2 Electronic properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 5.2.1 Band structures . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 5.2.2 Charge transfer . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 5.2.3 Fermi surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86 5.3 Discussion and summary . . . . . . . . . . . . . . . . . . . . . . . . . . 88 5.3.1 Strain induced growth? . . . . . . . . . . . . . . . . . . . . . . . 89 vi 6 Nanowires on silicon(557) 91 6.1 Structure optimisation . . . . . . . . . . . . . . . . . . . . . . . . . . . 92 6.2 Electronic properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 6.2.1 Band structures . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 6.2.2 Fermi surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 6.3 Discussion and summary . . . . . . . . . . . . . . . . . . . . . . . . . . 103 II Thin antimony layers on bismuth selenide 105 7 Geometric phases 108 7.1 The Aharonov-Bohm effect . . . . . . . . . . . . . . . . . . . . . . . . . 108 7.2 The Berry phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 7.3 The electric polarisation in a crystal . . . . . . . . . . . . . . . . . . . . 113 7.4 Topological transport . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116 7.4.1 The integer Hall effect . . . . . . . . . . . . . . . . . . . . . . . . 116 7.4.2 The two-dimensional ℤ2 insulator . . . . . . . . . . . . . . . . . 120 7.4.3 The three-dimensional ℤ2 insulator . . . . . . . . . . . . . . . . 128 8 The bulk phases of bismuth selenide and antimony 131 8.1 Bismuth selenide . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132 8.1.1 Structure optimisation . . . . . . . . . . . . . . . . . . . . . . . 133 8.1.2 Electronic properties . . . . . . . . . . . . . . . . . . . . . . . . 135 8.2 Antimony . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 8.2.1 Structure optimisation . . . . . . . . . . . . . . . . . . . . . . . 138 8.2.2 Electronic properties . . . . . . . . . . . . . . . . . . . . . . . . 139 8.3 Discussion and summary . . . . . . . . . . . . . . . . . . . . . . . . . . 142 9 The antimony on bismuth selenide heterostructure 143 9.1 Structure optimisation . . . . . . . . . . . . . . . . . . . . . . . . . . . 144 9.1.1 The stacking sequence of the adlayer . . . . . . . . . . . . . . . 145 9.1.2 The effects from SOC and vdW on the heterostructure . . . . . 148 9.2 Electronic properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149 9.2.1 Band structures . . . . . . . . . . . . . . . . . . . . . . . . . . . 149 9.2.2 Simulated STM images . . . . . . . . . . . . . . . . . . . . . . . 154 9.3 The topological phase transition . . . . . . . . . . . . . . . . . . . . . . 156 9.4 Discussion and summary . . . . . . . . . . . . . . . . . . . . . . . . . . 160 10 Conclusions 162 Appendix A1 Bibliography A15 Publication list . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A24 Danksagung A25 vii List of Figures 1.1 Performance development of the Top500 list . . . . . . . . . . . . . . . 6 3.1 Atomic radii of the REs. . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 4.1 Overview over the RESi2 structure prototypes. . . . . . . . . . . . . . 36 4.2 Possible configurations of ordered Si vacancies in RESi1.67. . . . . . . 36 4.3 Schematic phase diagram of the Tb–Si system. . . . . . . . . . . . . . . 39 4.4 Murnaghan fits of TbSi2, DySi2, HoSi2 and ErSi2 in the hex-AlB2 and the tet-ThSi2 phase. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 4.5 Vibrational free energies of hex-AlB2-TbSi2, ort-AlB2-TbSi2 and tet- ThSi2-TbSi2 and the corrected Tb chemical potentials 𝜇Tb(𝑇 ) calculated in different supercells. . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 4.6 Structure models of vacancy-populated TbSi2 – b. . . . . . . . . . . . . . 54 4.7 Linear interpolation of Δ𝜇RE for TbSi2 – x and ErSi 12 – x between 𝑥 = 3 and 𝑥 = 0. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 4.8 Models of TbSi2 in the ort-AlB2, tr3-CaSi2 and tr6-CaSi2 structures. . . 60 4.9 Band structures of tet-ThSi2-TbSi2 and hex-AlB2-TbSi2. . . . . . . . . . 63 4.10 Band structures of hex-AlB2-TbSi2 and ort-AlB2-TbSi2. . . . . . . . . . 65 4.11 Band structures of trivalent and f-valent, FMO ort-AlB2-RESi2. . . . . . 67 4.12 Band structures of hex-AlB2-TbSi2 – b and ort-AlB2-TbSi2. . . . . . . . . 68 4.13 Charge transfer in ort-AlB2-TbSi2 and hex-AlB2-RESi2 – b. . . . . . . . . 69 5.1 Sketch of the Si(111) slab and the TbSi2 monolayer structures. . . . . 74 5.2 Tb chemical potentials of the monolayer-Tb@Si(111) structures and de- cision tree for the energy penalties upon structure variations. . . . . 76 5.3 Vertical positions and distances of the atomic layers in the monolayer- Tb@Si(111) structures. . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 5.4 Band structures of the T4-h-B structure and the clean Si(111) surface. 80 5.5 Band structures of the monolayer-Tb@Si(111) structures. . . . . . . . 83 5.6 Band folding of the bulk TbSi2 structures. . . . . . . . . . . . . . . . . 84 5.7 Charge transfer in T4-h-B. . . . . . . . . . . . . . . . . . . . . . . . . . . 86 5.8 Fermi surfaces of the monolayer-Tb@Si(111) structures. . . . . . . . . 87 5.9 Energy surfaces of T4-h-B at different energy levels. . . . . . . . . . . 87 6.1 Sketch of the Si(551) slab and the optimised TbSi2 nanowire structures. 93 6.2 Band structures of dense and sparse nanowires on Si(557) and the monolayer on Si(111). . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 6.3 Segmentation of the Brillouin zone of the monolayer. . . . . . . . . . . 99 6.4 Band structures of the nanowires and projected bands of the monolayer. . . 100 6.5 Fermi surfaces of the Tb@Si(ℎℎ𝑘) systems. . . . . . . . . . . . . . . . . 102 viii 7.1 Sketch of the ABE setting. . . . . . . . . . . . . . . . . . . . . . . . . . . 110 7.2 Path for calculating the electric polarisation as a geometric phase. . . . . . 115 7.3 Sketch of the directions of the integer Hall effect and path of 𝝃 on the Brillouin torus for calculating the topological transport. . . . . . . . . 118 7.4 Sketch of the reciprocal space of the T-symmetric Bloch system and flow chart of the connection between the partial polarisations 𝑷I and 𝑷II, the total polarisation 𝑷+ and the time-reversal polarisation 𝑷− . . . . . . . 124 7.5 Sketch of the Laughlin-type gedankenexperiment and sketches of pos- sible topologies of T-symmetric edge band structures. . . . . . . . . . 127 7.6 TRIMs 𝚪𝑖 in a cubic Brillouin zone. . . . . . . . . . . . . . . . . . . . . . . 128 8.1 Bulk structure of Bi2Se3. . . . . . . . . . . . . . . . . . . . . . . . . . . 133 8.2 Band structure of Bi2Se3 for the rhombohedrally centred DFT-D2+SOC structure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136 8.3 Bulk structure of 𝛽-Sb. . . . . . . . . . . . . . . . . . . . . . . . . . . . 138 8.4 Band structure of bulk 𝛽-Sb and isolated 𝛽-Sb sheets. . . . . . . . . . . 140 9.1 Sketch of the BS(0001) slab. . . . . . . . . . . . . . . . . . . . . . . . . 145 9.2 Sb chemical potentials 𝜇Sb and interlayer distances 𝑑1 and 𝑑2 of the structure models for the Sb@BS system optimised with DFT-D2+SOC. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147 9.3 Interlayer distances 𝑑1 and 𝑑2 of 1BL-Sb and 2BL-Sb optimised with dif- ferent treatments. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149 9.4 Band structures of the clean Bi2Se3 surface and the heterostructures 1BL-Sb and 2BL-Sb optimised with DFT-D2+SOC. . . . . . . . . . . . . 150 9.5 Comparison between the theoretical band structures and ARPES images of the heterostructures 1B-Sb and 2BL-Sb. . . . . . . . . . . . . . . . . 152 9.6 Spin texture of the clean Bi2Se3 surface and the heterostructures 1BL-Sb and 2BL-Sb. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153 9.7 STM images of 1BL-Sb and 2BL-Sb. . . . . . . . . . . . . . . . . . . . . 155 9.8 Real-space isosurfaces of the wavefunctions at the Dirac points D and D∗ of the clean substrate, (b) 1BL-Sb and (c) 2BL-Sb. . . . . . . . . . . . 156 9.9 Hexagonal Brillouin zone with TRIMs. . . . . . . . . . . . . . . . . . . . . 157 9.10 Topological phase transitions in the Sb@BS heterostructure. . . . . . 158 B.1 Band structures of hex-AlB2-TbSi2, hex-AlB2-Si2, ort-AlB2-TbSi2 and ort- AlB2-Si2. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A8 B.2 Band structures of trivalent and f-valent tet-ThSi2-TbSi2. . . . . . . . . A8 C.1 Band structures of 1BL-Sb and 2BL-Sb structure models with different stacking optimised with DFT-D2+SOC. . . . . . . . . . . . . . . . . . . . A13 ix List of Tables 3.1 Valencies of the lanthanoids occurring in aqueous solution and VEC of the trivalent ions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 4.1 Bulk mechanical properties of TbSi2, DySi2, HoSi2 and ErSi2 in the hex- AlB2 and the tet-ThSi2 phase. . . . . . . . . . . . . . . . . . . . . . . . 46 4.2 Lattice parameters and relative RE chemical potentials of TbSi2, DySi2, HoSi2 and ErSi2 in the hex-AlB2 and the tet-ThSi2 phase. . . . . . . . . 46 4.3 Lattice parameters and relative Tb chemical potentials of TbSi2 in the AlB2 and the ThSi2 phase under different f-electron treatments. . . . 48 4.4 Lattice parameters and relative Er chemical potentials of ErSi2 in the AlB2 and the ThSi2 phase under different f-electron treatments. . . . 49 4.5 Lattice parameters and relative Tb chemical potentials of TbSi2 in the AlB2 and the ThSi2 phase in orthorhombic Tb4Si8 cells. . . . . . . . . . 51 4.6 Lattice parameters and relative Tb chemical potentials of TbSi2 – x in the stoichiometric and the vacancy-populated AlB2 and ThSi2 phases. . . 55 4.7 Si–Si nearest-neighbour distances in TbSi2 – x optimised with PBE. . . 58 4.8 Lattice parameters and relative Tb chemical potentials of stoichiomet- ric TbSi2 in the AlB2 and CaSi2 phases. . . . . . . . . . . . . . . . . . . 61 5.1 Bond distances and angles in the monolayer-Tb@Si(111) structures. . 77 8.1 Lattice parameters of bulk Bi2Se3. . . . . . . . . . . . . . . . . . . . . . 135 8.2 Lattice parameters of bulk 𝛽-Sb. . . . . . . . . . . . . . . . . . . . . . . 139 9.1 Sb chemical potentials 𝜇Sb, basal lattice constants 𝑎 and interlayer dis- tances 𝑑 of 2D and 3D 𝛽-Sb systems. . . . . . . . . . . . . . . . . . . . . 146 B.1 Lattice parameters and relative Er chemical potentials of ErSi2 – x in the stoichiometric and the vacancy-populated AlB2 and ThSi2 phases. . . A6 B.2 Si–Si nearest neighbour distances in TbSi2 – x optimised with different xc-functionals. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A6 B.3 Si–Si nearest neighbour distances in ErSi2 – x optimised with different xc-functionals. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A7 B.4 Lattice parameters and relative RE chemical potentials of TbSi2 and ErSi2 in the stoichiometric AlB2 and CaSi2 phases. . . . . . . . . . . . . A7 x Acronyms ABE Aharonov-Bohm effect . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 AMO antiferromagnetically ordered . . . . . . . . . . . . . . . . . . . . . . . . . 48 ARPES angle-resolved photoemission spectroscopy . . . . . . . . . . . . . . . . . 26 BL bilayer (layers consisting of two atomic sublayers) . . . . . . . . . . . . . . . 138 CB conduction band . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 CBM conduction band minimum . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 CI conventional insulator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 DFT density functional theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 DOS density of states . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 FMO ferromagnetically ordered . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 GGA generalised-gradient approximation . . . . . . . . . . . . . . . . . . . . . . 12 HPC high-performace computer . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 IHE integer Hall effect . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 LDA local density approximation . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 LDOS local density of states . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154 LEED low-energy electron diffraction . . . . . . . . . . . . . . . . . . . . . . . . . 78 MEIS medium-energy ion scattering . . . . . . . . . . . . . . . . . . . . . . . . . 78 MOSFET metal-oxide-semiconductor field-effect transistor . . . . . . . . . . . . . 2 PAW projector-augmented wave . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 PBE Perdew-Burke-Ernzerhof functional . . . . . . . . . . . . . . . . . . . . . . 12 PBEsol Perdew-Burke-Ernzerhof functional revised for solids . . . . . . . . . . 12 pBZ prmitive Brillouin zone . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 PDOS partial density of states . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 QE Quantum Espresso (https://www.quantum-espresso.org/) . . . . . . . . . . . . 9 QL quintuple layer (layers consisting of five atomic sublayers) . . . . . . . . . . 132 RE rare-earth element . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 xi sBZ supercell Brillouin zone . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 SKP surface Kramers pair . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156 SOC spin-orbit coupling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 STM scanning tunnelling microscopy . . . . . . . . . . . . . . . . . . . . . . . . 26 STS scanning tunnelling spectroscopy . . . . . . . . . . . . . . . . . . . . . . . . 26 TI topological insulator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106 TRIM time-reversal-invariant momentum . . . . . . . . . . . . . . . . . . . . . . 121 TSS topologically protected surface state . . . . . . . . . . . . . . . . . . . . . . . 107 UHV ultra-high vacuum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146 VASP Vienna Ab initio Simulation Package (https://www.vasp.at/) . . . . . . . . 9 VB valence band . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159 VBM valence band maximum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 vdW van der Waals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 VEC valence electron configuration . . . . . . . . . . . . . . . . . . . . . . . . . . 28 XRD X-ray diffraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 ZKB Zintl-Klemm-Busmann concept . . . . . . . . . . . . . . . . . . . . . . . . . 33 xii Erklärung Ich erkläre: Ich habe die vorgelegte Dissertation selbstständig und ohne unerlaubte fremde Hilfe und nur mit den Hilfen angefertigt, die ich in der Dissertation angege- ben habe. Alle Textstellen, die wörtlich oder sinngemäß aus veröffentlichten Schrif- ten entnommen sind, und alle Angaben, die auf mündlichen Auskünften beruhen, sind als solche kenntlich gemacht. Ich stimme einer evtl. Überprüfung meiner Dis- sertation durch eine Antiplagiat-Software zu. Bei den von mir durchgeführten und in der Dissertation erwähnten Untersuchungen habe ich die Grundsätze guter wis- senschaftlicher Praxis, wie sie in der „Satzung der Justus-Liebig-Universität Gießen zur Sicherung guter wissenschaftlicher Praxis“ niedergelegt sind, eingehalten. Ort, Datum Unterschrift xiii 1 Introduction Systems on the nanoscale are one of the most active scientific research fields. They all have in common that they trap particles in less than three spatial dimensions: surfaces (2D), nanowires (1D), quantum dots (0D) and systems with dimensions in between. The reduced dimensionality gives rise to fascinating physical phenomena, some of them are of pure quantum nature. The research efforts in the broad field of nano-scaled systems are driven by two major motives. On the one hand, nano- structured devices may be applied in new technologies, e.g. for processors, memory storage and sensors. On the other hand, nano-scaled systems provide also the oppor- tunity to observe and understand fundamental physical principles which are other- wise predicted by theoretical frameworks. This PhD thesis investigates two groups of low-dimensional systems. The rare-earth silicide nanowires on silicon surfaces are ensembles of few nanometre wide and several 100 nm long RESi2 strips which grow horizontally on silicon substrates. Since bearing highly anisotropic metallic states, they are promising candidates for quasi- one-dimensional electronic systems. The thin antimony layers on bismuth selen- ide are van der Waals heterostructures consisting of a topologically non-trivial Bi2Se3 substrate and a few atomic layers thick Sb adsorbate. Such heterostructures consti- tute a junction between materials of different dimensions and different topological classes. Therefore, they bear a complex system of two-dimensional metallic surface states, which have a unique spin texture and are protected by topology. Although the two systems seem to be very different at first glance, they have in common that they involve the heaviest non-radioactive elements of the periodic table: the lanthanoids (La–Lu, 𝑍 ∈ [57, 71]) and bismuth (Bi, 𝑍 = 83), both located in the sixth and last stable row. The high atomic numbers of the constituents are responsible for the unique properties of the systems and give rise to exciting phenomena. A further common property of the systems is the practical motivation that they both may play a role in future processor technology. The nanowires are interesting from the point of view of miniaturising the feature sizes. The Sb@Bi2Se3 heterostructures are interesting for spintronics, an alternative concept of circuits based on spin densities. Before the investigations of the above systems begin, the applicability-related motiv- ation will be elaborated in more detail. In short, it is all about Moore’s law approach- ing its limit. However, solely stating the emergence of general problems would be too superficial to understand the technical and economic challenges impending in the medium-term future. Therefore, a short overview of the microprocessor is given, beginning with the success story of the computer and the silicon-based integrated circuits. It is followed by the problems which the semiconductor and computer in- dustry will encounter in the 2020s. These considerations clarify why so much effort is put into developing concepts for novel electronics. 1 1 Introduction 1.1 The microprocessor There are probably no technical component which has been affecting humankind more than the microprocessor. Being part of nearly every device, microprocessors are not only present in obviously computer-like devices, e.g. smartphones, laptops and tablets. Also smart devices like watches, TVs, sound systems and kitchen equip- ments contain small data-processing semiconductor chips, which run more or less elaborate programs to facilitate our life. Since the breakthrough of home automa- tion, formerly simple devices like thermostats, plug sockets and light switches have been upgraded by processing units as well. They can be integrated into a “network of smart things” where they communicate with each other, controlled by a computer- based central control unit. Microprocessors do not only inspire intelligence into in- dividual components. Also the internet, on which every job in every economic sector depends, is based on servers, which are accumulations of microprocessors. A striking example for how strong technological progress depends on powerful com- puting units are cars. Despite their seemingly simple purpose to transport passengers or freight, they are equipped with many microcontrollers and processors. The R&D departments of the automotive industry are quite busy with developing computing hardware and software as the abilities of a car become a growingly important sales argument, beside CO2-neutral vehicle propulsion. Modern cars have an interface which looks rather like a smartphone display than a traditional car interface. The superficial applications like the board info, the navigation system or the smartphone integration are only the visible output from the built-in microprocessors. Moreover, there are advanced driver-assistance systems, such as adaptive cruise control and lane centring, which are driven by processing units: the input from many sensors is converted into output for the steering, braking and acceleration actuators. As Ger- many licensed autonomous driving this year1, the future cars will require even more powerful and, in particular, reliable processing units – on boards and off board – to cope with the complexity of traffic. A historical note The history of the microprocessor is intricate as several parallel trends lead to those silicon-based microprocessors which keep our modern world running. There were two main threads: the development of the computer, which includes the fundament- als on how to run a program on a machine, and the invention of the metal-oxide- semiconductor field-effect transistor (MOSFET), which permitted miniaturising the chip features. The history can be experienced in respective museums. The world’s largest computer museum is the Heinz-Nixdorf-Forum in Paderborn, Germany, ded- icated to Heinz Nixdorf, an important entrepreneur in the German computer in- dustry. It offers many exhibits about mathematics, calculation machines and com- puters from the past to the present, including original parts of famous computers 1Bald fahren autonome Fahrzeuge in Deutschland, https://www.faz.net/aktuell/wirtschaft/bald- fahren-autonome-fahrzeuge-auf-deutschen-strassen-17827948.html (visited on 24/05/2022). 2 1.1 The microprocessor which fundamentally changed the world2. The development of the transistor and the microprocessor is comprehensively demonstrated as a timeline by the Computer History Museum near San Jose, California3. The important milestones, which can be found also on the museum websites, are summarised in the following paragraphs. The development of the computer began with the emergence of relay-based calculat- ors in the 1930s. Konrad Zuse developed the first programmable, program-controlled machines in Germany. The most famous is the Z3 machine (1941), which was a purely binary computer and, thus, capable for floating-point arithmetic. At the same time, several relay-based, decimal machines were developed in the USA. Driven by their participation in World War II, the USA needed fast machines for ballistic calculations and for the development of nuclear weapons. The Harvard Mark I machine (1944) by Howard H. Aiken was an early example, which, like the other machines, was slow since based on mechanical relays. J. Presper Eckert and John Mauchly had the idea to replace the slow relays with faster vacuum tubes. They presented the first purely electronic computer at the University of Pennsylvania in 1946: the Electronic Nu- merical Integrator and Computer (ENIAC). Although it was a 1000 times faster than relay-based machines, the ENIAC had the drawback that “programming” consisted in plugging cables, a procedure which could take several days. At that time, John von Neumann, who worked in the atomic bomb projects, was looking for machines capable of doing calculations on detonators and ignition heights. He soon got notice of the ENIAC and joined the ENIAC team to develop a stored-program successor: the Electronic Discrete Variable Automatic Computer (EDVAC, 1952). Its design, the von- Neumann architecture comprising the arithmetic unit, the control unit, the memory, input and output, is the basic principle for most of our modern computers2. The basic ingredient for all computers is the transistor. In its simplest form, it has three contacts: the source, the drain and the gate. In principle, it blocks the current between the source and the drain, until the gate receives a signal. Relays and vacuum tubes were the first “transistors”, which did their job, but could not noteworthy be miniaturised. This inspired researchers to pursue the concept of solid-state transist- ors, realised at the interface of materials with different carrier chemical potentials. Although early ideas had already been proposed in the 1920s, it was not before 1947, until a working point-contact transistor was invented by William Bradford Shockley, John Bardeen and Walter Houser Brattain at Bell Labs. The fundamentality of their invention was acknowledged with the Nobel Prize in Physics in 19564. Short after the group at Bell Labs, Herbert Mataré and Heinrich Welker independently inven- ted the point-contact transistor in Paris in 1948. In 1951, Gordon Teal and Morgan Sparks from Bell Labs fabricated the first working 𝑛𝑝𝑛-transistor. It soon replaced the vacuum tubes in many products, e.g. radios and clocks, until it dominated the world electronics market less than ten years later. The fast growing transistor mar- ket was accompanied by the foundation of several companies, of which some still 2THE INVENTION OF THE COMPUTER, https://www.hnf.de/en/permanent-exhibition/exhibition- areas/the-invention-of-the-computer.html (visited on 14/04/2022) 3TIMELINE, https://www.computerhistory.org/siliconengine/timeline/ (visited on 15/04/2022) 4All Nobel Prizes in Physics, https://www.nobelprize.org/prizes/lists/all- nobel- prizes- in- physics/ (visited on 04/03/2022). 3 1 Introduction exist today: Intermetall (West Germany, today part of TDK-Micronas), Texas Instru- ments (USA) and the Tokyo Telecommunications Company (Japan, renamed to Sony in 1958). Of course, computers were upgraded with transistors as well. The first fully transistorised computer was the TRAnsistor DIgital Computer (TRADIC), developed at Bell Labs in 1954 and used by the U.S. Air Force. In the first half of the 1950s, germanium transistors dominated the market as they are easier to fabricate and can operate at higher frequencies than silicon transistors. However, the operational temperature of germanium devices is limited to a range of 0 ◦C to 70 ◦C, whereas silicon devices work in a broader temperature range of −55 ◦C to 125 ◦C. This excluded germanium devices in particular from aerospace technology, which required devices operating reliably under extreme conditions. Although Mor- ris Tanenbaum from Bell Labs succeeded in developing a working silicon transistor in Jan. 1954, the laboratory misjudged the importance of his work and decided not to pursue the silicon approach any more. Three months later, in Apr. 1954, Gordon Teal, who had changed from Bell Labs to Texas Instruments by the time, succeeded in the synthesis of a silicon transistor as well – unknowingly about Tanenbaum’s work. After this breakthrough, silicon transistors quickly conquered the market and had replaced the germanium transistors by the end of the decade. The 1950s were also the decade in which important fabrication techniques were developed, including dif- fusion processes, oxide masking and photolithography. These techniques combined produced the planar integrated circuit, released in 1960. Finally, the MOSFET permit- ted a fast miniaturisation of the features of integrated circuits. While the first patent for a field-effect transistor was written by Julius Lilienfeld in 1928, the successful fab- rication was not achieved before 1960 by Mohammed M. Atalla and Dawon Kahng at Bell Labs. Since 1964, MOSFET-based integrated circuits have been sold and, since the beginning of the 1970s, the central processing components of computers have been integrated into central processing units (CPUs), called microprocessors today3 5. Moore’s Law Since their breakthrough, planar integrated circuits had smaller features with each new generation so that more transistors could be assembled on a chip of fixed size. Consequently, they became more and more powerful while their costs of produc- tion and the energy demand decreased. In 1965, Gordon Moore stated his famous law in an internal correspondence, telling that the number of transistors per chip grows over time at exponential speed. He estimated the doubling time to 12 months at first and corrected it to two years in 1975. Since then, Moore’s law has been a self-fulfilling prophecy as the semiconductor industry benchmarks its new releases 5In the course of technical break-overs, the topography of companies underwent many changes. The most important one was the foundation of the Shockley Semiconductor Laboratory by William Shockley and Arnold Beckman in 1955. They recruited several capable people, among them Gor- don Moore and Robert Noyce. After differences between Shockley and his employees, Moore, Noyce and six other colleagues left the company to found the Fairchild Semiconductor Corporation in Palo Alto, California, in 1957. This was the birth of Silicon Valley, named after the breakthroughs based on the silicon transistor. Several startups and spin-offs of Fairchild still exist today, among them Intel (from INTegrated ELectronics, founded by Moore and Noyce) and Advanced Micro Devices (AMD, founded by Jerry Sanders, a sales manager at Fairchild who was singed off). 4 1.1 The microprocessor to compete with Moore’s prediction. Although it has been valid for several decades, it encountered problems, of which the first emerged in the 1980s: The growing com- plexity of the chips required an increasing number of production steps and special- ised machines which were distributed over a growing number of companies. This made the costs of developing new chip generations explode. To maintain Moore’s law, the US semiconductor industry initiated a board consisting of engineers from several semiconductor companies worldwide. It developed the International Techno- logy Roadmap for Semiconductors in 1998, which, updated every second year, should facilitate the coordination, identify upcoming problems early and find solutions for them so as to maintain the technological progress [1]. The first principal problem arose in the early 2000s, when the transistors became so dense that the heat could not dissipate any more. The temperature problem forced the manufacturers to limit the clock rates, which have not exceeded a few GHz un- til today. In order to still gain computational power from feature miniaturisation, the processor was divided into cores which each represent a closed processing unit. Provided that a program is parallelisable, i.e. it can be divided into independent chunks which can be distributed over the cores to be run in parallel, the total cal- culative capability of a processor is proportional to the clock rate multiplied with the number of cores [1]. The multi-core strategy maintained Moore’s law until the second principal problem arose. When the feature sizes drop below a few 10 nm, quantum mechanical effects like electron tunnelling produce undesirable leakage currents. The complementary metal-oxide semiconductor (CMOS, a combination of 𝑝-type and 𝑛-type MOSFETs) was the last planar processor technology, which hit the physical limits at the 22 nm technology node6in the 2010s. Smaller features were real- ised with the fin field-effect transistor (FinFET), which exploits the third dimension to a certain extent. In 2020, the most miniaturised processors belonged to the 5 nm technology node with ∼100 million transistors per square millimetre. However, the term “5 nm” is misleading as it refers to the minimum feature width instead of the av- erage half pitch, which is in the range of 10 nm7. This marketing-driven redefinition of the feature sizes cannot obfuscate that the development of the transistor density clearly begins to deviate from Moore’s law8. Further miniaturisation might be phys- ically possible and is actually planned. However, the development costs grow faster than the transistor density as each new processor generation requires a new set of more precise production facilities. Therefore, it is likely that Moore’s law will cease for economical reasons before it reaches the physical limits [1]. The emergence of high-performance computing Scientific research and technological progress have become crucially dependent on the internet and on powerful computing facilities. Today’s science is based on the solution of complex numerical problems and the evaluation of huge amounts of data, 622 nm refer to the half distance between repeating features, the half pitch; 22 nm process, https: //en.wikipedia.org/wiki/22_nm_process (visited on 18/04/2022) 75 nm process, https://en.wikipedia.org/wiki/5_nm_process (visited on 18/04/2022). 8Intel Now Packs 100 Million Transistors in Each Square Millimeter, https://spectrum.ieee.org/ nanoclast/semiconductors/processors/intel- now- packs- 100- million- transistors- in- each- square-millimeter (visited on 18/04/2022). 5 1 Introduction 0 Top 50 sum #1 00 Figure 1.1: Performance devel- #5 opment of the Top500 list [W10]. Green→ sum of the list; Orange → leading machine; Blue→ last machine. The lines are guides for the eyes to demonstrate the exponential growth and its re- cent slowdown. Lists which both exponentially grow in size. Consequently, the calculative demand of re- search has exceeded the capability of local computers, which are limited in perform- ance, even if equipped with multicore CPUs. Therefore, the numerics are outsourced to large servers providing thousands of CPUs: the high-performace computers (HPCs) or supercomputers. A very recent example for HPC-aided science is the research re- lated to the SARS-CoV2 pandemic: HPC resources have been supporting the investig- ations of the interaction between the spike protein of the virus and the human ACE2 receptors, the development of remedies and vaccines, and the analysis of the infec- tion spread9. A further example is the large hadron collider (LHC) at CERN: Even after filtering the experimental data, the LHC produces one petabyte (106 GB) of data per day10, which can only be stored and analysed by huge computing facilities. In addition to the numerics based on classical programs, artificial intelligence (AI) has emerged as a tool for analysing scientific data. The growth in demand for HPC facil- ities is thus expected to remain undiminished. The fast growth of the world’s computing facilities can impressively be demonstrated by the Top500 list of the world’s largest supercomputers11. In 1993, right after my birth, the total peak performance of the Top500 amounted to 1.1 · 1012 floating-point operations per second (FLOPS) or – with the appropriate prefix – 1.1 TFLOPS. To classify this number, consider that a modern smartphone has a peak performance of a few GFLOPS, so the top 500 smartphones in 2022 have more computational power than the Top500 supercomputers in 1993. Today, almost 30 years later, the Top500 peak performance has passed six orders of magnitude – or two prefixes – 9Mit Höchstleistungsrechnern und Data Analytics gegen das Coronavirus, https : / / www . gesundheitsindustrie- bw.de/fachbeitrag/aktuell/mit- hoechstleistungsrechnern- und- data- analytics-gegen-das-coronavirus (visited on 13/04/2022). 10Storage, https://home.cern/science/computing/storage (visited on 13/04/2022). 11PERFORMANCE DEVELOPMENT, https : / / top500 . org / statistics / perfdevel/ (visited on 13/04/2022). 6 Performance 1.1 The microprocessor and amounts to 3.0 EFLOPS. It is a matter of time until performance on the exas- cale is provided by the leading machine on its own. Fig. 1.1 shows that the total performance of the Top500 list grows at exponential speed. From the 1990s to the 2010s, the tenfold-increase time was 4 years. In the last decade, however, the growth has decelerated to a tenfold-increase time of approximately 8 years. Although there is no definite reason for the decline, two points certainly contribute to arising dif- ficulties in setting up more and more powerful supercomputers12. Firstly, Moore’s law, having described the improvement of single processors (single- and multi-core) for decades, approaches its limits. As a consequence, computational power cannot notably improve at the processor level any more. Instead, the number of processors in the HPC facilities grows, which, however, leads to the second problem: Because each processor costs a certain amount of money and consumes a certain amount of electric power, the costs for installing and running a supercomputer inflate with the number of processors. A concrete example for the hugeness of the electric power consumption of modern supercomputers is “Fugaku”13 in Japan, the leading super- computer in 2021 with a peak performance of ∼500 PFLOPS. It is supplied with a power of 30 MW, which corresponds to 260 GWh per annum. This equals the electric power demand of 85 000 German average households in 201914. The exploding operating costs put an economic limit on the growth of the world’s HPC capacity based on the current technology. In particular, the electricity demand bears a power-consumption wall which is already approaching and will become growingly important within the 2020s (see the decline in growth, Fig. 1.1). The power prob- lem even tapers as sustainability policies claim that the significantly growing energy demand of the IT sector must be satisfied by regenerative sources – in addition, to de- carbonising all other electric power generation. Thus, the maintenance of the compu- tational progress will certainly be a tough challenge. As the recent chip crisis demon- strates15, our modern economy is very sensitive towards the availability of more and more capacious computing facilities. A stalling of their growth would entail disrup- tions in every sector of our everyday life and seriously threaten the progress of sci- ence and technology. In order to leave the silicon MOSFET technology, billions of dollars are invested in the exploration of alternative concepts for integrated circuits which permit miniaturising the features, increasing the clock rates and curbing the energy consumption. The two concepts relevant for this work are the field of aniso- tropic, metallic nanowires, to which the rare-earth silicide nanowires belong, and spintronics, for which the Sb@Bi2Se3 heterostructures may be interesting. A break- 12TOP500 Meanderings: Sluggish Performance Growth May Portend Slowing HPC Market, https://www. top500.org/news/top500-meanderings-sluggish-performance-growth-may-portend-slowing-hpc- market/ (visited on 13/04/2022). 13TOP 10 Sites for November 2020, https://top500.org/lists/top500/2020/11/ (visited on 13/04/2022). 14Stromverbrauch der privaten Haushalte nach Haushaltsgrößenklassen, https://www.destatis.de/ DE / Themen / Gesellschaft - Umwelt / Umwelt / UGR / private - haushalte / Tabellen / stromverbrauch - haushalte.html (visited on 24/05/2022). 15Halbleiter-Knappheit – Wie der Chipmangel überwunden werden kann, https://www.faz.net/aktuell/ wirtschaft/digitec/chipmangel- wie- das- problem- ueberwunden- werden- kann- 17682182.html (visited on 24/05/2022), Ende der Chip-Engpässe könnte noch auf sich warten lassen, https://www. faz.net/agenturmeldungen/dpa/ende- der- chip- engpaesse- koennte- noch- auf- sich- warten- lassen-18027141.html (visited on 24/05/2022). 7 1 Introduction through in novel processor technology would end the age of silicon-based MOSFET processors, which have driven the scientific and economic progress for almost 60 years. The replacement of the vacuum tubes by solid-state transistors in the 1950s proves how quick such an adaption can proceed. 1.2 The structure of this work This thesis is organised in two parts, which are each dedicated to one of the two above-mentioned nano-scaled systems. Both systems are derivable from higher di- mensional structure prototypes, which are investigated as well since their properties are necessary for understanding the lower dimensional systems. At the beginning of each part, the respective systems are introduced in a general manner. The first chapter of each part contains preparatory remarks which are necessary for under- standing the particular methods employed in that part. The subsequent chapters each treat one system, beginning with an overview over the literature and the for- mulation of the scientific questions, and concluding with a discussion of the results. In detail: Part I treats the rare-earth silicide nanowires in four chapters: the chemical back- ground of the constituents (Chap. 3), the bulk phases of RESi2 – x (Chap. 4), the mono- layer TbSi2 film on Si(111) (Chap. 5), and the TbSi2 nanowires on Si(557) (Chap. 6). Part II treats the thin antimony layers on Bi2Se3 in three chapters: the concept of geo- metric phases (Chap. 7), the bulk phases of Bi2Se3 and Sb (Chap. 8), and the Sb@Bi2Se3 heterostructures (Chap. 9). The thesis concludes with Chap. 10, where also an outlook on future work is given. Part of the findings on the nano-scaled systems have already been published. The own and co-authored publications are labelled by a separate indexing system, pre- fixed by the letter “P”. Whenever this thesis addresses those publications, they are referenced accordingly. The full publication list can be found on p. A24. Before the investigations start, the general methods are shortly introduced in Chap. 2, with focus on the density functional theory and its extensions. 8 2 Methods The stationary condensed-matter problem consists in solving the eigenvalue equa- tion for the Hamilton operator for 𝑁K nuclei with positions {𝑹𝑘} and momenta {𝑷𝑘}, and 𝑁e electrons with positions {𝒓𝑖} and momenta {𝒑𝑖}: 𝐻 = 𝑇K({𝑷𝑘}) + 𝑇e({𝒑𝑖}) +𝑉e–e({𝒓𝑖}) +𝑉K–K({𝑹𝑘}) +𝑉e–K({𝒓𝑖}, {𝑹𝑘}) (2.1) where 𝑇K({𝑷𝑘}) are the kinetic energies of the nuclei; 𝑇e({𝒑𝑖}) are the kinetic ener- gies of the electrons;𝑉e–e({𝒓𝑖}) are the electron-electron interactions;𝑉K–K({𝑹𝑘}) are the nucleus-nucleus interactions; 𝑉e–K({𝒓𝑖}, {𝑹𝑘}) are the electron-nucleus interac- tions. Calculating the eigenspace of Eq. (2.1) is a quantum-mechanical “𝑁 -particle problem”, which is insoluble – even within numerics – as 𝑁 ∼ 1024 ≈ 1 mol in typical solid-state systems. Therefore, methods are necessary which simplify the problem. This chapter presents the main methods employed in this work: the density func- tional theory (DFT) and the Bloch theorem. While the DFT is quite familiar to most condensed-matter scientists, its extensions might be less known since they are rel- evant only in special situations. The three approaches relevant for this work are the LDA+U method, spin-orbit coupling (SOC) and van der Waals (vdW) corrections, which are each shortly introduced. Both the DFT and the Bloch theorem are the fun- dament of many plane-wave codes by means of which the properties of a solid-state system can be calculated. The codes relevant for this work are the Vienna Ab initio Simulation Package (VASP) [2, 3] and Quantum Espresso (QE) [4, 5]. 2.1 Density functional theory The first simplification of the solid-state problem consists in the Born-Oppenheimer approximation. It is based on the assumption that the light electrons follow the move- ment of the much heavier nuclei instantaneously, hence also called adiabatic approx- imation. Consequently, the electronic subsystem can be separated from the dynamics of the nuclei in that the nuclear positions {𝑹𝑘} enter the electronic Hamiltonian 𝐻e merely as parameters. The electronic ground state energy 𝐸e({𝑹𝑘}) in turn determ- ines the potential of the nuclear Hamiltonian 𝐻K [6, pp. 27ff]. 𝐻e({𝑹𝑘}) = 𝑇e({𝒑𝑖}) + 𝑉e–e({𝒓𝑖〉}) +𝑉e–K({𝒓𝑖} | {𝑹𝑘}) +〉𝑉K–K({𝑹𝑘}) 𝐻e({𝑹 𝑘}) Ψ({𝑹𝑘}) = 𝐸e({𝑹 }) 𝑘 Ψ({𝑹𝑘}) (2.2) 𝐻K = 𝑇K({𝑷𝑘}) + 𝐸e({𝑹𝑘}) 9 2 Methods As the electronic and the nuclear subproblems are disentangled, they can be treated independently. A very common approach is to treat the electronic system quantum- mechanically, while the nuclei (or ions if core electrons are attached) move in their potentials 𝐸e({𝑹𝑘}) according to classical mechanics. The electronic system can be simplified by exploiting symmetries. Most importantly, the translation symmetry of crystals reduces the infinite number of electrons to a finite one, according to the Bloch theorem (vide infra). Furthermore, the symmetries of the space group remove redundancies in the unit cell. Although the symmetries can reduce the number of interacting electrons to ∼100, the computational demand of directly solving the 𝑁 - particle Schrödinger equation will be still huge. Walter Kohn, who was awarded with the Nobel prize in chemistry for the DFT, impressively demonstrates this in his Nobel prize lecture in 1999 [7, p. 1257]: For an exemplary accuracy of O(10−2) of the total energy, each degree of freedom contributes 3 ≤ 𝑝 ≤ 10 parameters to the Hilbert space of the multi-electron system, resulting in a total number of parameters of 𝑀 = 𝑝3𝑁 . (2.3) Even if 𝑝 is set to the lower boundary, the number of parameters for 𝑁e = 100 elec- trons will exceed any imagination: 𝑀 = 33·100 ≈ 10143. Kohn states that he “cannot foresee an advance in computer science which can minimize a quantity in a space of 10150 dimensions” and referred to Eq. (2.3) as an “exponential wall” [7]. Today, 20 years later, one of the fastest supercomputers in the world is the Hawk machine in Stuttgart, Germany. If its peak performance of 𝑃 = 26 PFLOPS [W16] was used for calculating a quantity like the total energy depending on 𝑀 parameters, the compu- tation time would be 𝑇 = 𝑀/𝑃 ≈ 10127 s at the minimum. For comparison, the age of the universe amounts to 1018 s. The basic idea of the DFT is that knowing observable quantities of a solid-state sys- tem like the total energy 𝐸 and the ground state electron density 𝑛(𝒓) are sufficient for many purposes. The detailed electronic many-particle wavefunction |Ψ⟩ is just a means to an end. The bijective link between these three quantities is ensured by the Hohenberg-Kohn theorem: 1) The electronic ground-state density 𝑛(𝒓) is uniquely connected to the external potential 𝑣ext(𝒓) and 2) this ground state density minim- ises the total energy 𝐸 of the system [7, 8]. The second part of the theorem leads to the energy functional 𝐸[𝑛(𝒓)]. 𝐸∫[𝑛(𝒓)] = 𝑇s [𝑛(𝒓)] +𝑉 [𝑛(𝒓)] +𝑈 [𝑛(𝒓)]∫+ 𝐸xc∫[𝑛(𝒓)] (2.4)with ′ 𝑉 [𝑛(𝒓)] ( ) ( ) [ ( )] 1 ′ 𝑛 (𝒓)𝑛(𝒓 )= d𝑟 𝑛 𝒓 𝑣ext 𝒓 and 𝑈 𝑛 𝒓 = d𝑟 d𝑟 2 𝒓 − 𝒓′ The total energy in Eq. (2.4) comprises four terms: • the single-particle kinetic energy 𝑇s [𝑛(𝒓)] of the electrons • the interaction between the electrons gas and the external potential 𝑉 [𝑛(𝒓)] 10 2.1 Density functional theory • the classical Coulomb energy of the electron gas𝑈 [𝑛(𝒓)] (Hartree term) • the exchange-correlation functional 𝐸xc [𝑛(𝒓)] The last term, often abbreviated to xc-functional, contains any many-particle effects: the part of the kinetic energy not covered by 𝑇s, the exchange energy between elec- trons with parallel spins and the correlation due to electrostatic repulsion. Eq. (2.4) can formally minimised by introducing the Lagrange parameter 𝜀 to reflect the boundary cond∫ition of particle-number∫conservation[: ] 𝑁e − d𝑟 𝑛(𝒓) = 0 ⇒ 0 = d𝑟 𝛿𝑛(𝒓) · 𝛿𝐸[𝑛(𝒓)] ( ) − 𝜀 (2.5)𝛿𝑛 𝒓 Kohn and Sham realised that Eq. (2.5) appears in the self-consistent Hartree scheme. This scheme considers non-interacting electrons which move in an effective poten- tial given by the external potential and their own electron gas. The correspond- ing energy-density functional comprises 𝑇s [𝑛(𝒓)], 𝑉 [𝑛(𝒓)] and 𝑈 [𝑛(𝒓)]. Thus, the Hartree scheme can be adapted to the Hohenberg-Kohn functional (Eq. (2.4)) by adding the functional derivative of 𝐸xc [∑︁𝑛(𝒓)] to the effective potential [7, 9].𝑁e 2 [ 𝑛(𝒓) = 𝑓 𝑖 𝜓𝑖 (𝒓) ] (2.6a)𝑖=1 𝜀𝑖𝜓𝑖 (𝒓) = −1 22∇∫ + 𝑣ext(𝒓) + 𝑣H(𝒓) + 𝑣xc(𝒓) 𝜓𝑖 (𝒓) (2.6b)with ( ) 𝛿𝑈 [𝑛(𝒓)] ′ 𝑛(𝒓′) ( ) 𝛿𝐸xc [𝑛(𝒓)]𝑣H 𝒓 = ( ) = d𝑟 ′ and 𝑣xc 𝒓 =𝛿𝑛 𝒓 𝒓 − 𝒓 𝛿𝑛(𝒓) (2.6c) The coefficients 𝑓𝑖 ∈ [0, 1] in Eq. (2.6a) are occupation numbers which fill the ener- getically lowest eigenstates according to a function of choice, e.g. finite-temperature smearing. The Fermi energy is defined as the chemical potential of the fermions, coin- ciding with the highest occupied level if the occupation function is a step function. Eqs. (2.6), the Kohn-Sham equations, define a self-consistent scheme for obtaining the electronic ground state: 1) Start with a trial charge density 𝑛(𝒓) and calculate the potentials 𝑣H(𝒓) and 𝑣xc(𝒓) with Eq. (2.6c). 2) Calculate the eigenvalues and eigenstates {(𝜀𝑖 , 𝜓𝑖 (𝒓))𝑖} of Eq. (2.6b). 3) With these states, calculate the new charge density with Eq. (2.6a) and iterate. The above steps are looped until convergence is reached according to a stop criterion, e.g. the constancy of the total energy 𝐸[𝑛(𝒓)] (Eq. (2.4)). The Kohn-Sham scheme maps the many-particle problem of 𝑁e interacting electrons onto an effective system of single-particle differential equations for 𝑁e non-interacting fermions. Therefore, it considerably reduces the computational demand of the solution. The only term which depends on the concrete system is the external potential 𝑣ext(𝒓), while all other 11 2 Methods terms depend uniquely and universally on the charge density. Although the Kohn- Sham scheme is formally exact, the unknown xc-functional 𝐸xc [𝑛(𝒓)] is a source of uncertainty as it has to be approximated, despite its universality (vide infra). All density-related quantities are, in principle, observable: the ground state density 𝑛(𝒓), the ground state energy 𝐸[𝑛(𝒓)] and, hence, all derived quantities, e.g. the ionic forces. On the contrary, the Lagrange parameters 𝜀𝑖 , having the dimension of an en- ergy, and the single particle orbitals 𝜓𝑖 (𝒓) are mere mathematical artefacts which have no physical meaning a priori. In particular, they are not necessarily observable and can differ considerably from real excitation energies, even if an exact represent- ation of the xc-functional was given. Several approximations for the xc-functional 𝐸xc [𝑛(𝒓)] exist. The best-known is the local density approximation (LDA) which approximates 𝐸xc [𝑛(𝒓)] locally by the value for a respective homogeneous electron gas. While systems with slowly varying dens- ity can be described well, inhomogeneous systems can incur systematic failures. This can be corrected (and is then likely to be overcorrected) by taking also the gradi- ents of the local density into account, leading to the generalised-gradient approxim- ation (GGA). Perdew et al. proposed a simple implementation of GGA, the Perdew- Burke-Ernzerhof functional (PBE), which avoids several weaknesses of former GGA approaches and in which all parameters are fundamental constants [10]. Although PBE works quite well for atoms and molecules, it overcorrects the overbinding of LDA in solids, so bulk lattice constants are slightly too large by ∼1 %. This issue is ad- dressed by the Perdew-Burke-Ernzerhof functional revised for solids (PBEsol) [11]. The xc-functional is not restricted to scalar electronic densities, but it can be exten- ded to spinorial densities. In this case, the density is a (2 × 2) matrix 𝑁 (𝒓). The scalar electron density is then the trace of 𝑁 and the magnetisation densities are the expectation values of the Pauli matrices {𝜎(𝑖}: ) 𝑛(𝒓) = tr(𝑁 (𝒓) ) (2.7) 𝑚𝑖 (𝒓) = tr 𝜎𝑖𝑁 (𝒓) The spinor components couple via the xc-functional, which depends on the scalar electron density 𝑛(𝒓) and the total magnetisation density |𝒎(𝒓) |. As the derivatives of the xc-functional yield a (2 × 2) potential, non-collinear magnetism emerges by this means [12]. Although this approach allows the magnetic moments to assume certain angles to each other, the magnetic structure is still isotropic, i.e. if all magnetic densities are rotated by a rigid angle, the total energy remains the same. Magnetic anisotropy follows from SOC (vide infra). 2.1.1 The LDA+U method The approximative xc-functionals describe the total-energy-related quantities well if the electrons are delocalised. In some cases, however, DFT fails in finding the cor- rect ground state and yields considerably wrong energies and forces. In particular, 12 2.1 Density functional theory systems with strong electronic correlation are problematic because DFT tends to de- localise electrons and, thus, to weaken correlation effects. An example relevant for this work consists in valence electronic multiplets which populate nearly atomic or- bitals. The electronic correlation is then subjected to Hund’s rules, which force the electrons into a certain configuration concerning spin and magnetic quantum num- bers. In particular, the occupations of the single-particle subshell orbitals |𝜑𝑚⟩ are integral, ergo 0 or 1. These on-site correlation effects prevail in localised d shells and, even more pronounced, in localised f shells. The LDA+U method, on which Co- coccioni wrote a comprehensive discussion [13], addresses this issue. The approach is based on the orthogonality relations of quantum states, i.e. a loc- alised subshell projects the delocalised electrons forming the chemical bonds out. As a consequence, the intracorrelation of the subshell not covered by DFT is much stronger than the intercorrelation between the subshell and the outer electrons. This permits a selective correction of the former. A suitable ansatz is the Hubbard model, which, being a tight-binding approach, is naturally constructed from a localised basis set. In detail, each localised subshell orbital |𝜑𝐼𝑚⟩ with index 𝑚 and belonging to the atomic site 𝐼 interacts with each subshell orbital |𝜑𝐼𝑚′⟩ on the same site. The sum of the pairwise interactions, the Hubbard energy 𝐸Hub, is added to the DFT energy func- tional 𝐸LDA. In order to avoid double-counting, the correlation energy 𝐸dc which DFT already includes has to be subtracted. It is calculated from the mean-field interaction of the electrons on the respective site, which depends only on their total number. 𝐸LDA+U [𝑛(𝒓)] = 𝐸 𝐼𝜎 𝐼𝜎LDA [𝑛(𝒓)] + 𝐸Hub [{𝑛𝑚𝑚′}] − 𝐸dc [{𝑁 }] (2.8) where 𝑛(𝒓) is the DFT electron density and 𝜎 the spin. 𝑛𝐼𝜎𝑚𝑚′ is the on-site density matrix defined by the projections of the Kohn-Sham eigenfunctions |𝜓𝜎⟩ (Eq. (2.6b)) 𝑖 on the localised basis set {|𝜑𝐼𝑚⟩} of site 𝐼 . {𝑁 𝐼𝜎∑︁ } is the total number of electrons inthe subshell of site 𝐼 . 〈 〉〈 〉 𝑛𝐼𝜎 = 𝑓 𝜎 𝜓∑︁𝜎 𝜑𝐼 𝜑𝐼 𝜎𝑚𝑚′ ′ 𝑖 𝑖 𝑚 𝑚 𝜓𝑖 (2.9a)𝑖 𝑁 𝐼𝜎 = 𝑛𝐼𝜎𝑚𝑚 (2.9b) 𝑚 where 𝑓 𝜎 is the occupation function. 𝑖 In the one-band version of the Hubbard model, each site 𝐼 can be occupied by two electrons with opposite spin. Only electron pairs on the same site increase the po- tential energy of the Hamiltonian by the parameter𝑈 > 0 representing the repulsive Coulomb interaction. In the limit of zero-hopping (total localisation), the Hubbard total energy comprises only the 𝑈 contr∑︁ibutions. If the sites are inequivalent, theparameter𝑈 depends on 𝐼 . 𝐸 = 𝑈 𝐼𝑛𝐼↑𝑛𝐼↓Hub,1 (2.10) 𝐼 where 𝑛𝐼𝜎 is the occupation number of site 𝐼 and spin 𝜎. If two or more orbitals interact on the same site, the total Coulomb interaction equals the sum of two-particle 13 2 Methods interactions between all pairs of electrons with different orbital and/or spin quantum numbers on that site. In this case, the total energy comprises not only the Coulomb repulsion between each two electrons, which is modelled by the parameter𝑈 . It has also a term describing the attractive exchange interaction between electrons with parallel spin, which is modelled by a second parameter 𝐽 > 0. The Hubbard energy functional 𝐸 [{𝑛𝐼𝜎Hub 𝑚∑︁𝑚′}[] is the∑︁n: ∑︁ ∑︁ ]𝐼 𝐼 𝐸 [{𝑛𝐼𝜎 }] 𝑈= [ 𝑛𝐼𝜎 ′ 𝑛𝐼𝜎 − 𝐽 𝑛𝐼𝜎 𝐼𝜎Hub 𝑚𝑚′ ′ ′ 𝑛 ′ ′∑︁ 2 𝑚𝑚 𝑚 𝑚 2 𝑚𝑚 𝑚 𝑚𝐼 𝑚,𝜎≠𝑚′𝜎′ 𝑚≠𝑚′ 𝜎 ]𝑈 𝐼 ∑︁ 𝐼 𝐼 ∑︁ = 𝐼𝜎 𝐼 (−𝜎) 𝑈 − 𝐽 𝑛 𝑛 𝐼𝜎 𝐼𝜎 2 𝑚𝑚 𝑚′𝑚′ + 𝑛 2 𝑚𝑚 𝑛𝑚′𝑚′ (2.11) 𝐼,𝜎∑︁ [ 𝑚,𝑚′ ( 𝑚≠𝑚′ )]Eq. (2.9b) 𝑈 𝐼 𝐼 𝐼 ∑︁ = 𝑁 𝐼𝜎𝑁 𝐼 (−𝜎) + 𝑈 − 𝐽 𝑁 𝐼𝜎𝑁 𝐼𝜎 − 𝑛𝐼𝜎𝑚𝑚𝑛𝐼𝜎2 2 𝑚𝑚 𝐼,𝜎 𝑚 The double counting term 𝐸 [{𝑁 𝐼𝜎dc }] is approximated by the mean-field interaction of the total number of electrons. If 𝑁↑ and 𝑁↓ are integers, there are 𝑁↑ ·𝑁↓ antipar- allel spin pairs and 𝑁𝜎 · (𝑁𝜎 − 1)/2 parallel spin pairs for each spin. The former con- tribute only repulsive 𝑈-terms, the latter repulsive 𝑈-terms and attractive 𝐽-terms. This can be extrapolat∑︁ed t[o∑︁non-integral {𝑁 𝜎}∑︁. ]𝑈 𝐼 𝐼[{ 𝐼𝜎}] 𝐼𝜎 𝐼 (−𝜎) + 𝑈 − 𝐽 𝐼𝐸dc 𝑁 = 𝑁 𝑁 𝑁 𝐼𝜎 (𝑁 𝐼𝜎 − 1) (2.12)2 2 𝐼 𝜎 𝜎 Inserting Eqs. (2.11) and (2.12) into Eq. (2.8) and using Eq. (2.9b) gives the total cor- rection of the DFT energy functional.∑︁ ( )𝐼 𝐼 ∑︁ 𝐸LDA+U [𝑛(𝒓)] [ ( )] + 𝑈 − 𝐽 = 𝐸 𝐼𝜎LDA 𝑛 𝒓 ∑︁ 𝑁( − 𝑛 𝐼𝜎 𝐼𝜎 2 𝑚𝑚 𝑛𝑚𝑚 𝐼𝜎 𝑚 ) (2.13)(∗) 𝑈 𝐼 − 𝐽 𝐼 = 𝐸LDA [𝑛(𝒓)] + tr(?̂?𝐼𝜎) − tr( [?̂?𝐼𝜎]2)2 𝐼𝜎 Step (∗) uses that the atomic subshells have a finite and orthonormal basis, the spher- ical harmonics. Thus, each configuration has an axis along which the density matri- ces are diagonal. Replacing the them by the density operators ?̂?𝐼𝜎 renders Eq. (2.13) invariant under unitary transformations of the localised basis set [13]. The Hubbard approach to correct the correlation in the DFT is called LDA+U method since the𝑈 is the most important parameter. It is not restricted to LDA, but it can also be adapted to other functionals like GGA. Eq. (2.13) is the rotationally invariant and simplified LDA+U formulation introduced by Dudarev et al., where the parameters𝑈 and 𝐽 represent the spherically averaged Coulomb integrals [14, 15]. The more gen- eral formulation employs explicitly calculated screened Coulomb integrals between 14 2.1 Density functional theory the subshell wavefunctions for 𝐸Hub, as proposed by Liechtenstein et al. Neverthe- less, the double-counting energy 𝐸dc has still to be parametrised by𝑈 and 𝐽 [16]. The simplified formulation in Eq. (2.13) demonstrates important properties of the LDA+U correction. The additive to the DFT functional is always positive if the density matrix is not idempotent1. Conversely, if ?̂? is idempotent, i.e. all eigenvalues are 0 or 1, the correction term vanishes. In other words, the LDA+U method penalises fractional oc- cupations and drives the subshell into a state where all orbitals are fully occupied or empty. Nota bene, the LDA+U correction in Eq. (2.13) does not explicitly encourage spin polarisation (Hund’s second rule) as the effective (𝑈 − 𝐽)-term does not include a magnetisation-dependent penalty. An important deficiency of the LDA+U method is the introduction of parameters to the DFT approach, which is otherwise ab initio and avoids empirical parameters. The parameters𝑈 and 𝐽 for a particular system are often determined by tuning them so that the materials properties match the experiment (e.g. band gaps, lattice con- stants). However, this approach opens a gateway for systematic errors because it can overcorrect other, erroneous effects and thus obfuscate weaknesses of the model. A more sophisticated approach is to calculate the parameters self-consistently from lin- ear response, as introduced by Cococcioni. In detail, a small penalty 𝛼𝐼 is applied to the Kohn-Sham potential of a certain site 𝐼 and the effect on the on-site occupations 𝑁𝐾 = 𝑁𝐾↑ + 𝑁𝐾↓ is measured for sites 𝐾 . This leads to the susceptibility 𝜒. d𝑁𝐾 𝜒𝐾𝐼 = (2.14)d𝛼𝐼 The inverse of 𝜒 basically equals the negative Hubbard𝑈 , according to the quadratic dependence of the double-counting term on the on-site density 𝑁 𝐼 (Eq. (2.12)). How- ever, a term from the electronic rehybridisation at constant total density, leading to the susceptibility 𝜒0, has to be subtracted [13]. 𝑈 𝐼 = (𝜒−10 − 𝜒−1)𝐼𝐼 (2.15) The procedure of determining𝑈 can be summarised as follows: 1) Calculate the self-consistent DFT ground-state density of the system in question. 2) Apply an on-site penalty 𝛼𝐼 to site 𝐼 and calculate the respective on-site occupa- tions 𝑁𝐾(𝛼𝐼) within a non-self-consistent calculation at the fixed ground-state density. This yields 𝜒0 by Eq. (2.14) and finite differences. 3) Repeat 2), but let the charge relax (self-consistent charge density with penalty). This yields 𝜒 by Eq. (2.14) and finite differences. 4) Calculate𝑈 𝐼 with Eq. (2.15). 1A quadratic matrix ?̂? is idempotent, if ?̂?2 = ?̂?. Because tr(?̂?) = 1 and ?̂?∗ = ?̂?, the Cauchy-Schwarz inequality holds: tr(?̂?2) ≤ tr(?̂?). 15 2 Methods 2.1.2 Spin-orbit coupling in DFT The steep potentials of nuclei with high atomic numbers give rise to relativistic ef- fects, which must be considered if a material contains elements of the fifth row of the periodic table and beyond. In particular, the elements of the sixth row, the last row containing stable isotopes, are subjected to a considerable relativistic contraction of the atomic shells. As a response to relativistic corrections, the 6s radius shrinks by a few percent for the lighter elements, by ∼10 % for the heavier ones and by 15 % for gold (Au, 𝑍 = 79), a local extremum [17, pp. 372f]. The fully relativistic version of quantum mechanics is based on the Dirac equation which fulfils the prerequisite of invariance under Lorentz transformations. Being a system of four coupled differential equations, the Dirac equation is demanding, even for a single particle, and contains much redundant information for the purposes of condensed-matter theory. The two approximations to make the relativistics suitable for solid-state systems are the elimination of the small component and a truncated series expansion of the resulting Hamilton operator2. The zeroth-order approxim- ation of the correct expansion yields the following Hamilton operator, called CPD Hamiltonian after Chang, Pelissier, and Durand [19, 20]: ( ) 1 𝑉 (𝒓) −1 𝐻0 = 𝑉 (𝒓) + (𝝈 · 𝒑)𝜅(𝒓) (𝝈 · 𝒑) with 𝜅(𝒓) = 1 − (2.16)2𝑚0 2𝑚0𝑐2 where𝑉 (𝒓) is the potential, 𝑚0 the rest mass of the electron, 𝑐 the speed of light, and 𝒓 and 𝒑 the canonical variables. As 𝝈 = (𝜎𝑥 , 𝜎𝑦, 𝜎𝑧) is the vector of Pauli matrices, the Hamilton operator in Eq. (2.16) acts on a 2-spinor. Test calculations on the uranium atom (U, 𝑍 = 92), in which all core and valence states are affected by relativistics, show that the zeroth order CPD Hamiltonian is quite accurate. It predicts the valence levels very close to those calculated with the full Dirac equation and fails only in predicting the levels of the most inner K and L shells [19]. The commutator relation [𝑟𝑖 , 𝑝 𝑗] = iℏ𝛿𝑖 𝑗 and the algebra of the Pauli matrices lead to the following, more interpretable representation of 𝐻0: (𝝈 · 𝒂) (𝝈 · 𝒃) (= 𝒂 · 𝒃 + i𝝈) · (𝒂 × 𝒃) (∗) [ ( )] Eq. (A.93) −∇𝒑 𝑟𝑉 (𝒓) −iℏ, 𝜅 𝒓 = −iℏ(𝜅 ′(𝒓) ) = 𝜅 2(𝒓) ∇ 2 2 ( 2 ( 2 ) ) 𝑟𝑉 (𝒓) (∗∗)𝑚0𝑐 𝑚0𝑐(∗) 𝐻0 = 𝑉 (𝒓) + 1 · i𝒑 (𝜅(𝒓)𝒑) + 𝝈 · 𝒑 × 𝜅(𝒓)𝒑2𝑚0 2𝑚0 (2.17)(∗∗) = 𝑉 (𝒓) + 1 𝒑 · 𝜅(𝒓) ℏ𝒑 + 𝜅2( )2 (𝒓) 𝝈 · (∇2 2 𝑟𝑉 (𝒓) × 𝒑)𝑚0 𝑚0𝑐 2An interesting issue of these expansions concerns in the convergence of the series. The traditional expansion in (𝐸 − 𝑉 )/2𝑚 𝑐20 causes severe problems because electrons have non-zero probability densities near the singularities of a Coulomb potential wherefore the expansion parameter diverges. The expansion in 𝐸/(2𝑚0𝑐2 −𝑉 ) leads to a reliable and fast convergence [19]. 16 2.1 Density functional theory According to Eq. (2.17), the main relativistic effects can be summarised as two con- tributions: an alteration in the kinetic energy which, in a simplifying manner, cor- responds to the relativistic mass increase and the spin-orbit coupling (SOC) term. The name of the latter name becomes obvious after inserting a spherically symmetric potential as in the case of single atoms: 𝐻atom 1 ( ) 0 = 𝑉 (𝒓) + 𝒑 · 𝜅(𝒓)𝒑 + 𝐻2 SOC𝑚0 (2.18) ℏ 1 d𝑉 with 𝐻SOC = ( )2 𝜅 2(𝒓) 𝝈 · 𝑳 and 𝑳 = 𝒓 × 𝒑 2𝑚0𝑐 𝑟 d𝑟 Solving the eigenspace of Eq. (2.18) for solid-states is still demanding wherefore the code package used in this work, VASP, uses a further approximation: As the relativist- ics is expected to prevail solely in the vicinity of the atomic cores3, the SOC Hamilto- nian can be assumed to act solely within the PAW spheres, which are introduced by the projector-augmented wave (PAW) method [21, 22]. As a consequence, 𝐻SOC can be expressed solely by means of the PAW projectors which mix the on-site densit- ies of the pseudo-spinor [23]. Moreover, since the PAW projections are still resolved with respect to orbital quantum numbers and spin, it is possible to track the atomic orbitals upon forming chemical bonds even with SOC included. 2.1.3 Van der Waals corrections Since the correlation part of 𝑣xc is local in the charge density or its gradients, DFT bears the general weakness that it describes long-range electronic correlation insuf- ficiently. This is less problematic for systems in which all bonding is covalent or ionic as the atomic overlap lets the xc-functional account for the major part of correlation. However, it can deteriorate the results for materials in which van der Waals interac- tions dominate the bonding between closed-shell entities. Van der Waals (vdW) interactions, more precisely the London dispersion effects4, arise from fluctuations in charge distributions which polarise the environment. In the demonstrative case of neutral molecules, the temporary dipole moment emerging in one molecule induces dipole moments in the adjacent molecules, which results in a net attractive force. The fundament of such fluctuations consists in the quantum nature of charge, which is the probability density of quantum particles. The mutual dependence of the temporary dipoles leads to their quasi-synchronised occurrence and is thus a type of electronic correlation. 3If the potential vanishes, 𝜅(𝒓) is close to unity. Furthermore, the derivative of 𝑉 (𝑟) vanishes fast for Coulomb-like 1/𝑟 potentials. Thus, Eq. (2.16) adopts the form of the non-relativistic Hamilton operator 𝐻 = 𝑉 (𝒓) + 𝒑2/2𝑚0 on domains excluding the neighbourhood of the nuclei. 4The original work on interactions between neutral molecules by Johannes Diderik van der Waals included attractive and repulsive forces of differing nature. One of these are the attractive inter- actions from short-frequency perturbations between polarisable molecules, which were described by Fritz London. He called these interactions the “dispersion effect” as he found that knowledge about the frequency-dependence of the response of the charge densities towards short-frequency perturbations is necessary for correctly calculating the same [24]. 17 2 Methods The common mistake to avoid when considering vdW interactions consists in the as- sumption that these interactions are generally small. While this is indeed true for the mutual attraction between isolated atoms, it proves to be completely wrong in dense systems where vdW interactions dominate the intermolecular bonding. For ex- ample, the boiling points of hydrogen halides increase with growing halogen atomic numbers, despite the decrease of the static dipole moments. In this case, vdW in- teractions obviously exceed the interactions between the static dipoles. Not only do vdW interactions influence the intermolecular bonding, but they can also influence the intramolecular bonding. For instance, the alkanes are often employed to demon- strate the positive relation between vdW-related intermolecular attraction and chain length. However, they are also subjected to intramolecular vdW interactions which stabilise branched molecules with respect to the linear ones [25]. Further examples where vdW interactions play an important role are “the structures of DNA and pro- teins, the packing of crystals, the formation of aggregates, host-guest systems, or the orientation of molecules on surfaces or in molecular films” [26]. Being a dynamic long-range correlation effect, vdW interactions are complicated to describe and nu- merically expensive. In particular their introduction into DFT is subject of a vivid research activity which tries finding a compromise between numerical feasibility, accuracy and avoiding empirical parameters. The large number of approaches can be classified by their level of sophistication [25]. It is therefore obligatory to test the different vdW schemes in order to find the method which is sufficiently accurate for the respective problem while saving computational resources. The DFT-D2 method introduced by Grimme will be the most important method in this work. It is a semi-empirical approach which adds the 𝑅−6-dependent dipolar dispersion correction for each pair of inter∑︁acting atoms [26]:𝐼 𝐽 𝐸DFT-D2 𝑠 𝐶 = − 6 6disp 6 𝑓 (𝑅2 𝐼 𝐽 ) (2.19) ≠ 𝑅𝐼 𝐽 𝐼 𝐽 where 𝑅𝐼 𝐽 is the distance between the atoms 𝐼 and 𝐽 and 𝑠6 is a universal scaling constant which depends on the xc-functio√︃nal. The coefficients 𝐼 𝐽 𝐶 = 𝐶𝐼6 6 · 𝐽 𝐶6 are the geometric mean of the 𝐶6 coefficients of the single atoms. These, in turn, are proportional to the respective static dipolar polarisabilities, i.e. they describe how susceptible the atoms are towards vdW interactions. 𝑓 (𝑅𝐼 𝐽 ) is a damping function limiting the vdW interactions if the atoms are too close to each other. The paramet- ers are fitted to match experimental reference and are considered universal for the employed atomic species [26]. A bit more sophistication provides the DFT-D3 method by Grimme et al. [27]. This method complements the 𝐶6 contributions from the dipole-dipole dispersion by the 𝐶8 contributions from the dipole-quadrupole dispersion. In addition, the coefficients are not fixed any more, but they scale with the fractional coordination number of the respective atoms to partially reflect different chemical environments. Moreover, the 18 2.2 The Bloch theorem parameter fitting is more ab initio. The original DFT-D3 method employs a damp- ing function which scales the dispersion to zero for small atomic distances (zero- damping) [27]. Later, a Becke-Johnson-type damping was introduced, which reduces the dispersion to a finite value instead of zero and enhances short vdW contacts [25, 28]. The last vdW scheme tested in this work is the method introduced by Tkatchenko and Scheffler [29]. It is formally identical to the DFT-D2 method, however the vdW coefficients reflect the variation of the atomic charges with respect to the neutral atoms by means of Hirshfeld partitioning. This procedure explicitly considers the influence of the chemical environment on the polarisabilities in an ab initio man- ner. The original method [29], however, has problems with ionic systems, in which charge transfer alters the atomic reference charges. The solution consists in shifting the reference from the neutral atoms to the atoms in a charged state by means of an iterative Hirshfeld partitioning scheme [30, 31]. The construction of the above semi-empirical methods shows why vdW interactions become growingly important in materials involving elements with high atomic num- bers: Since the 𝐶 coefficients are proportional to atomic polarisabilities, atoms with larger electron shells contribute more dispersion to the total energy. Therefore, in particular the elements of the last rows of the periodic table are expected to have high vdW-related effects on the structures which incorporate them. 2.2 The Bloch theorem Most solid-states are periodic, i.e. their properties are completely described on a finite, microscopic domain which is replicated into all 𝑁 spatial directions. Tradi- tional crystals, for example, are periodic in all three spatial dimensions with a small replication unit representing the symmetry of the macroscopic material. Also non- periodic systems can be described with a periodic ansatz. The unit cell has then to be large enough, that the local conditions are well described and the influence from the periodic replica vanishes. The domain of a periodic system is the generalised period length, which can always be chosen as an𝑁 -dimensional parallelepiped (a line, a par- allelogram and a parallelepiped for 𝑁 = 1, 2, 3, respectively). It contains all inform- ation about the macroscopic properties, in particular the potential and the solutions of the corresponding quantum-mechanical eigenvalue equation. Most importantly, the finiteness of the domain ensures the integrability of the Hilbert space. The wavefunctions of a particle in a periodic potential underlie boundary conditions, which are summarised as Bloch’s theorem. It can be derived in several ways like the common one considering the properties of translation operators or the abstract one employing 𝑁 -dimensional Fourier series. The derivation presented here is based on the properties of the translation group and connects the Bloch theorem with the gauge degree of freedom of the wavefunctions. Furthermore, some identities are lis- ted, which become important when the geometric phases are considered (Chap. 7). In the following paragraphs, a single quantum particle moving in a periodic poten- tial is assumed. Since the solutions of the Kohn-Sham equations fulfil this condition, Bloch’s theorem is compatible with the DFT. 19 2 Methods The direct lattice. A 𝑁 -dimensional crystal is defined by a set of 𝑁 linearly inde- pendent (direct) basis vectors {𝑹𝑖 , 𝑖{∑︁= 1, . . . , 𝑁 }. The par}allelepiped CR B 𝑟𝑖𝑹𝑖 | 𝑟𝑖 ∈ [0, 1] (2.20a) 𝑖 is the (direct) unit cell and the set {∑︁ } R B 𝑛𝑖𝑹𝑖 | 𝑛𝑖 ∈ ℤ (2.20b) 𝑖 is the (direct) lattice of the crystal. An alternative unit cell, the Wigner-Seitz cell, is defined as the set of points which are closer to 𝑹0 = 0 than to any other 𝑹 ∈ R. The Bloch theorem as a gauge function The physics of a charged particle in a crystal is determined by an electrostatic poten- tial𝑉 (𝒙) which is periodic onR. Consequently, the corresponding Hamilton operator 𝐻 of a non-magnetic system (𝑨 ≡ 0) is invariant under translations by a lattice vector 𝑹 ∈ R, which are represented by the unitary translation operator𝑈𝑹. 𝑈𝑹 = exp(−i 1ℏ 𝑹 〉· 𝒑), 𝑈 𝐻〉 (𝒑, 𝒙)𝑈−1𝑹 𝑹 = 𝐻 (𝒑, 𝒙 −〉𝑹) = 𝐻 (𝒑 , 𝒙〉)⇒ 𝐻 (𝒑, 𝒙) Ψ𝑛 = 𝐸𝑛 Ψ ⇔ 𝐻 (𝒑, 𝒙)𝑈 𝑛 𝑹 Ψ𝑛 = 𝐸 𝑈 𝑛 𝑹 Ψ𝑛 (2.21) Hence, if |Ψ𝑛⟩ is an eigenstate with eigenene rgy〉 𝐸𝑛, then𝑈𝑹 |Ψ𝑛⟩ is also an eigenstatewith the same energy. The effect of 𝑈𝑹 on Ψ𝑛 is thus equivalent to a gauge trans- formation (Eq. (A.98)). 𝑈𝑹 = exp(−i𝜒(𝒙,𝑹)) (2.22) The gauge function 𝜒(𝒙,𝑹) can be derived in two steps. Firstly, the group properties of 𝑈𝑹 tell that 𝜒(𝒙,𝑹) is linear in 𝑹, so it can be written as a scalar product between 𝑹 and a vectorial function 𝒇 (𝒙). 𝜒(𝒙,𝑹) + 𝜒(𝒙, 𝑺) = 𝜒(𝒙,𝑹 + 𝑺), −𝜒(𝒙,𝑹) = 𝜒(𝒙,−𝑹) ⇒ 𝜒(𝒙,𝑹) = 𝑹 · 𝒇 (𝒙) Secondly, because translations commute, 𝒇 (𝒙) has to be a constant vector. exp(−i 1 𝑹′ℏ ·[𝒑) exp(−i𝑹 ]· 𝒇 (𝒙)) = exp(−i𝑹 · 𝒇 (𝒙)) exp(−i 1 ′ℏ 𝑹 · 𝒑) ⇒ 0 = − 𝒇 (𝒙),− 1 Eq. (A.93)ℏ𝒑 = i 𝒇 ′(𝒙) ⇔ 𝒇 (𝒙) = 𝒌 ∈ ℝ𝑁 In summary, a crystal wavefunction which is translated by a lattice vector 𝑹 ∈ R alters by a phase shift of −𝒌 · 𝑹, where 𝒌 is a constant, 𝑁 -dimensional vector. This relation is called Bloch’s t heoretions: 〉m and the wave func〉tions |Ψ𝑛,𝒌⟩ are called Bloch func-𝑈𝑹 Ψ𝑛,𝒌 = exp(−i𝒌 · 𝑹) Ψ𝑛,𝒌 ∀𝑹 ∈ R (2.23a) Alternatively, in space representation: Ψ𝑛,𝒌 (𝒙 + 𝑹) = exp(i𝒌 · 𝑹)Ψ𝑛,𝒌 (𝒙) (2.23b) 20 2.2 The Bloch theorem The Bloch theorem proves that 𝒌 characterises the state |Ψ𝑛,𝒌⟩ like a quantum num- ber, i.e. states with different 𝒌 (up to a reciprocal lattice vector, vide infra) are mutu- ally 〈exclus ive. 〉 〈 Ψ 𝑛,𝒌 Ψ𝑛,𝒌′ = 〈Ψ𝑛,𝒌 −1 〉 〉 〈 𝑈 𝑈 Ψ ′ = Ψ 〉𝑹 𝑹 𝑛,𝒌 𝑛,𝒌 exp(i𝒌 · 𝑹) exp(−i𝒌′ · 𝑹) Ψ𝑛,𝒌′= Ψ𝑛,𝒌 Ψ𝑛,𝒌′ exp〈(i (𝒌 − 𝒌′) ·〉𝑹) ∀𝒌, 𝒌′ ∈ ℝ 𝑁 ,𝑹 ∈ R ⇒ Ψ𝑛,𝒌 Ψ𝑛,𝒌′ = 𝛿𝒌,𝒌′ (2.24) The continuity of 𝒌 motivates the nota〉tion o f the B〉loch states as functions of 𝒌:Ψ 𝑛,𝒌 → Ψ𝑛(𝒌) As the quantum-number nature of 𝒌 depends on the translation symmetry, it is also called crystal momentum, analogously to momentum conservation following from spatial homogeneity in classical mechanics. The reciprocal lattice. The Bloch phase 𝒌 ·𝑹 in Eqs. (2.23) is periodic on the addition of a vector 𝑮 ∈ ℝ𝑁 to 𝒌 with 𝑮 · 𝑹 = 2π𝑛, 𝑛 ∈ ℤ. A basis for the 𝑮 vectors is defined by a set {𝑮 𝑗} with: 𝑮 𝑗 · 𝑹𝑖 = 2π𝛿𝑖, 𝑗 , 𝑹𝑖 ∈ R, 𝑖, 𝑗 = 1, . . . , 𝑁 Or more compactly written as matrices: 𝐺 B (𝑮1, . . . ,𝑮𝑁 ), 𝑅 B (𝑹1, . . . ,𝑹𝑁 ) ⇒ 𝐺ᵀ · 𝑅 = 2π𝐼 ⇔ 𝐺ᵀ = 2π𝑅−1 The column vectors of 𝐺 are the reciprocal basis vectors {𝑮 𝑗 , 𝑗 = 1, . . . , 𝑁 }, in ac- cordance with their calculation fr{o∑︁m the (transposed) inverse of the matrix 𝑅. Theparallelepiped } CG B 𝑠 𝑗𝑮 𝑗 | 𝑠 𝑗 ∈ [0, 1] (2.25a) 𝑗 is the reciprocal unit cell and the se{t∑︁ } G = 𝑛 𝑗𝑮 𝑗 | 𝑛 𝑗 ∈ ℤ (2.25b) 𝑗 is the reciprocal lattice of the crystal. The Brillouin zone is the reciprocal analogon of the Wigner-Seitz cell: the set of points being closer to 𝑮0 = 0 than to any other 𝑮 ∈ G. Since the Bloch phase exp(−i𝒌 · 𝑹) is periodic on G, 𝒌 is only well defined within the reciprocal unit cell CG (or within the〉Bril louin z〉one).Ψ𝑛(𝒌 +𝑮) = Ψ𝑛(𝒌) ∀𝑮 ∈ G (2.26) 21 2 Methods Eq. (2.26) completes the classification of the eigenspace of a crystal Hamilton oper- ator: Every eigenstate |Ψ𝑛(𝒌)⟩ has an 𝑁 -dimensional, continuous quantum number 𝒌 ∈ CG and a band index 𝑛 ∈ ℕ. Consequently, the eigenenergies are continuous func- tions of 𝒌 ∈ CG. The band index 𝑛 is discrete for bound states, which exist in every condensed-matter system. Hence, a band with fixed 𝑛 describes a hypersurface in the 𝑁 + 1 dimensional 𝒌-energy space. As paper is only capable of displaying two- dimensional plots, a section of the hypersurface can be drawn by plotting the bands along a 𝒌-path in CG. The result is the band structure. 2.2.1 Basis changes The Bloch states |Ψ𝑛(𝒌)⟩ are unhandy in practical applications since they are non- periodic (Bloch theorem) and delocalized (they do not decay). These problems can be encountered by basis changes which, of course, do not affect the physics. There are different levels of transformations: those acting separately on each individual state |Ψ𝑛(𝒌)⟩, those mixing states with different 𝒌-vectors at a fixed band 𝑛, and those mixing different bands 𝑛 at a fixed 𝒌-vector. The two important bases presented here are the Bloch factors and the Wannier functions, which solve the above-mentioned problems about periodicity and localisation, respectively. The Bloch factors The first basis change exploits the quantum-number nature of 𝒌 and the gauge trans- formation rules in Eqs. (A.99). The application of a 𝒌-dependent phase to the Bloch functions transforms the stationary Schrödinger equation (2.21) into new eigenvalue equation for the Bloc h Ham〉iltonia n ℎ(𝒑,〉𝒙 | 𝒌), which 𝒌 e nters a〉s a parameter.𝑢𝑛(𝒌) B ?̃? 𝒌 Ψ𝑛(𝒌) = exp(−i𝒌 · 𝒙) Ψ𝑛(𝒌) (2.27a) ℎ(𝒑, 𝒙 | 𝒌) B ?̃?𝒌𝐻 (𝒑, 𝒙) −1 Eq. (A.99b)?̃?𝒌 〉 = 𝐻 (𝒑 +〉ℏ𝒌, 𝒙) (2.27b) ⇒ ℎ(𝒑, 𝒙 | 𝒌) 𝑢𝑛(𝒌) = 𝐸 (𝒌) 𝑛 𝑢𝑛(𝒌) (2.27c) In contrast to the original states |Ψ𝑛(𝒌)⟩, the Bloch factors |𝑢𝑛(𝒌)⟩ are periodic on R. This is a great simplification of the eigenvalue problem as techniques like Fourier series become applicable. On the other hand, the periodicity in 𝒌 on G is lost and phase-sh ifted b〉oundary conditions on CG hold instea〉d. That is ∀𝑹 ∈ R, 𝑮 ∈ G: 𝑈 𝑹 𝑢𝑛(𝒌) = 𝑈𝑹 exp(−i𝒌 · 𝒙)𝑈−1 𝑹 𝑈𝑹 Ψ𝑛(𝒌) 〉 (2.28a) Eq. (2.23) 〉 〉= exp(−i𝒌 · (𝒙 − 𝑹)) exp(−i〉𝒌 · 𝑹) Ψ𝑛(𝒌) = 𝑢𝑛(𝒌)〉𝑢𝑛(𝒌 +𝑮) = exp(−i (𝒌 +𝑮) · 𝒙) Ψ𝑛(𝒌 +𝑮) = exp(−i𝑮 · 𝒙) 𝑢𝑛(𝒌) (2.28b) The Wannier functions Since the Bloch functions are periodic in 𝒌 on G, they have an 𝑁 -dimensional Fourier series. The Fourier coefficients, which are indexed by the direct lattice vectors 𝑹 ∈ R 22 2.2 The Bloch theorem (Eqs. (A.94), but with swapp〉ed space∫s), are the Wannier functio〉ns |𝑎 (𝑹)⟩. 𝑎 ( ) 1𝑹 〉B∑︁ d𝑘 ex p(−i𝒌 〉· 𝑹) 𝑛 𝑛 |C | Ψ𝑛(𝒌) (2.29a)G CGΨ𝑛(𝒌) = exp(i𝒌 · 𝑹) 𝑎𝑛(𝑹) . (2.29b) 𝑹∈R With respect to the Bloch theorem, the 𝑹th Wannier function results from shifting all states of a fix〈ed band 𝑛〉 into the 𝑹th u∫nit cel〈l and ta Eq. (2.23) 1 king th〉e sum over 𝒌. 𝒙 𝑎 (𝑹) = 𝑛 |C | d𝑘 𝒙 − 𝑹 Ψ𝑛(𝒌) C 𝑎𝑛(𝒙 − 𝑹) (2.30)G CG Therefore, each band 𝑛 has one Wannier function 𝑎𝑛(𝒙) which is centred at 0 and defined on the entire space ℝ〈 𝑁 .〉This per∫mits the definit ion of a centre of charge:1 𝒙𝑛 = |C | d𝑥 𝒙 𝑎𝑛( 2𝒙) (2.31) R ℝ𝑁 In order to show that Eq. (2.31) is well defined, the existence of the integral and the gauge-invariance have to be proven. The space integral in Eq. (2.31) can be partitioned into a series of unit-cell integrals. The〈de〉finiti∑︁on of the∫cell-period ic Blo ch fa∑︁ctors (E∫q. (2.27a)) then yields:1 𝒙𝑛 = |∫C | ∑︁d𝑥 𝒙 𝑎𝑛( ) 2 1 2𝒙 = ∈R R CR+𝑹 ∈R |CR∫| d𝑥 (𝒙 − 𝑹) 𝑎𝑛(𝒙 − 𝑹) 𝑹 𝑹 CR 1 2 1 ∑︁ 2= |C | ∫ d𝑥 𝒙 ∫𝑎𝑛(𝒙 −R CR 𝑹∈R (∗) i ∫ 1 〈 𝑹) − |C〉R | 〈 d 𝑥 〉𝑹 𝑎𝑛(𝒙 − 𝑹)CR 𝑹∈R (2.32) = |C | d𝑘 |C | d𝑥 𝑢𝑛(𝒌) 𝒙 ∇ ′ 𝒙 𝑢 (𝒌′) 𝑘 𝑛 ′ G CG R C 𝒌 =𝒌 R i 〈 〉 = d𝑘 𝑢 (𝒌) |C | 𝑛 ∇ 𝑘 𝑢𝑛(𝒌)G CG In step∑︁(∗) , the Parse val ident∫ity (Eqs 〈. (A .95) an〉d (A.96)) was use∫d. ( − ) 2 1 ( ) 2 Eq. (2.27a) 1 〈 〉 ∑︁ 𝑎𝑛 𝒙 𝑹 = |C | ∫ d𝑘 〈 𝒙 Ψ𝑛 𝒌 〉 〈 = |C〉| d𝑘 𝒙 𝑢𝑛(𝒌) 2 𝑹∈R G CG G CG( − ) 2 −i𝑹 𝑎𝑛 𝒙∫ 𝑹 〈= |C | 〉 d𝑘 Ψ 〈 𝑛(𝒌) 𝒙〉 ∇𝑘 ′ 𝒙 Ψ𝑛(∫𝒌′) ′𝑹∈R G C 𝒌 =𝒌G Eq. (2.27a) −i 1 〈 〉 = |C | d𝑘 𝑢 (𝒌) 𝒙 ∇ ′ 𝑛 𝑘′ 𝒙 𝑢𝑛(𝒌 ) 𝒌′=𝒌 + |C | d𝑘 𝒙 𝒙 𝑢𝑛(𝒌) 2G CG G CG Eq. (2.32) is thus well defined because ⟨𝒙 | 𝑢𝑛(𝒌)⟩ is differentiable with respect to 𝒌. 23 2 Methods To prove the gauge invariance, the centre of charge is calculated for Bloch factors which are shifted by a 𝒌-dependent phase 𝜒(𝒌). In doing so, a variant of the Gauß identity is used telling that the volume integral of a gradient field equals the surface integral of the potential, 𝜒(𝒌) in this case. The surface integral can be evaluated with the restriction on 𝜒(𝒌) that its values on two opposite faces of CG must differ by in- teger multiples of 2π so as to retain boundary conditions of the Bloch factors (Eq. (2.28b)). 〉 〉?̃?𝑛〈(𝒌)〉 = e〈xp〉(−i𝜒(𝒌))∫𝑢𝑛(𝒌) ∫1 〈 〉 ∑︁𝑁 𝑹⇒ ?̃?𝑛 = 〈𝒙𝑛〉 + |C | d ∇ 𝑗 𝑘 𝑘𝜒(𝒌) = 𝒙𝑛 + d𝑠 · ∇𝑠𝜒(𝒔) G CG [0,1]𝑁 2π𝑗=1 = 𝒙𝑛 + 𝑹, 𝑹 ∈ R Thus, the centre of charge is only well defined modulo a direct lattice vector 𝑹. This is expectable as translating all Wannier functions by 𝑹 corresponds to a reindexing. 24 Part I: Rare-earth silicide nanowires on silicon surfaces Rare-earth silicide nanowires are intriguing examples of systems on the nanoscale. As the name suggests, they are derived from the rare-earth silicides, a compound of silicon and the rare-earth elements (REs), which is often addressed by the chemical formula “RESi2”, irrespectively of the exact stoichiometry. Horizontal thin streaks of these hard, brittle metals form on several silicon surfaces by self-organised growth, i.e. after RE deposition and successive annealing without further structuring meas- ures. As their cross sections extend to only a few nanometres, while their lengths can reach several 100 nm, these streaks are referred to as nanowires. As a result of the small lateral dimensions, quantum confinement effects are expected to dominate the electronic system. Hence, RESi2 nanowires are promising candidates for metallic systems of one-dimensional electronic character. The RESi2 nanowires establish a special class of nanowires as they combine a bunch of unique and peculiar properties. In contrast to many other nanowire systems (e.g. the not less interesting gold nanowires on Si(ℎℎ𝑘) [P13, P14]), RESi2 nanowires can be considered a cut-out of a bulk structure, the bulk RESi2 prototypes in this case. These cut-outs extend to heights and widths of a few unit cells and to lengths of hun- dreds of unit cells. Such a structure bears several advantages: Firstly, since the in- dividual streaks are subdivided into small building blocks, structure models of ar- bitrary widths, heights and lengths can be set up in principle. Secondly, the streaks incorporate a silicon network, which links to the substrate via strong Si–Si bonds and guarantees stability against delamination. Thirdly, there is a vast number of nanowire@silicon configurations because RESi2 occurs in two different phases and silicon provides surfaces of very different morphologies. For these reasons, RESi2 nanowires provide an inexhaustible number of possibilities to synthesise and ex- plore samples. The research began in the 1980s when scientists experimented with metallic silicides on pristine silicon. In particular, the integrated-circuit community was interested in these systems as they establish metal-semiconductor contacts. Since epitaxy permits the production of high-quality films, those silicides which grow epitaxially on silicon attracted the most scientific attention. The REs stand out from the silicide-forming elements as they produce very low Schottky barriers of 0.3 eV to 0.4 eV on 𝑛-type sil- icon, the smallest known for this substrate in the 1980s [32]. Subsequent experiments proved that the silicides of yttrium (Y, 𝑍 = 39) and all REs from gadolinium (Gd, 𝑍 = 64) to lutetium (Lu, 𝑍 = 71) are suitable for epitaxy on the Si(111) surface. This is due to a match between the structure of the silicon-rich RESi2 bulk phases and the geometry of the Si(111) surface: Both have a trigonal/hexagonal symmetry axis and 25 Rare-earth silicide nanowires on silicon surfaces similar lattice constants, which deviate by −2.55 % to +0.83 % [33]. The thickness of the RESi2 films on Si(111) can be tuned so that it spans only a few nanometres. In particular, RE coverages corresponding to one monolayer lead to the formation of a very thin and smooth silicide film with a thickness of one axial lattice constant of the respective RESi2 bulk phase [34]. Such a film has an electronic band structure which resembles that of a two-dimensional (semi-)metal with a Schottky barrier of ∼0.1 eV [35]. It is remarkable that the experimental outcomes for different REs are almost identical [P8, P12, 34–53]. The silicide epitaxy was also investigated on the Si(001) surface. In 1998, Kalka et al. prepared a clean silicon sample with dysprosium (Dy, 𝑍 = 66) at a coverage corres- ponding to one monolayer. Surprisingly and differently from the Dy@Si(111) system, subsequent annealing at a few 100 ◦C lead to the formation of rectified, rectangular islands instead of a smooth film [42]. Short after, Preinesberger et al. reduced the RE coverage to sub-monolayer and observed the formation of strongly anisotropic DySi2 structures. Because the lateral dimensions measured a few nanometres, while their lengths reached up to 200 nm, they referred to the structures as “nanowires” [54]. Not only did they report the first fabrication of RESi2 nanowires, but they also assessed the process to be quite simple as the nanowire formation was self-organised, con- trolled mainly by macroscopic parameters like the RE coverage and the annealing temperature. Since their discovery, RESi2 nanowires have been an active research field in which many groups published fabrication details, topographical data (e.g. STM), hypothetical structure models and electronic properties (e.g. ARPES or STS) [P3, P6, 54–67]. RESi2 nanowires do not only grow on Si(001), as in the pioneering work. Also vicinal Si(111) surfaces (Si(ℎℎ𝑘)) produce nanowires as the steps and terraces cause discon- tinuities in the monolayer film [P8, 64, 68–70]. This part of the thesis is dedicated to the theoretical exploration of RESi2 nanowires on the Si(557) surface (nanowire- RE@Si(557)). Although structure models have been proposed based on the RESi2 monolayer on Si(111) and experimental findings [64, 68, 70], the real structure is still unknown. Therefore, a major part of this work consists in setting up reasonable structure models, optimising them by means of DFT and evaluating their stability by means of ab initio thermodynamics. Once verified, the stable structure model is used to explain the experimental outcomes of this system, particularly the elec- tronic properties measured by ARPES (band structure and Fermi surfaces). Trivalent terbium (Tb, 𝑍 = 65) is employed as a representative RE as it produces the desired nanostructures in the experiment [70]. To a certain extent, the results are transfer- able to other trivalent REs, in particular dysprosium (Dy, 𝑍 = 66) and erbium (Er, 𝑍 = 68), as proven by experimental evidence [64, 68], and holmium (Ho, 𝑍 = 67), as it lies between Dy and Er in the lanthanoid series. The purpose of all investigations is answering the following questions: • Are the proposed structure models for the nanowire-Tb@Si(557) system stable? • Does the electronic structure of the nanowire-Tb@Si(557) system have (quasi-) one-dimensional electronic properties? 26 Rare-earth silicide nanowires on silicon surfaces Before the RESi2 nanowires are addressed, some related systems are considered at first. The respective insights will be absolutely crucial for understanding the physics of the nanostructures. In the first instance, the chemical background of the constitu- ent elements, the rare-earth elements and silicon, is collated in Chap. 3. The focus of this literature research is on the role of the 4f electrons in chemistry and the gen- eral properties of the silicide compound class. After that, the bulk phases of RESi2 are analysed in Chap. 4 since they are the three-dimensional bulk prototypes for the nanostructures. Attention is paid to how the characteristic Si vacancies affect the structural and electronic properties of the bulk phases. Furthermore, two important approximations which are employed in the subsequent chapters are validated: the frozen-core approach for treating the f electrons of Tb and the omission of the lattice dynamics in ab initio thermodynamics. Chap. 5 concerns the first nanostructure: the monolayer-Tb@Si(111) system, which is the surface prototype for the nanowire-Tb@Si(557) system. Contrarily to the nano- wires, the monolayer provides detailed experimental data on the lattice parameters. Therefore, it serves as a test system for the approach. Furthermore, although the RESi2 monolayers have already been treated in several experimental and theoret- ical works (vide supra), the peculiarities about the established structure model are not sufficiently understood. This in particular concerns the reason for the buckling direction of the covering Si honeycomb, which is inverse to that of the substrate. A detailed analysis of the structural and electronic properties illuminates the underly- ing physics of the monolayer and explains the peculiarities. The chapter concludes with a critical revision of the popular statement that mechanical strain is responsible for the details of the established structure model. The nanowire-Tb@Si(557) system is the matter of Chap. 6. The structure models are derived from those of the monolayer and optimised by means of DFT. The RE coverage of the surface is varied between dense and sparse so as to derive a corresponding phase diagram for the nanowire system. For the stable structure models the electronic properties are calculated. The chapter concludes with the proof of a unique dimensional crossover from 2D to 1D, which has never been observed in this system before. 27 3 Chemical background 3.1 The rare-earth elements The rare-earth elements are a group of 17 elements: scandium (Sc, 𝑍 = 21), yttrium (Y, 𝑍 = 39) and the 15 lanthanoids1. The latter are the first row of the f-block plus the first 5d element (inner transition metals), a series starting with lanthanum (La, 𝑍 = 57) and terminating with lutetium (Lu, 𝑍 = 71) [71, p. 51]. The rare-earth elements are often abbreviated to REE or, more like a chemical symbol, RE. The lanthanoids are often denoted by the chemical symbol Ln = [La−Lu]. Since the individual members of the REs are unfamiliar to most non-specialists, they are more than once addressed by their name, chemical symbol and atomic number 𝑍 in this section. The REs are very homogeneous concerning their chemical behaviour, while their physical properties (in particular optical and magnetic properties) differ consider- ably. This makes the REs important technology metals in nearly every high-tech branch. “Though there are no uniform classifications for rare-earth applications, the markets are commonly divided into nine sectors: catalysts, polishing, glass, phos- phors and pigments, metallurgy, batteries, magnets, ceramics, and others” [72]. A very prominent RE is neodymium (Nd, 𝑍 = 60) as the alloy Nd2Fe14B is the base for strong permanent magnets used for efficient electric motors and generators [73, p. 78]. The high spin multiplicity of gadolinium (Gd, 𝑍 = 64) is exploited in contrast agents for magnetic resonance tomography (MRT). The paramagnetic ions enhance the relaxation times of the surrounding protons (hydrogen atoms) and thereby in- crease the image contrast [73, p. 70]. Erbium (Er, 𝑍 = 68) is known from optical devices, e.g. erbium-doped fibre amplifiers: The shielded 4f levels of the Er3+ ions produce a near-infrared transition (from 𝐽 = 152 to 𝐽 = 13 2 ) with a long lifetime of ∼10 ms. Thus, most Er3+ ions in the fibre can be excited (“pumped”) so that optical signals passing the fibre are amplified by stimulated emission [74, 75]. The chem- ical similarity between the REs is due to analogous valence electron configurations (VECs) which all yield the oxidation state III in most compounds [18, p. 2230], as well as in aqueous solutions (Tab. 3.1). While this is expectable for scandium (Sc, 𝑍 = 21), yttrium (Y, 𝑍 = 39) and lanthanum (La, 𝑍 = 57), which represent the third column of the periodic table, the trivalence of the other 14 lanthanoids despite their consecut- ive atomic numbers is an “aperiodic” behaviour. This phenomenon is unique in the periodic table and due to the special role of the 4f electrons in chemistry. 1Most of the (older) literature refers to the elements from La to Lu as “lanthanides”. However, this term is misleading since the suffix “-ide” denotes anionic constituents (cf. oxide O2 – , fluoride F – ). In contrast, the suffix “-oid” indicates an instance similar to the word root. Therefore, the IUPAC recommends the term “lanthanoids” for the group La–Lu, including La and Lu [71, p. 52]. 28 3.1 The rare-earth elements The role of f electrons in chemistry The VEC can partially be derived from the shell model and the standard occupation rules which lead to the periodic table. These rules correctly predict a trivalent VEC of 4s23d1 and 5s24d1 for the non-lanthanoids scandium (Sc, 𝑍 = 21) and yttrium (Y, 𝑍 = 39), respectively. Furthermore, they predict that the first f shell, the 4f shell, is empty up to the alkaline earth metal barium (Ba, 𝑍 = 56, VEC = 6s2) and fills up from lanthanum (La, 𝑍 = 57) to ytterbium (Yb, 𝑍 = 70). Hence, the VEC of the 𝑛th lanthan- oid would be 6s24f𝑛, 𝑛 ∈ [1, 14], according to the periodic table. The termination of the lanthanoid series lutetium (Lu, 𝑍 = 71) would then be the first 5d-element with a VEC of 6s25d14f14. Since the complete 4f shell is expected to remain inert, Lu is correctly predicted to be trivalent with two s electrons and one d electron in the out- ermost shells, analogously to Sc and Y. The trivalence of the other lanthanoids is not obvious, in particular because the 4f levels are incomplete, except for ytterbium (Yb, 𝑍 = 70), and lie in the chemically relevant energy range near the Fermi level. Though, the energy levels and occupations of a shell are not the only conditions for its ability to participate in chemical bonding. In addition, the atomic orbitals have to spatially overlap with those of another atom to form molecular orbitals. However, the spa- tial distribution of the atomic 4f orbitals is enclosed by the shells with higher main quantum numbers, in particular the 6s shell, the 5d shell and the 5p shell. Hence, the 4f electrons could only participate in chemical bonding, if all outer shells did also. While this is the case for the 6s and 5d electrons, the 5p electrons are part of the xenon noble gas configuration and thus inert. As a consequence, they screen the 4f electrons from the environment and render them inert as well. Applying this to the lanthanoids from lanthanum (La, 𝑍 = 57) to ytterbium (Yb, 𝑍 = 70) would imply a divalent state from the two 6s electrons, not the observed trivalent state2. Obviously, the rules of the periodic table cannot explain the trivalence of the lan- thanoids. They even fail in predicting the correct atomic ground state for lanthanum (La, 𝑍 = 57), cerium (Ce, 𝑍 = 58) and gadolinium (Gd, 𝑍 = 64). The apparent con- tradictions are due to an oversimplification of the real electronic situation and can be resolved by considering the ionisation energies of different VECs. A detailed ana- lysis of this can be found in [75]. The key is that the atomic terms belonging to the divalent VEC (6s25d04f𝑛) and the trivalent VEC (6s25d14f𝑛−1) are energetically very close to each other for most of the lanthanoids. Therefore, one electron leaves the 4f shell if the drop in the electronic chemical potential, e.g. caused by an electronegative reaction partner, is large enough. The ability of the Ln2+ ions to donate the third elec- tron depends on the third ionisation energies of the isolated Ln atoms. These grow gradually from lanthanum (La, 𝑍 = 57) to europium (Eu, 𝑍 = 63) by +5.7 eV, show a sharp drop at gadolinium (Gd, 𝑍 = 64) by −4.3 eV, grow again from gadolinium (Gd, 𝑍 = 64) to ytterbium (Yb, 𝑍 = 70) by +4.4 eV, and finally drop at lutetium (Lu, 𝑍 = 71) by −4.1 eV [75]. The zigzag pattern is the manifestation of Hund’s second rule 2The terms “divalent” and “trivalent” denote the number of electrons in the outermost 6s and 5d shells plus the charge state of the atom/ion. For example, the VEC of the neutral Gd atom (6s25d14f7) is trivalent as well as that of the Gd3+ ion (4f7). On the other hand, the VEC of the neutral Yb atom (6s25d04f14) is divalent as well as that of the Yb2+ ion (4f14). This nomenclature makes sense as the only possibility for a formal valence electron not to take part in chemistry is to be part of the 4f shell. 29 3 Chemical background La Ce Pr Nd Pm Sm Eu Gd Tb Dy Ho Er Tm Yb Lu 2 2 2 (a) 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 4 (b) 4f0 4f1 4f2 4f3 4f4 4f5 4f6 4f7 4f8 4f9 4f10 4f11 4f12 4f13 4f14 Table 3.1: (a) “Valencies of the lanthanoids occurring in aqueous solution” [76]. (b) VEC of the trivalent Ln3+ ion. Empty, half-filled and filled 4f shells are bold. that the spin configuration of a specific shell tends to maximise its spin multiplicity. More precisely, parallel spins occupying orbitals with different magnetic quantum numbers reduce the total energy by their exchange energy. An overview over the behaviour of the atomic terms and the third ionisations energies of the lanthanoids can be found in the appendix (Sec. B.1). Hund’s second rule is very helpful for predicting the valences. The trivalent state is the most frequent for all lanthanoids (Tab. 3.1). If the electronic chemical potential is not too low, e.g. if the reaction partners are not too electronegative, an additional electron will go into the 4f shell to maximise the spin multiplicity. This is particularly relevant for europium (Eu, 𝑍 = 63) and ytterbium (Yb, 𝑍 = 70) since their divalent VEC provides a (half-)full 4f shell. For the left neighbours samarium (Sm, 𝑍 = 62) and thulium (Tm, 𝑍 = 69), the trivalent state is already quite stable, so the electronic chemical potential has to be high in order to push an additional electron into the 4f shell. On the other hand, if the electronic chemical potential is very low, cerium (Ce, 𝑍 = 58) and (less likely) terbium (Tb, 𝑍 = 65) can oxidise to tetravalent Ln4+ ions [18, pp. 2300]. The valence of the REs has a large impact on the properties of their compounds wherefore it is necessary to care about it. While isovalent RE com- pounds may differ only slightly in most properties, different valences usually lead to very different properties. A demonstrative example for this issue are the lanthanoid monochalcogenides, whose chemical formulae Ln(S, Se, Te) suggest divalence. How- ever, the properties of these compounds strongly depend on the atomic number of the employed lanthanoid. While the monochalcogenides of Sm, Eu and Yb are salt- like, those of the other lanthanoids are metallic. The reason for this is a deviation between the formal oxidation state II and the actual valence of the Ln ions: They are divalent in the salt-like compounds, described by the formula LnII(S, Se, Te) – II. In the metallic compounds, the Ln ions lose an additional 4f electron to the conduc- tion bands, whereupon their valence formally increments by +I and gives rise to an “electride” LnIII(S, Se, Te) – IIe – [76]. The lanthanoid contraction Although the 4f electrons are prevented from chemical bonding, their presence has some indirect consequences on the chemistry. Most notably, despite the valences of the REs are analogue, the atomic/ionic radii differ. For the first three REs without f-electrons (Sc, Y and La), the ionic radius increases, as expected for columns in the periodic table. On the contrary, the horizontal lanthanoid series from La to Lu shows 30 3.1 The rare-earth elements 220 Metal RE3+ ion, CN=6 RE3+ ion, CN=8 199.5 200 187.0 194.0182.5 182.0 181.4 181.0 180.2 178.7 180 176.3 175.2 174.3 173.4 172.4 171.8 160.6 160 177.6 140 130.0 128.3 126.6 124.9 123.3 121.9 120.6 115.9 119.3 118.0120 116.7 115.5 114.4 113.4 112.5 111.7 101.0 100 117.2 115.0 113.0 112.3 111.0104.0 109.8 108.7 107.8 106.3 105.2 104.1 103.0 102.0 100.8 100.1 80 88.5 Sc Y La Ce Pr Nd Pm Sm Eu Gd Tb Dy Ho Er Tm Yb Lu Figure 3.1: Atomic radii of the REs [17, Anhang IV]. Dashed lines between La and Lu highlight the lanthanoid contraction. CN: coordination number of RE3+ ions. the aperiodic trend of decreasing ionic radii for incrementing atomic numbers. The lanthanoid contraction follows from two effects: the relativistic orbital contraction (Sec. 8.1) and the filling of the core-like 4f shell: Because every additional 4f electron screens the increment of the nuclear charge only partially, the fifth and sixth shells of heavier REs feel steeper potentials whereupon they contract [17, p. 374]. This affects the radii of the isolated atoms (with analogue VEC) as well as the radii of the trivalent ions and the metallic radii (Fig. 3.1 (a)). The lanthanoid contraction is strong enough to fully compensate the radius increase from the fifth to the sixth period at holmium. Therefore, the light yttrium (Y, 𝑍 = 39) is sometimes indexed between holmium (Ho, 𝑍 = 67) and erbium (Er, 𝑍 = 68). The discontinuities of europium (Eu, 𝑍 = 63) and ytterbium (Yb, 𝑍 = 70) in the otherwise smooth contraction trend of the metallic radii are due to their divalent VECs in the metals. If REs compounds with analogous VECs are compared, the RE radius has the largest impact on the geometry. This can be very important in some cases, for example, if two structural phases are energetically close to each other. Small differences in the ion sizes, as caused by the lanthanoid contraction, then determine the favourability of the one or the other phase. This proves to be a critical point when investigating the RE silicides (Chap. 4). On the other hand, the smoothness of the lanthanoid contraction permits the structural tuning of some RE compounds by varying the employed RE. This is, for instance, utilised for the growth of RE-containing structures on surfaces. The lanthanoid contraction is the f-analogon of the d-block contraction, which ex- plains the otherwise unexpectedly small atomic radii of the post-transition metals (in particular those following the 3d-elements: Ga and Ge in group 13 and 14)3. The 3Analogously to the post-transition elements, the post-lanthanoids have unexpectedly small atomic radii. For the first post-lanthanoids, the f-block contraction is strong enough to fully compensate the incrementing main quantum number of the outer shells from the fifth to the sixth row. The group-4 elements zirconium (Zr) and hafnium (Hf) have almost identical ionic radii, as well as the group-5 elements niobium (Nb) and tantalum (Ta). As a consequence, naturally occurring minerals of these elements are often solid solutions between the respective two homologues. Niobium and Tantalum were identified as different elements in the mid of the 19th century. Hafnium was discovered even later in the 1920s, after having been predicted by the Bohr model [18, pp. 1809, 1831]. 31 Radius (pm) 3 Chemical background main difference between the f-block and the d-block contraction is that the lanthan- oids have the same trivalent properties in most compounds and the same-period d- elements not. In principle, the lanthanoid contraction should occur also in the actin- oids, whose 5f shell fills up. In fact, the radii of isovalent actinoid ions contract for incrementing atomic numbers. However, the trivalent state of the actinoids is less stable because relativistic effects makes the inner s orbitals collapse, which in turn expands the outer d and f orbitals. Consequently, the 5f orbitals can participate in the chemistry [17, 18, pp. 372ff, 2230, 2320f]. The homologues neodymium (Nd, 𝑍 = 60) and uranium (U, 𝑍 = 92) demonstrate this: While Nd is trivalent in most compounds, U tends to employ all valence electrons and assumed a main valence of VI [18, p. 2323]. Hence, the lanthanoid contraction is indeed unique in the periodic table. 3.2 Silicon and silicides Silicon (Si, 𝑍 = 14) is the second most abundant element in Earth’s crust (26.3 wt%), after oxygen (48.9 wt%) [17, p. 1063]. It belongs to the third row and 14th group, the carbon group. While carbon (C, 𝑍 = 6) is the basis of organic life on Earth, silicon is mainly known as the most important technology material as it is employed in the vast majority of integrated circuits (Sec. 1.1). However, silicon plays a role also in physiology, e.g. in the hair and nails of animals [17, p. 1062], in the cell walls of plants [77] and in the global oxygen generation (diatoms, “Kieselalgen”) [78]. Silicon shares several physical and chemical properties with its lighter homologue carbon. They have analogous VECs (C: 2s22p2, Si: 3s23p2) with four valence electrons to be paired for chemical bonding. The s and p orbitals tend to hybridise to four equivalent sp3-hybrid orbitals in tetrahedral geometry, which is the origin of the al- ternative name of the carbon group: the tetrels. In the elemental phases, the sp3 hybridisation leads to the diamond structure, a three-dimensional network in which each atom has four equivalent, 𝜎-bound neighbours4. This structure is a stable phase for both elements on a certain thermodynamic domain. On the other hand, the ele- mental phases reveal an important difference between C and Si: C adopts the graph- ite structure at normal pressures, a layered structure of flat honeycombs which stick together by vdW interactions. Therein, the C atoms are sp2-hybridised and mutually bind via sp𝜎 bonds in trigonal-planar geometry. The remaining p orbitals pointing out of the plane (conventionally denoted p𝑧) connect with each other to a delocal- ised system of p𝜋 bonds, which stabilises the individual sheets [17, pp. 997ff]. To a certain extent, these are even stable in isolated form, the graphene sheets [79], for which Andre Geim and Konstantin Novoselov were awarded with the Nobel Prize in Physics in 20105. In contrast to C, Si avoids the graphite structure as well as p𝜋 bonds 4In fact, there are two possibilities of fourfold coordinated networks. The classical diamond structure is based on the fcc lattice. Changing the layer stacking in the (111) direction from ABCABC to ABAB leads to a related structure: the hexagonal diamond or lonsdaleite, which is based on the hcp lattice. Being stable only under extreme thermodynamic conditions, lonsdaleite can be found in shockwave- treated graphite and meteorites [17, p. 1002]. 5All Nobel Prizes in Physics, https://www.nobelprize.org/prizes/lists/all- nobel- prizes- in- physics/ (visited on 04/03/2022). 32 3.2 Silicon and silicides in general. A demonstrative example of this are the properties of the oxides: While CO2 with double bonds is a stable, gaseous molecule ( O C O ), SiO2 polymerises to single-bonded [SiO4] tetrahedrons which connect via the O atoms to crystals under normal conditions [17, pp. 1078ff, 1099ff]. For the same reason, the Si analogon of graphene (“silicene”) is unstable as both flat (sp2) and buckled (sp3) honeycomb geo- metries contain unpaired electrons, which render the allotrope highly reactive. Zintl phases A way to stabilise honeycomb-Si geometries is to charge the Si atoms and render them isoelectronic to the right neighbour phosphorus (P, 𝑍 = 15). The sp2 hybridisa- tion is then still avoided, but the five valence electrons give rise to a sp3 configuration with three bonds and a non-binding electron pair. This allows the Si atoms to form a buckled honeycomb with closed shells. Alkali (MI) and alkaline earth metals (MII) can serve as electron donators since their electronegativities are smaller than that of Si by >0.7 (except Mg and Be, Allred-Rochow scale, [17, Tafel III]). This leads to a partially heterovalent compound MnSim, which belongs to the class of Zintl phases, named after Eduard Zintl, who first investigated intermetallic compounds with high elec- tronegativity differences. In 1963, W. Klemm and E. Busmann completed Zintl’s ideas to the extended Zintl-Klemm-Busmann concept (ZKB concept): When an alkali or al- kaline earth metal (non-noble component) reacts with a metallic/semiconducting p- block element (noble component), the valence electrons of the non-noble component are formally completely transferred to the noble one. Depending on the VEC after the transfer, the noble component pairs the residual electrons by inter-se bonding and forms the anionic partial lattice, into which the cations are embedded. The formal strict charge transfer is then relaxed by introducing covalent bonds between the free electron pairs of the anionic lattice and the empty valence orbitals of the cations [80]. Since having four right neighbours, Si provides a particularly large variety of anionic partial lattices (e.g. buckled honeycombs, zigzag chains, dumb-bells and isol- ated atoms, [17, p. 1070]). The ZKB concept can successfully explain some structural aspects of MnSim com- pounds and will play an important role in explaining the inevitable vacancies of the bulk phases of the RE silicides. Though, it does not predict the exact geometry and the properties of the resulting compounds. Depending on the electronegativity differ- ence of the reactants, the properties of Zintl phases can range from salt-like to metal- lic. For instance, alkali/alkaline earth metal chalcogenides (selenides and tellurides) are similar to the classical salts (oxides, halides) since the high electronegativity dif- ference (>1.0, except Mg and Be, Allred-Rochow scale, [17, p. 158]) implies a strong heterovalent (ionic) bonding. In particular, the reaction enthalpies are high and the compounds are brittle and isolating/semiconducting. On the contrary, if the elec- tronegativity difference is less pronounced, the bonding has a more covalent charac- ter. Consequently, the bandgap decreases and the compound resembles more a metal than a salt concerning the electronic and optical properties [17, 80]. Moreover, cova- lent bonding can influence the partial anionic lattice because the empty d orbitals of the heavier alkali/alkaline earth metal cations can be activated and exert a steric effect on their binding partners [17, pp. 400ff]. 33 4 The rare-earth silicide bulk phases The RE silicides have been investigated since the end of the 1950s [81–108]. In 1959, Perri et al. systematically analysed the “disilicides” of several REs by means of XRD [81]. They found that the lighter REs crystallise in the tetragonal ThSi2 structure, while the heavier REs (up to Dy) assume an orthorhombically distorted version of it. The clear trend is that the orthorhombic distortion increases with higher RE atomic numbers, corresponding to smaller ionic radii (lanthanoid contraction, Sec. 3.1). For the heaviest RE silicide investigated in that work, DySi2, the distortion is considerable with a 𝑏/𝑎 ratio of 97.8 %. Only Eu disrupts the otherwise monotone development as it adopts the undistorted, tetragonal ThSi2 structure. Since coinciding with the dis- continuity in the lanthanoid contraction, this is a clear evidence that Eu is divalent in its silicide while all other investigated REs are trivalent [81]. Subsequent work found that the orthorhombic distortion reversibly vanishes at temperatures of sev- eral 100 ◦C, so all investigated RE silicides assume the tetragonal ThSi2 structure as a high-temperature phase. On the other hand, the tetragonal silicides of Pr and Eu distort at −120 ◦C and −150 ◦C, respectively [82]. Mayer et al. [83] completed the list by the REs not investigated by Perri et al. They found that the silicides of Er and the heavier REs crystallise in the hexagonal AlB2 structure, which is quite different from the ThSi2 structure at first glance. Ho and all lighter REs join the findings of Perri et al. as they adopt tetragonal/orthorhombically distorted ThSi2 structure, depending on the ionic radius. The silicide of Y was found to adopt both structures depending on the preparation temperature, in accordance with its ionic radius lying between those of Ho and Er [83]. This dimorphism motivated the same group to test whether the other RE silicides show the AlB2/ThSi2 transition as well. In fact, the silicides of Gd, Tb, Dy, Ho and Y can assume either the tetragonal/orthorhombic ThSi2 structure or the hexagonal AlB2 structure, the latter apparently being the low-temperature phase. The REs lighter than Eu show only the tetragonal/orthorhombic ThSi2 structure, and the heavier REs Er, Tm and Lu show only the hexagonal AlB2 structure [85]. The aim of this chapter is to illuminate the microscopic reasons for the peculiar be- haviour of the RE silicides and to investigate their structure. At first, the prototypes (ThSi2 and AlB2) are systematised in a novel and unconventional manner. This helps to understand how the structures respond to specific variations, in particular con- cerning vacant Si sites, which are inevitably present in all RE “disilicides” and which have not been sufficiently investigated. Then, different structure models are optim- ised and their stability is analysed by ab initio thermodynamics. The following points are addressed: the stoichiometric structure models, the role of the 4f electrons, the lattice dynamics and the Si vacancies. The findings on the structures are confirmed by band structure analyses. The chapter concludes with a discussion of the mechan- isms which determine the structure of the RE silicides. 34 4.1 Structure prototypes 4.1 Structure prototypes The RE silicide structures can be classified into two prototypes: the ThSi2 structure (Fig. 4.1 (a)) and the AlB2 structure (Fig. 4.1 (b)). Both are characterised by a skeleton of mutually threefold coordinated Si atoms in trigonal-planar geometry, denoted M- stars, into which the RE atoms are embedded. The simpler realisation of this is the AlB2 structure. It consists of parallel, congruent sheets of flat-honeycomb-Si between which hexagonal layers of RE atoms are embedded at hole position. The honeycomb sheets can also be viewed as staggered rows of M-stars, as indicated by blue shad- ows in (Fig. 4.1 (b)). The space group is 𝑃6/𝑚𝑚𝑚 (No. 191) and the atoms occupy the Wyckoff positions1 RE → 1𝑎 and Si → 2𝑑, so the conventional unit cell contains one formula unit RESi2. The ThSi2 structure is the alternative realisation. It can be derived from the AlB2 structure by rotating the M-stars of every second row by 90◦ and staggering the rows to form a three-dimensional network. The RE atoms occupy the interstices and are visible from both views along the basis. The space group is 𝐼41/𝑎𝑚𝑑 (No. 141) and the atoms occupy the Wyckoff positions RE→ 4𝑏 and Si→ 8𝑒, so the conventional unit cell contains four formula units RESi2. In order to clarify the relationship between the two structures, consider the building blocks which both of them have in common [88]. These building blocks are cuboids of width 𝑥, depth 𝑦 and height 𝑧 with Si atoms at the vertices and an RE atom in the centre (orange boxes in Fig. 4.1). Their stacking in the z-direction determines the structure. If stacked in a zigzag manner, i.e. alternately translated by ±𝑥2 in the 𝑥- direction, the b√locks form the AlB2 structure. 𝑥 is then the lattice constant 𝑎 ([11.0] direction), 𝑧 is 3𝑎/2 for a perfect hexagonal symmetry ([1̄1.0] direction), and 𝑦 is the lattice constant 𝑐 ([00.1] direction). Alternatively, the blocks can be stacked in a helical manner, i.e. alternately translated by ±𝑥2 in the 𝑥-direction and ± 𝑦 2 in the 𝑦-direction, which result is the ThSi2 structure. 𝑥 is then the lattice constant 𝑎 ([100] direction), 𝑦 is 𝑏 ([010] direction) and 𝑧 is 𝑐/4 ([001] direction). The building blocks demonstrate a further structural aspect concerning the ways how the Si atoms at the vertices are connected. The splices between two block faces consist of parallel rows of zigzag-chain-Si (different shades of blue in Fig. 4.1) which accrete to zigzag planes. In the AlB2 structure, all zigzag chains point into the same [11.0] direction. In the ThSi2 structure, the zigzag planes are alternately oriented in the [100] direction (light blue) and the perpendicular [010] direction (dark blue). Silicon vacancies The stoichiometries of real RE silicides deviate considerably from RESi2 in that Si atoms are missing. Hence, the structures in Fig. 4.1 should be considered only as pro- totypes. The empty Si sites, the vacancies, in the real structures are unordered2and their concentration depends on the fabrication technique, the temperature and the atomic number of the RE. As the real stoichiometries range between RESi1.6 and 1The numbers preceding the letters denote the multiplicity of the positions. 2In thin RESi2 films of the AlB2 phase, w√hich √are epitaxially grown on Si(111), the vacancies are ordered in-plane and give rise to a R30◦( 3 × 3) superstructure [36, 37, 39, 46, 50–52]. 35 4 The rare-earth silicide bulk phases (a) γ c c z y x a b 90° [100] [11.0] [010] [00.1] (b) z y x γ 90° b a c Figure 4.1: Overview over the RESi2 structure prototypes. (a) ThSi2 prototype; (b) AlB2 pro- totype. Blue circles are Si atoms; Yellow circles are RE atoms; Circle sizes indicate the out-of- plane positions. Left and centre: Views along perpendicular crystallographic axes. Red lines mark conventional unit cells. Blue shadings mark threefold coordinated Si atoms in the Si sublattice (M-stars). Right: Stacking of the cuboid building blocks (orange lines). Light and dark blue Si atoms highlight Si zigzag planes. (a) (b) 2 1 2 [100] 1 2 1 b 2 c a 1 a [11.0] Figure 4.2: Possible configurations of ordered Si vacancies in RESi1.67. (a) ThSi2; (b) AlB2. Col- ouring: Fig. 4.1. Solid orange circles mark the first set of vacancies, labelled dashed orange circles possible second sets. Green lines mark the supercells. 36 AlB2 structure ThSi2 structure [‾11.0] [001] [001] [‾11.0] zigzag-planes Building blocks zigzag-planes 4.1 Structure prototypes RESi1.9 [81, 85–93], the RE “disilicides” should always be referred to as RESi2 – x, where 𝑥 is the number of Si vacancies per RE atom. A value of 𝑥 = 0.33, as present in many RE silicide preparations, implies that every sixth Si site is vacant. The large number of vacancies affects the Si sublattice and, thus, the entire prototype in a considerable, but predictable manner: Since experimental diffraction patterns do not indicate any long-range ordering, the vacancies have to be assumed statistically distributed. A uniform distribution retains the symmetries of the prototypes (AlB2 → hexagonal, ThSi2→ tetragonal), while a non-uniform distribution is likely to distort them. In de- tail, removing a Si atom makes the three neighbours of the respective M-star move inwards, towards the vacant site. This combined with the topography of the zigzag chains suggests how a given vacancy distribution alters the lattice constants. Fig. 4.2 shows exemplary snapshots of ordered vacancy configurations (orange circles) for the ThSi2 and the AlB2 structure in RESi1.67 stoichiometry. The conventional unit cell of the ThSi2 structure contains four alternately orthogonal zigzag planes, of which every second is equivalent due to the body-centred unit cell (highlighted by light and dark blue circles in Fig. 4.2 (a)). If the vacancies populate both inequivalent zigzag planes to a similar extent (i.e every sixth Si site is vacant in both the light and dark blue zigzag chains, not shown in Fig. 4.2), the basal lattice constants 𝑎 and 𝑏 are expected to shrink equally. Also the axial lattice constant 𝑐 will shrink as the zigzag planes move closer together. In summa, the cell volume will con- tract while retaining the tetragonal symmetry of the prototype. The situation changes if the vacancies concentrate in one of the two inequivalent zigzag planes (e.g. every third Si site is vacant in the light blue zigzag chains, Fig. 4.2 (a)). Then, 𝑎 is expected to shrink, as well as 𝑐, while 𝑏 will remain unchanged. In summa, the structure will undergo an orthorhombic distortion along with a reduction of the symmetry. Such an effect was confirmed in TbSi1.67 by Schobinger-Papamantellos et al., who meas- ured a distortion of ≈2 % (neutron diffraction, [89]). If the vacancies populate the zigzag planes of the AlB2 structure to an equal extent (configuration 1 in Fig. 4.2 (b)), the basal lattice constants 𝑎 and 𝑏will equally contract while retaining the hexagonal symmetry of the prototype. As a response to the squeezing of the basis, the interstitial RE atoms will push the silicon layers apart and increase the axial lattice constant 𝑐. On the other hand, if the vacancies concentrate in one of the two inequivalent zigzag planes (e.g. configuration 2 in Fig. 4.2 (b)), an orthorhombic distortion is expected as 𝑎 and 𝑏 shrink differently. Such an effect was observed in hexagonal ErSi1.67 [91] and TbSi1.67 [93] by Auffret et al. (neutron diffraction). However, the distortion of the AlB2 structure (<0.5 %) seems to be much weaker than that of the ThSi2 structure. While the vacancies underlie a probability distribution in most RESi2 – x, Yb3Si5 c√an ad√opt the ordered version of configuration 1 in the bulk, as indicated by a R30◦( 3 × 3) superstructure (light green unit cell) (XRD, [87]). As a final remark on the stoichiometry, the nomenclature of the RE silicides is quite chaotic in the literature. Some publications refer to them as RESi2, irrespectively of the large number of vacancies. Others refer to the silicides with a ThSi2-derived struc- ture as RESi2 and to those with an AlB2-derived structure as RE3Si5. Although this agrees with the relative positions in the temperature-composition phase diagrams (vide infra), the stoichiometries are still incorrect. Moreover, it is rather confusing 37 4 The rare-earth silicide bulk phases that the orthorhombically distorted ThSi2 structure is often referred to as the GdSi2 structure. On the one hand, the structure-determining vacancies are not ordered in GdSi2 – x, so it has to be related to the ThSi2 structure anyway. On the other hand, GdSi2 is dimorphic and can adopt the AlB2 structure as well, depending on the preparation conditions. In order to avoid misunderstandings, the following nomenclature is used in this work. All RESi2 – x structures are assigned to one of the two prototypes: the AlB2 phase or the ThSi2 phase. If relevant, the presence or absence of orthorhombic distor- tion is explicitly mentioned by a symmetry tag (“hex”: hexagonal, “tet”: tetragonal, “ort”: orthorhombic). The chemical formula is RESi2 – b for the AlB2-type silicides and RESi2 – a for the ThSi2-type silicides (as introduced by [102]), where 𝑏 is (slightly above) 0.33 and 𝑎 between 0.33 and 0. For example, orthorhombically distorted ThSi2-type terbium silicide is denoted ort-ThSi2-TbSi2 – a and hexagonal AlB2-type erbium silicide is denoted hex-AlB2-ErSi2 – b. The structure-neutral chemical formula is RESi2 – x. The phase diagrams of the RE–Si systems The silicides of the heavy REs always contain vacant Si sites, as the temperature- composition phase diagrams of the RE–Si systems prove [102–107]. Summarising re- views on the phase diagrams can be found for Gd, Tb, Dy, Ho and Er [109–113]. This group is simply denoted by RE in the following paragraphs. Fig. 4.3 shows a sketch of the silicon-rich half of the phase diagram of the Tb–Si system, to which those of Gd, Dy, Ho and Er are similar. It furthermore contains a table with the respective peritectic and eutectic temperatures. The solubility of RE in Si and Si in RE is negligible (<1 at%) [102–104]. Several com- pounds RExSiy exist, whose melting points are higher than those of elemental hcp-RE and diamond-Si (𝑇REm > 1300 ◦C and 𝑇Sim = 1414 ◦C). All RE–Si systems have a com- pound at a Si content of 50 at% (RE1Si1), which melts congruently at a high temperat- ure (𝑇RESim ≈ 1850 ◦C). On the Si-rich side, all RE–Si systems have a compound at a Si content of ≈62.5 at%, which transitions by a peritectic reaction into solid RE1Si1 and liquidus at a peritectic temperature of𝑇1p ≈ 1650 ◦C . This compound is the AlB2 phase with a stoichiometry of RESi2 – b (𝑏 ≈ 0.33, RESi1.67 or RE3Si5). In the Er–Si system, AlB2-ErSi2 – b forms a eutectic with diamond-Si at a Si content of 81 at% and a eutectic temperature of 𝑇e = 1210 ◦C. The Ho–Si, Dy–Si, Tb–Si and Gd–Si systems, in contrast, have an additional, peritectic compound at a Si content between 62.5 at% (RESi2 – b) and 66.7 at% (RESi2). The peritectic temperature 𝑇2p decreases monotonously from Gd (1601 ◦C) to Ho (1290 ◦C). This compound is the ThSi2 phase with a stoichiometry of RESi2 – a (𝑎 ∈ [0, 0.33], lower vacancy density than in AlB2-RESi2 – b). Similarly to AlB2-ErSi2 – b in the Er–Si system, ThSi2-RESi2 – a forms a eutectic with diamond-Si at Si contents between 82 at% and 87 at% and eutectic temperatures of 𝑇e ≈ 1200 ◦C. Both the AlB2 and the ThSi2 phases show a reversible dimorphism (orthorhombic distortion) at lower temperatures, in accordance with [82, 89, 91]. Furthermore, the RESi2 – x phases have narrow homogeneity ranges of ≈1 at% [87, 104]. Several conclusions can be drawn from the phase diagrams. The first is that the form- ation of the AlB2 and ThSi2 phases depend rather on the initial compositions than on the temperature. In particular, the initial Si concentration of 62.5 at% is a critical value beyond which AlB2-ErSi2 – b is saturated with silicon [87], and the silicon-richer 38 4.1 Structure prototypes TbSi Tm liquidus RE 𝑇1p (◦C) 𝑇2 ◦ ◦p ( C) 𝑇e ( C)1 Si Tp Tm Gd 1664 1601 1226 2 T Tb 1638 1517 1222p Dy 1635 1437 1224 Ho 1620 1290 1210 Te Er 1620 — 1210 50% Si Composition 100% Si Figure 4.3: Left: Schematic phase diagram of the Tb–Si system. Adapted from [111]. Right: Peritectic and eutectic temperatures of some RE–Si systems [109–113]. ThSi2-RESi2 – a forms beside AlB2-RESi2 – b for the REs lighter than Er [88, 89, 93]. The findings of Mayer et al. that ThSi2 seems to be the high-temperature phase and AlB2 the low-temperature phase [85] can be explained by the incompleteness of the reac- tion if the temperature during the preparation is too low [108]. The second conclusion concerns the peritectic transitions. For example, consider an initial preparation of 33 at% RE and 67 at% Si (RESi2). If the sample heats up to a temperature between 𝑇1p and 𝑇2p (∼1600 ◦C), AlB2-RESi2 – b crystals form beside a RE/Si melt on the liquidus curve. Slow annealing drives the transition across the peritectic line 𝑇2p whereupon the AlB2-RESi2 – b crystals vanish and solid ThSi2-RESi2 – a forms plus solid excess Si. Such peritectic transitions require a particular care about the temperature variation during annealing in order to obtain the desired single phase silicides. If the tem- perature drops too fast, the product will contain considerable amounts of residual AlB2-RESi2 – b which is kinetically inhibited from transitioning into the equilibrium (ThSi2-RESi2 – a plus diamond-Si). Moreover, the peritectic and eutectic temperatures𝑇1p , 𝑇2p and𝑇e show an interesting dependence on the RE atomic numbers. While𝑇1p and𝑇e vary only slightly,𝑇2p shows a rapid drop from Gd to Ho. In detail,𝑇2p lies short below𝑇1p in Gd–Si and approaches𝑇e in Ho–Si. If the trend was extrapolated from Ho to Er, 𝑇2p would drop below𝑇e, which coincides with the non-existence of ThSi2-ErSi2 – a. The behaviour of the peritectic temperatures also confirms that the RE radius does not directly influence the stability of the AlB2 structure, as stated in [85], but rather the stability of the ThSi2 structure. The latter becomes unstable if the ratio between the atomic radii of RE and Si is too small [84]. If the trend was extrapolated to the REs lighter than Gd, 𝑇2p would rise beyond 𝑇1p , which implies that the ThSi2 phase melts congruently. This is indeed the case for Sm–Si and Nd–Si [114, 115] (Eu left out because of its anomalous valence). Hence, Gd, Tb, Dy and Ho are a unique quadruple concerning the peritectic transition between ThSi2-RESi2 – a and AlB2-RESi2 – b. Valence considerations In early studies, the wrong assumption of vacancy-free, stoichiometric RESi2 lead to the idea that the silicon sheets in the hex-AlB2 structure are analogous to the 39 Temperature TbSi TbSi2−b TbSi2−a diamond-Si 4 The rare-earth silicide bulk phases carbon sheets in graphite, including the presence of delocalised 𝜋-electron systems. For instance, Mayer et al. calculated a Si–Si bond distance of 2.16 Å from the basal lattice constants of hex-AlB2-LuSi2. That the result is shorter than the single bond length in diamond-Si (2.35 Å) indicates the presence of sesqui or double bonds. The ratio between the two values (0.919) seems to confirm this as it is similar to the bond-distance ratio of graphite-C and diamond-C (0.921) [83, 85]. However, as will be proven, the Si–Si bond distance in hex-AlB2-RESi2 is close to that in diamond-Si. Only the contraction of the basis due to the vacancies is such that a corresponding vacancy-free honeycomb-Si sheet produces the above ratio by coincidence. Magaud et al. were the first who theoretically investigated vacancy-populated YSi2 with DFT (RE ionic radius similar to Ho and Er, no f electrons, [96]). It turned out that the stoichiometry is decisive for the agreement between theoretical and experimental lattice constants. Relatively to AlB2-YSi2, AlB2-Y3Si5 (ordered vacancies, configura- tion 1 in Fig. 4.2 (b), light green unit cell) has an inverted 𝑐/𝑎 ratio as the basis of the unit cell shrinks and the axial lattice constant grows (as predicted, vide supra). The authors concluded that the vacancies allow the Si honeycomb to “release a part of the constraint imposed by the two adjacent Y planes”, i.e. the vacancies are due to mechanical strain [96]. However, as Zavala Aké et al. pointed out, both AlB2-YSi2 and AlB2-Y3Si5 are unstrained zero-pressure structures and the Si–Si distances of respect- ively 2.37 Å and 2.44 Å are quite close to that in diamond-Si (2.35 Å) (DFT, [100]). This indicates that the Si atoms are singly bonded in both AlB2-YSi2 and AlB2-Y3Si5 and that the removal of Si atoms causes the basal contraction, not vice versa. The electronic density of states (DOS) gives an important hint at the true reason for the stability of the vacancy-populated phases: The DOS of AlB2-YSi2 shows a peak at the Fermi level, which is not present in AlB2-Y3Si5 [95, 100]. Obviously, the Si vacan- cies partially deplete the conduction band and thereby reduce the total energy. In order to find a simple explanation for this, the system will be illuminated within the ZKB concept (p. 33). Since the anionic lattice of vacancy-free AlB2-YSi2 consists of Si honeycomb sheets, the Si atoms have a formal valence of – I. However, each RE3+ cation donates three electrons into the system, of which two are accepted by two Si atoms and one is spare. The compound formula can thus be written as an electride REIIISi – Ie –2 , indicating that one electron per RE atom occupies the conduction band. The removal of a Si atom primarily reduces the number of acceptors by one. On the other hand, having lost their bonding partner, the three Si atoms adjacent to the va- cant Si site can each accept an additional electron to complete the shells. In summa, the capacity of the anionic partial lattice increases by two electrons for each vacancy. The compound formula for RE3Si5 is thus REIII3 Si −I −II − 2 Si3 e , so the number of conduc- tion band electrons per RE atom is reduced to 1/3. Hence, the vacancies are expected to improve the charge balance, which will be proven later. According to the ZKB concept, the bonding in the vacancy-populated AlB2 structure is as follows: The two Si – I atoms per unit cell bind each to the three Si – II atoms via in- plane sp2-hybrid orbitals. The free electron pair of each Si – I atom thus occupies the out-of-plane p orbital. The three Si – II𝑧 atoms per unit cell bind each to the two Si – I atoms in an angled geometry. They are presumably sp3-hybridised and use the two in-plane hybrid orbitals for the Si–Si bonds. The other two hybrid orbitals, which are inclined out of the Si plane towards the vacancy, accept the two free electron pairs. Of 40 4.2 Structure optimisation course, the charge transfer is formal and bonds have to be established between the free electron pairs of the Si atoms and the empty orbitals of the RE atoms. While the former have the above-mentioned geometry, the geometry of latter can be neglected in first approximation as the s and d orbitals can hybridise in a quite flexible manner [17, p. 396]. The electronegativity difference between Si and the REs from Gd to Er (Δ𝜒 = 1.74−1.11 = 0.63, Allred-Rochow scale [17, Tafel III, Tafel V]) suggests a partial ionic character of ca. 10 % [17, p. 158]. Thus, a covalent bonding between the RE-sd orbitals and the Si-sp/Si-p𝑧 orbitals is expected to dominate the chemistry. 4.2 Structure optimisation The preliminary considerations about the structure of bulk RESi2 – x are inspected by means of DFT (Sec. 2.1). For this purpose, several structure models based on the AlB2 and ThSi2 prototypes are set up in suitable unit cells and optimised with respect to the cell vectors and the ionic positions (mechanical equilibrium). After that, the ther- modynamic properties are calculated in order to determine the most stable phases under given conditions. The focus of the investigations is on the silicides of Tb since Tb lies in the mid of the quadruple from Gd to Ho and is employed for the nanostruc- tures in Chap. 5 and Chap. 6. At certain points, the TbSi2 – x are compared to ErSi2 – x as it is the lightest RE not assuming the ThSi2 phase. 4.2.1 Methodological details Volume relaxation All bulk phases in this work are considered to be in mechanical equilibrium with the vacuum, so the zero-pressure volumes of the structure models have to be found. The first method to conduct the volume relaxations employs the code-level relaxation routines which optimise the atomic positions, the volume and the shape of the unit cell simultaneously. This is a quick way, but some systematic errors limit the reliab- ility of the results. On the one hand, Pulay stress arises, i.e. artificial stress from the link between the unit-cell volume and the Fourier sampling. This can be reduced by iterative restarts of the relaxation. On the other hand, the results depend on the way how the minimum is approached, which is due to the finite stop criterion (energy dif- ference, forces, etc.). In particular, starting from volumes smaller and larger than the optimum can result in different unit cells. These errors are avoided if sample unit cells are optimised at fixed volumes and the Murnaghan equation of state is used [116]. In detail, the free energy 𝐹 (𝑉,𝑇, 𝑁•), depending on the volume 𝑉 , the temper- ature 𝑇 and t(he p)article number 𝑁• of the atomic spe[cies •, i(s m)inim]ised.𝐾′ ( ) 𝜕𝑝 𝐾 𝑉 1 𝑉= − = + ′ ⇒ ( ) = + 0 · 0𝐾 𝑝 𝑉 𝐾0 𝐾 𝑝 𝐹 𝑉 𝐹0 ′ ′− + 1 − 𝐾0𝑉0 ′− (4.33)𝜕𝑉 𝑇 𝐾 𝐾 1 𝑉 𝐾 1 where 𝐾 (𝑝) is the compressive modulus at 𝑝, 𝐾0 = 𝐾 (0) and 𝐾 ′ = 𝐾 ′(0). 41 4 The rare-earth silicide bulk phases The function 𝐹 (𝑉 ) is fitted to the samples (𝑉, 𝐹 (𝑉 )), whereupon the fit parameters finally yield the minimum (𝑉0, 𝐹0) as well as the bulk mechanical properties 𝐾0 and 𝐾 ′. As the prerequisite of Eq. (4.33) is the linear behaviour of 𝐾 (𝑝), the Murnaghan equation of state is valid only on a small region around the minimum. Thermodynamical framework Since this chapter is preparatory work for the nanostructures on silicon surfaces, the investigations concern only the Si-richest bulk RESi2 – x phases under Si-rich con- ditions. In other words, the most stable structures are searched for a given amount of RE and an infinitely large reservoir of Si provided by bulk diamond-Si. This translates into the thermodynamic boundary conditions that the pressure 𝑝, the temperature 𝑇 , the RE amount 𝑁RE and the Si chemical potential 𝜇Si are constant. The situation is best described by a thermodynamical potential 𝐺RE(𝑝,𝑇, 𝑁RE, 𝜇Si) derived from the internal energy𝑈 (𝑉, 𝑆, 𝑁RE, 𝑁Si) by Legendre transformation. 𝐺RE(𝑝,𝑇, 𝑁RE, 𝜇Si) = 𝑈 (𝑉, 𝑆, 𝑁RE, 𝑁Si) + 𝑝𝑉 − 𝑇𝑆 − 𝜇Si𝑁Si (4.34) where 𝑉 is the volume, 𝑆 the entropy, 𝑁• the particle number of the atomic species • and 𝜇• the respective chemical potential. Under given boundary conditions, those structures are stable which minimise 𝐺RE. According to Euler’s theorem, 𝐺RE com- prises only the term 𝑁RE𝜇RE and can easily be transformed into an intensive quantity by dividing by 𝑁RE. The remainder is then identical to the RE chemical potential. 1 𝜇RE𝑁RE 𝐺RE(𝑝,𝑇, 𝑁RE, 𝜇Si) = = 𝜇RE(𝑝,𝑇, 𝜇Si) (4.35) 𝑁RE 𝑁RE The boundary conditions immediately replace some of the variables in Eq. (4.34) with known quantities. The structure models determine {𝑁•}. 𝜇Si is fixed to 𝜇d-SiSi , the Si chemical potential of diamond-Si. The 𝑝𝑉 terms are zero because the struc- ture models are relaxed into their zero-pressure volumes (vide supra). Concerning 𝑆 and 𝑇 , an approximation will be made, although its validity is not obvious a pri- ori: The internal energy𝑈 is replaced by the DFT total energy 𝐸cellDFT and the 𝑇𝑆 terms are omitted. This is based on the close relatedness between the RESi2 prototypes, which are both characterised by cages of mutually three-fold coordinated Si atoms, into which the RE atoms are embedded. Therefore, the vibrational contributions to 𝑈 and 𝑆 are expected to be similar in different RESi2 – x structures, so the 𝑇𝑆 terms cancel out in energy differences. This is a common practice when related structures are compared at low temperatures [53]. The validity of the 𝑇𝑆 cancellation will be tested on two samples in Sec. 4.2.4.(With these repl)acements, Eq. (4.35) reduces to: ≈ 1 cell d-Si hcp-RE𝜇RE 𝐸 𝑁 DFT − 𝜇Si 𝑁Si − 𝜇RE (4.36) RE Referring to hcp-RE𝜇RE 𝜇RE , the RE chemical potential of bulk hcp-RE, is necessary be- cause only chemical potential differences have a physical meaning. As the reference is arbitrary, hcp-RE𝜇RE could be substituted by the RE chemical potential of other phases, e.g. by that of gaseous RE atoms. In the present form, Eq. (4.36) equals the free 42 4.2 Structure optimisation formation enthalpy of RE + (2 – x) Si −−→ RESi2 – x under Si-rich conditions. As 𝜇RE is intensive, structure models of different sizes and, particularly, containing different numbers of RE and Si atoms can directly be compared regarding their stability. In detail, if a set of RESi2 – x structures is given, only those are in chemical equilibrium with diamond-Si which minimise hcp-RE𝜇RE. The 𝜇RE term cancels in 𝜇RE differences, which shows that the reference is indeed arbitrary. If, in addition, the stoichiometry of the two structures is equal, also the 𝜇d-SiSi term cancels, so knowledge about any referential chemical potentials is not necessary in this case. A major part of the in- vestigations concerns the question whether the AlB2 phase or the ThSi2 phase is more stable under certain conditions (e.g. the Si vacancy density or the theoretical treat- ment). Therefore, it is instructive to define the difference: Δ𝜇T−ARE ≔ 𝜇 ThSi2 AlB2 RE − 𝜇RE (4.37) where 𝜇ThSi2RE and 𝜇 AlB2 RE are the RE chemical potentials of the most stable ThSi2-RESi2 – x and AlB2-RESi2 – x structure models at a certain 𝑥. Hence, Δ𝜇T−ARE < 0 implies that the ThSi2 phase is more stable than the AlB2 phase and vice versa. Vibrational contributions to the free energy The omission of the lattice dynamics in Eq. (4.36) is based on the cancellation of the vibrational contributions upon comparing similar structures. In order to check the validity, it has to be proven that the lattice dynamics of the structures to be compared yield similar free energies. Lattice dynamics covers a broad field of different effects and methods which would exceed the purposes of this work. A quite feasible concept for estimating lattice-dynamical effects at not too high temperatures is the frozen- phonon method, which is applied in this work. In principle, the dynamics of 𝑁K ions in a DFT system are described by the Hamilton operator 𝐻K with the DFT total energy 𝐸DFT({𝑅𝑘}) acting as an effective potential. The latter depends on the ionic positions {𝑅𝑘}, 𝑘 ∈ [1, 3𝑁K] (Born-Oppenheimer ap- proximation, Eqs. (2.2); from now on, the positions are written component-wise). For small displacements from the equilibrium, 𝐸DFT({𝑅𝑘}) can be approximated by a second-order expansion, the harmonic approximation. The coefficient of the quad- ratic term, the Hessian matrix of 𝐸DFT({𝑅𝑘}) or dynamical matrix, can be calculated by finite differences. Its eigenvalues are (the square of) frequencies 𝜔𝑘 and its ei- genvectors are harmonic vibrations. Hence, as long as the harmonic approximation is valid, i.e. the temperature is not too high, the lattice dynamics is described by an ensemble of 3𝑁K independent quantum-mechanical harmonic oscillators. Since 𝐸DFT({𝑅𝑘}) is periodic in crystals, an approach similar to the Bloch theorem (Sec. 2.2) can be employed to reduce the vibrating ions to the𝑁 cellK ions of the unit cell. The band structure of the eigenfrequencies gives rise to 3𝑁 cellK discrete branches, the phonons, which depend on a continuous parameter 𝒒 confined to the Brillouin zone [6, pp. 41ff]. The “frozen-phonon” method consists in calculating the phonons at zero tem- perature and using them for describing the lattice dynamics at finite temperatures. The exact calculation of state variables of a phononic system involves q-integrations over the Brillouin-zone and summations over discrete branches. In this work, the 43 4 The rare-earth silicide bulk phases VASP-built-in phonon routines are employed, which calculate only the 𝑁 cellK phonons at 𝒒 = 0. The sum over them corresponds to sampling the Brillouin-zone solely by the Γ-point, which is insufficient particularly for small unit cells. To resolve this is- sue, the unit cells are multiplied to supercells so that the Γ-point includes also 𝒒 ≠ 0 points according to band-folding. In doing so, symmetry-equivalent entries of the dynamical matrix are identified so as to reduce the computational demand. Once the eigenfrequencies 𝜔𝑘 are determined, the vibrational free energy 𝐹vib(𝑇 ) at finite temperatures 𝑇 can be calculated by the quantum statistics of independent harmonic o∏scillators.3𝑁cell 3∏𝑁cellK K 𝑍 = tr[exp(−𝛽ℏ𝜔 1𝑘 (𝑛 + 2))] = (2 sinh(𝛽ℏ𝜔 /2)) −1 𝑘 , 𝛽 = 1/𝑘B𝑇 𝑘=1 ∑︁ 𝑘=13𝑁cellK 3∑︁𝑁cellK ⇒ 1 𝑇→0 ℏ𝜔𝑘𝐹vib(𝑇 ) = − ln(𝑍) = 𝑘B𝑇 ln(2 sinh(ℏ𝜔𝑘/2𝑘B𝑇 )) ≃ (4.38) 𝛽 2 𝑘=1 𝑘=1 Adding this to the electronic free energy ≈𝐸cellDFT yields the total free energy 𝐹 (𝑇 ). 𝐹 (𝑇 ) = 𝐸cellDFT + 𝐹vib(𝑇 ) (4.39) In the low-temperature limit, 𝐹vib(𝑇 ) approaches the sum of the ground-state ener- gies of the harmonic oscillators. Consequently, vibrational contributions can be rel- evant even at low temperatures, at which the 𝑇𝑆 terms vanish. The theoretical treatment of the rare-earth elements As the considerations in Sec. 3.1 point out, the lanthanoids have a strongly correlated, incomplete 4f shell. Therefore, pure DFT predicts the wrong ground state with frac- tionally occupied 4f levels lying at the Fermi level. The first of two possible solutions for this problem is the LDA+U method (Sec. 2.1.1), which works quite well for the 4f electrons of the lanthanoids since they obey Hund’s rules. The alternative solution makes use of the inertness of the 4f electrons and “freezes” them in the core during the pseudopotential generation. The major a-priori decision to be made is then the VEC of the Ln atom, i.e. how many 4f electrons are kept frozen in the core. Of course, the second approach is only possible if the valence is known a priori. The explicit treatment of the 4f electrons with LDA+U is a quite universal approach as the valence of the Ln atoms is not set at the beginning, but rather develops self- consistently in the respective structure model. However, this approach has import- ant drawbacks. Firstly, the computational demand is significant, in particular for the heavy trivalent lanthanoids, as 2 + 𝑛 electrons per Ln atom have to be considered, while only 2 + 1 electrons determine the chemistry. Secondly, the convergence into the ground state can be rather difficult, so many test calculations may have to be done until the spin configuration is correct. The third drawback consists in the paramet- ers 𝑈 and 𝐽 , which depend on the VEC and alter the total energy by a gauge. There- fore, calculations with different𝑈 and 𝐽 and, hence, different VECs cannot directly be 44 4.2 Structure optimisation compared. Instead, test calculations are necessary, from which the correct VEC can be determined by comparing observable quantities with the experiment (cohesive energies, bond lengths, lattice constants etc.). Since this limits the ab initio character of the explicit f-electron treatment, the frozen-core approach often gives satisfying results while saving computational demand. It will be the method employed in this work, while the explicit f-electron treatment will be tested. Computational parameters The DFT calculations are carried out with VASP [2, 3] using PAW potentials and the PBE xc-functional [10, 21, 22]. The 4f electrons of the REs are kept frozen in the core, while the 5p electrons are treated explicitly as semi-core electrons. As the considered REs are trivalent in their silicides [81, 83], the REs have a PAW valence of 5p65d16s2. Si has a PAW valence of 3s23p2. The validity of freezing the 4f electrons in the core will be tested for TbSi2 and ErSi2 (Sec. 4.2.3). The kinetic energy cutoff is 𝐸cut = 400 eV. The Brillouin zone is sampled by a Γ-centred Monkhorst-Pack mesh [117] at a density of 16 × 16 × 16 k-points for the hexagonal unit cell and 16 × 16 × 8 k-points for the conventional tetragonal unit cell. For the supercells, the sampling density is scaled accordingly. The Brillouin-zone integration is carried out by a Gaussian occupation with a smearing of 20 meV for the ionic relaxations and by the tetrahedron method with Blöchl corrections [118] for all static calculations. A convergence test proves that increasing the energy cutoff or the sampling density alters the total energy by≈1 meV per atom. The ionic forces are calculated according to the Hellmann-Feynman the- orem [119]. The ions move along them towards the equilibrium positions and stop to relax if the forces acting on each ion are smaller than 0.005 eV/Å. 4.2.2 The stoichiometric RESi2 phases The mechanical equilibria of the vacancy-free prototypes of TbSi2, DySi2, HoSi2 and ErSi2 are determined by Murnaghan relaxations. For this purpose, the unit cells of AlB2-RESi2 and ThSi2-RESi2 are set up according to Fig. 4.1 and their zero-pressure volumes 𝑉0 are estimated by the code-level routines. After that, eleven samples in a ±10 % interval around the estimated 𝑉0 are set up and relaxed with respect to the atomic positions and the shape of the unit cell, but at constant volume 𝑉 . The DFT total energies are taken as free energies 𝐹, neglecting vibrational contributions (vide supra). The samples (𝑉, 𝐹) are fitted by the Murnaghan equation of state 𝐹 (𝑉 ) (Eq. (4.33)), which returns the bulk mechanical parameters. For the ThSi2 prototype, 𝑉 and 𝐹 are rescaled by 1/4 so that they become comparable with those of the AlB2 prototype. Furthermore, 𝐹 is reset to 𝐹0. The Murnaghan fits excellently retrace the sample points for all silicides and phases (Plots: Fig. 4.4, parameters: Tab. 4.1). The equilibrium volumes per formula unit 𝑉0 contract almost linearly towards heavier REs, in accordance with the lanthanoid con- traction. Hex-AlB2-RESi2 is slightly denser than tet-AlB2-RESi2, i.e. 𝑉0 of the former is smaller than that of the latter (−0.4 %, −0.5 %, −0.7 % and −0.9 % for TbSi2, DySi2, 45 4 The rare-earth silicide bulk phases 350 (a) hex-AlB2 ErSi2 (b) tet-ThSi2 ErSi2HoSi2 HoSi2 300 DySi2 DySi2 250 TbSi2 TbSi2 200 150 100 50 0 50 52 54 56 58 60 62 64 50 52 54 56 58 60 62 64 V − V0 (Å3) V − V0 (Å3) Figure 4.4: Murnaghan fits of TbSi2, DySi2, HoSi2 and ErSi2 in (a) the hex-AlB2 and (b) the tet-ThSi2 phase. Crosses: sample points, lines: Murnaghan fits. All extensive quantities are scaled by the number of RE atoms per unit cell. hex-AlB2 prototype tet-ThSi2 prototype 𝑉 (Å30 ) 𝐾0 (eV/Å3) 𝐾 ′ (eV/Å6) 𝑉0 (Å3) 𝐾0 (eV/Å3) 𝐾 ′ (eV/Å6) Tb 57.901 0.524181 4.10229 58.141 0.540657 4.39081 Dy 57.308 0.527378 4.15985 57.617 0.543023 4.43436 Ho 56.756 0.530427 4.24312 57.162 0.547184 4.35291 Er 56.215 0.533331 4.38906 56.741 0.553545 4.19432 Table 4.1: Bulk mechanical properties of TbSi2, DySi2, HoSi2 and ErSi2 in the hex-AlB2 and the tet-ThSi2 phase. This work Exp. reference 𝑎 (Å) 𝑐 (Å) 𝑐/𝑎 Δ𝜇T−ARE (meV) 𝑎 (Å) 𝑐 (Å) 𝑐/𝑎 hex-AlB2 prototype Tb 4.100 3.978 0.970 3.84 4.14 1.078 Dy 4.097 3.943 0.962 3.83 4.12 1.076 Ho 4.096 3.907 0.954 3.80 4.10 1.079 Er 4.090 3.881 0.949 3.78 4.08 1.079 tet-ThSi2 prototype Tb 3.977 14.707 3.698 −49 4.05 (3.96) 13.38 3.304 Dy 3.953 14.747 3.731 −44 4.04 (3.95) 13.33 3.300 Ho 3.916 14.912 3.808 −40 4.03 (3.97) 13.31 3.303 Er 3.891 14.995 3.854 −37 — — — Table 4.2: Lattice parameters and relative RE chemical potentials of TbSi2, DySi2, HoSi2 and ErSi2 in the hex-AlB2 and the tet-ThSi2 phase. Numbers in parentheses indicate orthorhombic distortion. Experimental reference: [82, 83, 85]. 46 F − F0 (meV) 4.2 Structure optimisation HoSi2 and ErSi2, respectively). The compressive moduli 𝐾0 of all RESi2 are compar- able, as well as those of the AlB2 and the ThSi2 phase. They range from 0.52 eV/Å3 to 0.56 eV/Å3, i.e. ∼80 GPa. The rigidity slightly grows from the lighter to the heavier REs and from the hex-AlB2 to the tet-ThSi2 phase. The lattice parameters (Tab. 4.2) reveal that the volume contraction from TbSi2 to ErSi2 is anisotropic. This can be linked to how the Si anionic sublattice is expected to respond to shrinking RE cations. Concerning the hex-AlB2 structure, the honeycomb- Si layers are quite rigid, so the shrinkage of the cations in between is mostly com- pensated by the interlayer distance, the axial lattice constant 𝑐. From TbSi2 to ErSi2, 𝑐 shrinks by −2.4 %, while the basal lattice constant 𝑎 is almost constant (−0.2 %). In summa, the volume contracts by −2.9 %. The situation in the tet-ThSi2 structure is more complicated as the anionic sublattice comprises a three-dimensional network of Si atoms. From TbSi2 to ErSi2, the 𝑎 shrinks by −2.2 %, while 𝑐 grows by +2.0 %. In summa, the volume shrinks by −2.4 %, similarly to the hex-AlB2 phase. The chemical potential differences Δ𝜇T−ARE (Eq. (4.37)) tell that the tet-ThSi2 phase is considerably more stable than the hex-AlB2 phase for all stoichiometric RESi2. Although, the ThSi2 phase slightly destabilises from TbSi2 to ErSi2, the potential difference is still higher than 𝑘B𝑇 at low and medium temperatures (𝑘B · 100 K = 8.6 meV). Therefore, the formation of hex-AlB2-RESi2 is not possible. The results for stoichiometric RESi2 conflict with several experimental observations. The first contradiction is that the AlB2 phase is unstable, even for ErSi2, although it is adopted by the silicides of all REs and the only prevailing phase in ErSi2 [85]. Furthermore, the lattice constants deviate considerably from the experimental ref- erence, not only absolutely by several percent. Also the 𝑐/𝑎 ratios strongly differ by more than 10 % and are even inverted for hex-AlB2-RESi2. The last major deviation from the experiment is the missing orthorhombic distortion in ThSi2-RESi2 as 𝑎 and 𝑏 remain equal during relaxation, even if the symmetry of the starting configura- tion is broken by hand. The origin of the discrepancies lies in the high amount of Si vacancies in the real RESi2 – x structures, as it will be proven in Sec. 4.2.5. Before addressing this, the validity of two technical approximations will be verified by test- ing them on the stoichiometric RESi2 structures: The frozen-core approach of the 4f electrons (Sec. 4.2.3) and the omission of vibrational contributions (Sec. 4.2.4). 4.2.3 The role of the 4f electrons in RESi2 In order to test if the 4f electrons have effects on the structures, the Murnaghan relax- ations are repeated for the TbSi2 and ErSi2 structures with f-valent PAW potentials3. The DFT calculations employ spin polarisation and the rotationally invariant, simpli- fied LDA+U approach, proposed by Dudarev et al. [14, 15] (Sec. 2.1.1). Since only the 4f shell of the RE atoms must be corrected, one set of 𝑈 and 𝐽 parameters has to be determined. 𝑈 is calculated with the self-consistent linear-response approach (Eqs. (2.14) and (2.15), [13]). For this purpose, the unit cells from the previous section are 3Potentials which treat the 4f electrons explicitly as valence electrons. The corresponding calculations are referred to as f-valent, in contrast to the trivalent ones above. 47 4 The rare-earth silicide bulk phases multiplied to supercells ((2×2×2) for hex-AlB2-RESi2, (2×2×1) for tet-ThSi2-RESi2) so that the penalised RE atom is separated from its periodic replica. The k-point mesh is adapted to the supercells. The results for 𝑈 are 5.0 eV and 4.9 eV for AlB2-TbSi2 and ThSi2-TbSi2, respectively. Since the values are consistent, a common 𝑈 = 5.0 eV is used for all f-valent TbSi2 calculations. AlB2-ErSi2 yields a self-consistent 𝑈 para- meter of 7.4 eV, used for all f-valent ErSi2 calculations. According to Eq. (2.13), 𝑈 and 𝐽 enter the Dudarev correction only by their difference. The exact value of the effective parameter (𝑈 − 𝐽) is irrelevant, though, once it passes a certain threshold (idempotent occupation matrix, Hund’s rules). Therefore, (𝑈 − 𝐽) can safely be set to 𝑈 . Once the individual RE atoms adopt the correct spin configuration, the atomic magnetic moments provide different ways of ordering. Since magnetic order is not the focus of this work, only orderings which are collinear and commensurate with the conventional unit cells are considered. Containing one RE atom, the unit cell of AlB2-RESi2 is described as ferromagnetically ordered (FMO). The four atoms in ThSi2- RESi2 permit additional antiferromagnetically ordered (AMO) configurations, out of which only that with alternating FMO (001)-planes is considered. Since the symmetry group of the unit cell can prevent orbitals from splitting and artificially retain degen- eracies, the symmetry reduction of the Brillouin zone is switched off and the lattice of the initial unit cell is slightly distorted before relaxation. The results of the Murnaghan relaxations for f-valent TbSi2 are compiled in Tab. 4.3. As expected from the inertness of the 4f shell, the f-valent ThSi2-TbSi2 unit cell is almost equal to the trivalent one (𝑎: +0.1 %, 𝑐: −0.7 %, 𝑉 : −0.6 %). In particular, it retains the tetragonal symmetry. The structural deviations due to magnetic order- ing are negligible. On the contrary, the f-valent AlB2-TbSi2 unit cell incurs a con- siderable orthorhombic distortion with regard to trivalent hex-AlB2-TbSi2, resulting in a new ort-AlB2-TbSi2 structure. In detail, the dashed orthorhombic envelope of the primitive AlB2 unit cell in Fig. 4.1 (b) expands in the [1̄1.0]-direction and con- tracts in the perpendicular [11.0]-direction. Thereby, the formerly isotropic basal lattice constants split into a larger one in the [10.0]-direction (denoted 𝑎, 4.214 Å, 𝑎 𝑐 𝑐/𝑎 𝑉/𝑁Tb 𝛾 Δ𝜇Tb (Å) (Å) (Å3) (meV) trivalent potentials hex-AlB2-TbSi2 4.100 3.978 0.970 57.90 120.0◦ 17 ort-AlB2-TbSi2 4.229 (3.878)𝑎 4.002 0.946 58.34 124.7◦ 0 tet-ThSi -TbSi 3.977 14.707 3.698 58.14 122.5◦2 2 −32 f-valent potentials fmo-ort-AlB2-TbSi2 4.214 (3.886)𝑎 3.986 0.946 57.92 124.4◦ 0 fmo-tet-ThSi2-TbSi2 3.979 14.606 3.671 57.82 122.3◦ −31 amo-tet-ThSi2-TbSi2 3.978 14.617 3.674 57.83 122.2◦ −38 Table 4.3: Lattice parameters and relative Tb chemical potentials of TbSi2 in the AlB2 and the ThSi2 phase under different f-electron treatments. The unit cells were optimised by Murnaghan relaxations. 𝑎orthorhombic distortion. 48 4.2 Structure optimisation +2.8 % compared to that of trivalent hex-AlB2-TbSi2) and a smaller one in the [11.0]- direction (denoted 𝑏, 3.878 Å, −5.4 %). The orthorhombic distortion also redistributes the three M-star angles of 120◦ to two larger ones of 𝛾 = 124.4◦ (cf. Fig. 4.1 (b)) and a smaller one of 111.2◦. Remarkably, the axial lattice constant 𝑐, as well as the unit- cell volume𝑉 remain constant (+0.2 % and ±0.0 %, respectively). In other words, the orthorhombic distortion is an area-conserving deformation of the basis. In order to clarify whether mechanisms other than the f-electron treatment are responsible for this, the ort-AlB2-TbSi2 unit cell is again subjected to a Murnaghan relaxation, this time with trivalent PAW potentials. Instead of relaxing back into hex-AlB2-TbSi2, it retains the orthorhombic distortion (Tab. 4.3). All deviations in the lattice constants of the trivalent and f-valent structures are below 0.5 % and the f-valent volume is slightly smaller (−0.7 %), similarly to tet-ThSi2-TbSi2. Therefore, the orthorhombic distortion of AlB2-TbSi2 is indeed physical and was merely inhibited by the fixed sym- metry group in the previous trivalent calculations. It is remarkable that the volumes per TbSi2 formula unit of ort-AlB2-TbSi2 and tet-ThSi2-TbSi2 are almost equal and that the three angles of the M-stars assume similar values. To facilitate the comparison, the Tb chemical potentials 𝜇Tb in Tab. 4.3 are reset by that of ort-AlB2-TbSi2, respectively for the trivalent and the f-valent calculations. Δ𝜇•Tb = 𝜇 • Tb − 𝜇 ort-AlB2-TbSi2 Tb Concerning the trivalent calculations, the orthorhombic distortion stabilises the AlB2- TbSi2 structure by −17 meV. As a consequence, the AlB2 and the ThSi2 phase are en- ergetically closer together (Δ𝜇T−ATb = −32 meV). Concerning the f-valent calculations, tet-ThSi2-TbSi2 is more stable in the AMO than in the FMO configuration by −7 meV. Comparing the two FMO structures, fmo-tet-ThSi2-TbSi2 is more stable than fmo-hex- AlB2-TbSi2 by Δ𝜇T−ATb = −31 meV, in agreement with the trivalent calculations. The results for f-valent ErSi2 may be found in Tab. 4.4. The overall trends are very similar to those of TbSi2, including the well agreement between the trivalent and 𝑎 𝑐 𝑐/𝑎 𝑉/𝑁Er 𝛾 𝜇Er (Å) (Å) (Å3) (meV) trivalent potentials hex-AlB2-ErSi2 4.090 3.881 0.949 56.22 120.0◦ 24 ort-AlB2-ErSi2 4.240 (3.840)𝑎 3.914 0.923 56.82 125.4◦ 0 tet-ThSi2-ErSi2 3.891 14.995 3.854 56.74 124.3◦ −13 f-valent potentials fmo-ort-AlB2-ErSi2 4.234 (3.847)𝑎 3.912 0.924 56.67 125.2◦ (0)𝑏 fmo-tet-ThSi2-ErSi2 3.897 14.924 3.829 56.66 124.0◦ (−185)𝑏 amo-tet-ThSi2-ErSi2 — — — — — (−190)𝑏 Table 4.4: Lattice parameters and relative Er chemical potentials of ErSi2 in the AlB2 and the ThSi2 phase under different f-electron treatments. The unit cells were optimised by Murnaghan relaxations. 𝑎orthorhombic distortion. 𝑏not reliable due to numerical problems. 49 4 The rare-earth silicide bulk phases f-valent lattices (<0.5 %) and the orthorhombic distortion of AlB2-ErSi2. The stabil- ity gain from the orthorhombic distortion is higher for AlB2-ErSi2 (−24 meV) than for AlB2-TbSi2 (−17 meV). Consequently, trivalent ort-AlB2-ErSi2 is energetically close to trivalent tet-ThSi2-ErSi2 (Δ𝜇T−AEr = −13 meV). Although, the relaxation of f-valent ThSi2-ErSi2 was performed only in the FMO configuration, the dependence of Δ𝜇Er on the magnetic ordering can be estimated by the total energy of the FMO-relaxed cell in the AMO configuration. The latter is more stable by −5 meV, similarly to the TbSi2 case. The potential difference Δ𝜇T−AEr of the FMO structures (−185 meV) is much higher than that of the trivalent structures (−13 meV), which is the only incongru- ity. Since the deviation is not linked to any structure differences, it is likely that the ground state configurations – at least that of fmo-ort-AlB2-ErSi2– were incorrect. In detail, although the total on-site charge and magnetisation were as expected, the spin ordering within the 4f shell could be wrong. Supporting evidence for this is that the Murnaghan relaxations produced several classes of images which had equal on-site charges, magnetisations and equilibrium volumes, but different energy offsets. The nature of such problems is clearly numerical, so the f-valent Δ𝜇Er in Tab. 4.4 are not reliable. That such problems were absent in f-valent TbSi2 agrees with the this as- sumption as Tb has only one electron in the second half-shell, not four like Er. In conclusion, neither the explicit treatment of the 4f electrons, nor the magnetic ordering have major effects on the RESi2 structures, as expected. Also the relative stabilities between the phases are retained, provided that the magnetic ordering is analogue. Therefore, the frozen-core approach is safe for use and the trivalent poten- tials are employed for all further RESi2 calculations (bulk, films and nanowires). 4.2.4 Vibrational contributions in RESi2 The lattice-dynamical effects on 𝜇Tb will be quantified for AlB2-TbSi2 and ThSi2-TbSi2 by calculating the vibrational free energies within the frozen-phonon approach and the harmonic approximation (vide supra). The error due to the omitted thermal ex- pansion can be estimated by the experimentally determined linear thermal expan- sion coefficients of ≈15 · 10−6 K−1 for AlB2-RESi2 – x [98]. Since this approaches the error bars for the lattice constants in this work (≈0.5 %, vide infra) at room tem- perature, the approach is considered to be safe up to 300 K. The unit cells are mul- tiplied to supercells so as to increase the effective Brillouin-zone sampling for the phonons (vide supra). The comparability between the AlB2 and the ThSi2 structure at different effective samplings is ensured by embedding the AlB2 structure into an orthorhombic unit cell (the dashed red cell in Fig. Fig. 4.1 (b) doubled in the [1̄1.0]- direction), which matches the tetragonal unit cell of the ThSi2 structure by the dimen- sions. These Tb4Si8 cells, denoted ThSi2-type unit cells to demarcate them from the conventional ones, are optimised by Murnaghan relaxations analogous to Sec. 4.2.2. The results are compiled in Tab. 4.5. The lattice parameters of tet-ThSi2-TbSi2 are equal to those in Tab. 4.3. The AlB2 phase splits into three slightly different struc- tures. Two of these are hex-AlB2-TbSi2 (𝛾 = 120◦) and ort-AlB2-TbSi2 (𝛾 = 124.7◦), whose lattice parameters are almost equal to those in Tab. 4.3. The third structure, denoted ort2-AlB2-TbSi2 (𝛾 = 118.1◦), is new and represents the other possibility of 50 4.2 Structure optimisation 𝑎 (Å) 𝑏 (Å) 𝑐 (Å) 𝑉/𝑁 3Tb (Å ) 𝛾 Δ𝜇Tb (meV) hex-AlB2-TbSi2 4.10 3.98 14.19 (4.10)𝑎 57.9 119.9◦ 18 ort-AlB2-TbSi2 3.88 4.00 15.04 (4.23)𝑎 58.4 124.7◦ 0 ort2-AlB2-TbSi2 4.20 3.98 13.87 (4.05)𝑎 58.0 118.1◦ 17 tet-ThSi2-TbSi2 3.98 3.98 14.71 58.1 122.6◦ −32 Table 4.5: Lattice parameters and relative Tb chemical potentials of TbSi2 in the AlB2 and the ThSi2 phase in orthorhombic Tb4Si8 cells. The unit cells were optimised by Murnaghan relaxations. The lattice constants 𝑎, 𝑏 and 𝑐 refer to the ThSi2-type unit cells4. 𝑎lattice constant 𝑎 in the conventional [10.0]-direction. orthorhombic distortion, contrary to that in ort-AlB2-TbSi2, i.e. dilation in the [11.0]- direction and contraction in the perpendicular [1̄1.0]-direction. This new structure is discarded as it is not more stable than hex-AlB2-TbSi2. For the other three structures, the (static) values for Δ𝜇Tb are equal to those in Tab. 4.3. The Γ-point phonons are calculated in (1×1×1), (2×2×1) and (3×3×1) supercells of the optimised ThSi2-type unit cells. This corresponds to a convergence test for the effective sampling. The restriction of the multiplication to the basal plane is due to the aspect ratio of the ThSi2-type unit cell. The (3×3×1) supercell is almost cubic so that the effective sampling of the reciprocal space is nearly isotropic. The vibrational free energy 𝐹vib(𝑇 ) of each supercell is calculated with Eq. (4.38), scaled by the respective number of Tb atoms 𝑁Tb and plotted from 𝑇 = 0 K to 270 K (Fig. 4.5 (a) – (c)). While the changes in 𝐹vib(𝑇 ) from the (1 × 1 × 1) to the (2 × 2 × 1) supercells are visible (≈8 meV at𝑇 = 0 K), the free energies of the (2×2×1) and the (3×3×1) supercells are almost equal (deviation <1 meV at 𝑇 = 0 K). Thus, the (3 × 3 × 1) supercells provide well converged absolute vibrational free energies. The lattice dynamical effects can be introduced into the Tb chemical potentials 𝜇Tb simply by adding the vibrational free energies per Tb atom, since the stoichiometries of all structures are equal. They are reset by 𝜇ort-AlB2-TbSi2Tb (𝑇 ) ≕ 𝜇 0 Tb(𝑇 ). • • 0 0 Eq. (4.35) 𝐹 (𝑇 ) − 𝑁 𝜇Si 𝐹 (𝑇 ) − 𝑁 𝜇Si Δ𝜇•Tb(𝑇 ) = Si Si 𝑁• − 𝑁0Tb Tb Eq. (4.39) 1 ( ) = 𝐸• 0 • 0DFT − 𝐸DFT + 𝐹vib(𝑇 ) − 𝐹vib(𝑇 )𝑁Tb The plots of Δ𝜇Tb(𝑇 ) for the (1 × 1 × 1) and the (3 × 3 × 1) supercells can be found in Fig. 4.5 (d). Those of the (2×2×2) supercells (not shown) respectively lie in between. Comparing the (1×1×1) and (3×3×1) curves of the same structure, the differences in the (relative) Δ𝜇Tb(𝑇 ) (2 meV at 0 K, 4 meV at 270 K) are smaller than the respective differences in the absolute free energies 𝐹vib/𝑁Tb (5 meV to 10 meV for all 𝑇 ). This in- dicates an error cancellation which makes the relative free energies converge faster than the absolute ones. In particular, the error in Δ𝜇Tb(𝑇 ) is near the overall DFT error of this work (1 meV per atom). 4The mapping between the ThSi2-type and the conventional AlB2 unit cell holds (cf. Fig. 4.1 (b)): 𝑎ThSi2 = 𝑏AlB2 ([11.0]-direction), 𝑏 2ThSi2 = 𝑐AlB2 , (𝑐ThSi2/4) + (𝑎ThSi2/2)2 = 𝑎2AlB ([10.0]-direction)2 51 4 The rare-earth silicide bulk phases 100 hex-AlB2 ort-AlB tet-ThSi(a) hex-AlB 2 22 (1×1×1) (3×3×1) static 50 (1×1×1) (2×2×1) 20 0 (3×3×1) (d)10 100 (b) ort-AlB2 0 50 (1×1×1) (2×2×1) 0 (3×3×1) -10 100 (c) tet-ThSi -202 50 (1×1×1) (2×2×1) -30 0 (3×3×1) -40 0 50 100 150 200 250 0 50 100 150 200 250 T (K) T (K) Figure 4.5: Vibrational free energies of (a) hex-AlB2-TbSi2, (b) ort-AlB2-TbSi2 and (c) tet-ThSi2- TbSi2 and (d) the corrected Tb chemical potentials 𝜇Tb(𝑇 ) calculated in different supercells. The line types correspond to the supercells and the colours to the structure. Dash-dotted lines: 𝜇Tb without vibrational contributions (static). Concentrating on the (3× 3× 1) results, the two AlB2 structures approach each other by a bit (Δ𝜇Tb = +18 meV (static), +14 meV at 0 K and +10 meV at 270 K). Also ort-AlB2- TbSi2 and tet-ThSi2-TbSi2 approach each other, though to a lesser extent (Δ𝜇T−ATb = −32 meV (static), −31 meV at 0 K and −29 meV at 270 K). The shifts in 𝜇Tb do not change the relative stabilities between the structures for 𝑇 < 300 K. It is remark- able that the two AlB2 structures differ more from each other than ort-ThSi2-TbSi2 and tet-ThSi2-TbSi2 do, with respect to the vibrational free energies. This can be ex- plained by the Si–Si bond distances and angles within the Si sublattices: While hex- AlB2-TbSi2 provides distances of 2.37 Å, ort-AlB2-TbSi2 and tet-ThSi2-TbSi2 provide each two shorter (2.38 Å) and one longer distance (2.42 Å and 2.41 Å, respectively). Therefore, the local Si geometry seems to be more important for the lattice dynamics than the global symmetry. In conclusion, although the two TbSi2 prototypes look rather different, the vibra- tional free energies in the most stable structures (ort-AlB2-TbSi2 and tet-ThSi2-TbSi2) are almost equal. Thus, the free energies can safely be approximated by the DFT total energies up to 300 K. These results are certainly transferable to all other structurally related RESi2. Furthermore, they can be transferred to the nanostructures as well (films and nanowires), provided that the local Si–Si geometries are comparable. 4.2.5 The vacancy-populated RESi2 – x phases Although the above theoretical results for stoichiometric RESi2 clearly contradict the experimental observations, they reproduce the previous theoretical findings for stoi- chiometric hex-AlB2-YSi2 by Magaud et al. [96]. As already discussed above (pp. 39f), 52 Fvib/NRE (meV) μ − μ ort-AlBTb 2Tb (meV) 4.2 Structure optimisation that work proved that the high amount of Si vacancies, as it occurs in the real AlB2- YSi2 – x structures, is decisive for the agreement between the theoretical and the ex- perimental lattice constants, in particular concerning the 𝑐/𝑎 ratios. The influence of the vacancies on the AlB2 and the ThSi2 prototypes will be demonstrated for TbSi2 – b and ErSi2 – b (one out of six Si sites is vacant). The vacancies are considered maxim- ally ordered, so the conventional unit cells of the AlB2 and the ThSi2 phase have to be tripled to map the exact stoichiometry. Only those configurations are considered in which the vacancies are separated by at least two Si atoms from each other. Fur- thermore, the experimental observation is included that the orthorhombic distor- tion clearly affects the ThSi2 phase, while the AlB2 phase is at least approximately hexagonal. This confines the planar vacancy distributions to configurations 1 and 2 of ThSi2-RESi2 – b and configuration 1 of AlB2-RESi2 – b (Fig. 4.2). The vacancy ordering perpendicular to the plane of projection of Fig. 4.2 (ThSi2: the (010) plane, the AlB2: (0001) plane) is congruent, i.e. the vacancies stack onto each other in that direction. All other orderings would multiply the supercells in that direction, which would ex- ceed the scope of this work. The energetic effects from out-of-plane ordering can be estimated at∼30 meV per vacancy (in hex-AlB2-YSi2 – b [96]), which corresponds to 𝜇RE variations of ∼10 meV. The three RESi2 – b supercells to investigate comprise two (3 × 1 × 1)-ThSi2 supercells describing Th√Si2-RE√Si2 – b (1) and ThSi2-RESi2 – b (2) (green cell in Fig. 4.2 (a)) 5 and a rotated R30◦( 3 × 3 × 1)-AlB2 supercell describing AlB2-RESi2 – b (light green cell in Fig. 4.2 (b)). The structures of all supercells are optimised by code-level relaxa- tions, which are not as accurate as Murnaghan relaxations, but whose uncertainty is expected to be smaller than those introduced by other approximations, e.g. the out-of-plane ordering of the vacancies. Pulay stress is reduced by iterating the relax- ations until the supercells remain unchanged. In order to estimate the reliability of the relaxation method, stoichiometric ort-AlB2-RESi2 and tet-ThSi2-RESi2 are optim- ised at code level as well and compared to the respective Murnaghan results. The deviations are very small. Only tet-ThSi2-TbSi2 shows a slightly different axial lattice constant 𝑐 (∼1 %). As the Murnaghan structure is more stable by 5 meV per Tb atom, the code-level relaxation is indeed less reliable. The confidence intervals can thus be estimated to 1 % for the lattice constants and 5 meV for the RE chemical potentials 𝜇RE. The k-point meshes are adapted to the supercells. The first set of calculations is conducted with the PBE xc-functional [10], a second set with PBEsol [11]. In addi- tion to the RESi2 – x structures, the bulk phases of diamond-Si and elemental hcp-RE are optimised as well so as to calculate 𝜇RE according to Eq. (4.36). The results for TbSi2 – x are compiled in Tab. 4.6. Those for ErSi2 – x are in the appendix (Tab. B.1). Fig. 4.6 shows images of the optimised TbSi2 – x structures. Concentrating on TbSi2 – b, several predictions about the effects from the vacancies are confirmed. The removal of a Si atom drives the three atoms at the corner of the re- spective M-star towards its centre (light blue arrows in Fig. 4.6). As a consequence, the basal lattice constant 𝑎 of the AlB2 phase contracts by −9.7 %, which, in turn, pushes 5Mind that for internal consistency, the vacancies populate the Si zigzag chains parallel to 𝑎, so the expected orthorhombic distortion would result in 𝑎/𝑏 < 1. In contrast, most of the crystallographic literature uses an interchanged notation for 𝑎 and 𝑏, so 𝑏/𝑎 < 1. 53 4 The rare-earth silicide bulk phases (a) ThSi2-RESi2−b (1) (b) ThSi2-RESi2−b (2) (c) AlB2-RESi2−b (1) d γva γv da da c d db c b √3a dc dc γv 3a 3a Figure 4.6: Structure models of vacancy-populated TbSi2 – b. Orange circles mark vacancies. Dark red lines mark the supercells. Red dumb-bells mark Si–Si bond distances; Red angles mark the bond angles 𝛾v near the vacancies. Light blue arrows indicate Si atoms moving towards the vacancies. All 3D images in this work are rendered with XCrySDen [120]. the (0001)-planes apart and increases the axial lattice constant 𝑐 by +5.0 %. The sym- metry remains hexagonal, contrarily to stoichiometric AlB2-TbSi2. The impact on the ThSi2 prototype deviates from the predictions. The basal lattice constant 𝑎 does not contract, but remains almost unchanged. Instead, the perpendicular basal lattice constant 𝑏 grows by +1.3 % and +2.1 % for ThSi2-TbSi2 – b (1) and ThSi2-TbSi2 – b (2), respectively. Consequently, the tetragonal symmetry breaks into an orthorhombic distortion of 𝑎/𝑏 = 0.994 and 0.979, respectively. The axial lattice constant 𝑐 strongly contracts by −9.4 % and −9.0 %, respectively. The Tb chemical potentials 𝜇Tb (Eq. (4.36)) reveal that the Si vacancies clearly stabilise the RESi2 prototypes (Tab. 4.6). In order to facilitate the stability considerations, the 𝜇Tb are reset by that of hex-AlB2-TbSi2 – b, the most stable TbSi2 – x structure. Δ𝜇• ≔ 𝜇• − 𝜇hex-AlB2-TbSi2 – bTb Tb Tb (4.40) Two further chemical potential differences are defined: • Δ𝜇T−A = 𝜇ThSi2-TbSi2 – x AlB2-TbSi2 – xTb Tb −𝜇Tb : the difference between the most stable ThSi2 and AlB2 structures with the same stoichiometry (Eq. (4.37)) • Δ𝜇vac ≔ 𝜇•-TbSi2 – bTb Tb − 𝜇 •-TbSi2 Tb : the difference between TbSi2 – b and TbSi2 in the same phase •, i.e. the energy gain per Tb atom upon introducing the vacancies. The vacancies stabilise the AlB2 phase by Δ𝜇vacTb = −191 meV and the ThSi2 phase to a lesser extent (−128 meV for ThSi2-TbSi2 – b (1)). As a consequence, the AlB2 phase be- comes more stable than the ThSi2 phase (Δ𝜇T−ATb = +35 meV for TbSi2 – b and −28 meV for TbSi2). However, the exact values have to be interpreted with caution because they seem to strongly depend on the vacancy ordering, as indicated by the potential difference of 42 meV between the two ort-ThSi2-TbSi2 – b configurations. 54 4.2 Structure optimisation 𝑎 𝑏 𝑐 𝑐/𝑎 𝑉/𝑁Tb 𝜇Tb Δ𝜇Tb (Å) (Å) (Å) (Å3) (eV) (meV) PBE ort-AlB2-TbSi2 4.23 3.88 4.00 0.946 58.3 −1.636 191 hex-AlB2-TbSi2 – b 3.82 4.20 1.100 53.0 −1.827 0 tet-ThSi2-TbSi2 3.97 3.97 14.82 3.734 58.3 −1.664 163 ort-ThSi2-TbSi2 – b (1) 4.00 4.02 13.43 3.340 54.0 −1.792 35 ort-ThSi2-TbSi2 – b (2) 3.96 4.05 13.48 3.328 54.1 −1.750 77 PBEsol ort-AlB2-TbSi2 4.17 3.88 3.92 0.940 56.2 −1.886 173 hex-AlB2-TbSi2 – b 3.76 4.16 1.107 51.0 −2.058 0 tet-ThSi2-TbSi2 3.93 3.93 14.57 3.709 56.2 −1.918 141 ort-ThSi2-TbSi2 – b (1) 3.93 3.98 13.27 3.335 51.9 −2.009 49 ort-ThSi2-TbSi2 – b (2) 3.90 4.01 13.35 3.333 52.1 −1.964 95 Exp. reference hex-AlB2-TbSi2 – x 3.84 4.14 1.078 52.9 ort-ThSi2-TbSi2 – x 3.96 4.05 13.38 3.304 53.6 Table 4.6: Lattice parameters and relative Tb chemical potentials of TbSi2 – x in the stoi- chiometric and the vacancy-populated AlB2 and ThSi2 phases. The unit cells were optimised by code-level relaxations with different xc-functionals. Experimental reference (mind nota- tion of 𝑎 and 𝑏 for ort-ThSi2): [83, 85]. ErSi2 – x: Tab. B.1. The PBEsol results are similar to the PBE results. The most important difference consist in the contraction of all lattice constants by −0.9 % to −1.9 %. Accordingly, the unit-cell volumes contract by −3.5 % to −3.8 %. The Tb chemical potentials 𝜇Tb shift relatively to each other by ∼20 meV, which does not change the stability or- der, though. Notably, the stability of AlB2-TbSi2 – b relative to ThSi2-TbSi2 – b increases (Δ𝜇T−A = +49 meV). The potential drop due to the vacancies is smaller (Δ𝜇vacTb Tb = −173 meV and −92 meV for hex-AlB2-TbSi2 – b and ort-ThSi2-TbSi2 – b (1), respectively). The absolute positions of 𝜇Tb are considerably lower by >200 meV for all structures. The smaller lattice constants and the higher reaction enthalpies (𝜇Tb, vide supra) within PBEsol are expectable [10, 11]. The ErSi2 – x results (Tab. B.1) are similar to the TbSi2 – x results. The unit-cell volumes are smaller (RESi2 – b: ≈−4 %, RESi2: ≈−2.5 %) due to the lanthanoid contraction. The lattice distortions upon the Si removal are more pronounced, in particular concern- ing the axial lattice constant 𝑐 of the ThSi2 phase: While it is longer in tet-ThSi2-ErSi2 (15.0 Å) than in tet-ThSi2-TbSi2 (14.8 Å), it contracts so strongly due to the vacancies that it is shorter in ort-ThSi2-ErSi2 – b (13.3 Å) than in ort-ThSi2-TbSi2 – b (13.4 Å). In accordance with the higher structural impact, the potential drop due to the vacan- cies is steeper in ErSi2 than in TbSi2 (Δ𝜇vacRE = −266 meV and −208 meV for hex-AlB2- ErSi2 – b and ort-ThSi2-ErSi2 – b (1), respectively). As in TbSi2 – x, the relative stability between the AlB2 and the ThSi2 phase changes sign upon introducing the vacancies (Δ𝜇T−AEr = +43 meV for ErSi2 – b and −13 meV for ErSi2). 55 4 The rare-earth silicide bulk phases The lattices of the vacancy-populated TbSi2 – b and ErSi2 – b structures compare very well with the experimental reference [83, 85], much better than the respective stoi- chiometric RESi2 prototypes. In particular, the symmetry groups match the experi- mental observations, contrarily to the stoichiometric structures. Concerning the AlB2 phase, the 𝑐/𝑎-ratios correctly adopt values greater than 1. At first glance, the PBE lattice parameters are closer to the experimental reference than the those obtained with PBEsol, in particular concerning 𝑎 and 𝑉/𝑁RE. On the other hand, PBEsol de- scribes 𝑐 better. It has to be kept in mind, though, that the RE silicide phases are ap- proximated by ordered and frozen vacancies. Existing theoretical work hints that the vacancies may show considerable diffusion along the Si zigzag chains at finite tem- peratures [99]. Therefore, more sophisticated calculations including disorder and diffusion may affect 𝑎 more than 𝑐. Concerning the ThSi2 phase, the lattice para- meters of ort-ThSi2-TbSi2 – b reproduce the experimental reference up to 1 % within PBE and are too small by up to −2 % within PBEsol. As in the AlB2 phase, disorder and diffusion of the vacancies are likely to influence the lattice parameters. Further- more, real ThSi2-TbSi2 – x contains more Si than TbSi2 – b does, so it is expectable that the lattice constants of more realistic, Si-richer structures will be larger. In summary, it is not possible to conclude on the suitability of the xc-functionals at this point. Fi- nally, the volumes per atom of all investigated RESi2 – x structures are smaller than the weighted atomic volumes of the constituents in the elemental phases. The con- traction ratio ranges from 13 % to 22 %, independently of the xc-functional, and hints at the partial heteropolar character of the compound [80]. Stability analysis Both ErSi2 – x and TbSi2 – x favour the AlB2 phase in the RESi2 – b stoichiometry (𝑥 = 13 ) and the ThSi2 phase in the RESi2 stoichiometry (𝑥 = 0). In order to further analyse this, the relative RE chemical potentials Δ𝜇•RE (Eq. (4.40)) for each RE and phase • are linearly interpolated with respect to 𝑥 and plotted in Fig. 4.7. At the 𝑥 = 13 bound- ary, Δ𝜇ThSi2RE (dashed line) is higher than Δ𝜇 AlB2 RE (solid line). The difference (Δ𝜇 T−A RE ) is similar for ErSi2 – b (green, PBE: +43 meV, PBEsol: +56 meV) and TbSi2 – b (yellow, PBE: +35 meV, PBEsol: +49 meV). In other words, the relative stability of the AlB2 phase with Si vacancies is independent of the RE radius. That RESi2 – b certainly adopts the AlB2 phase agrees with the experimental observations and phase diagrams [89, 91, 93, 110, 111]. At the other boundary (𝑥 = 0), vacancy-free ErSi2 and TbSi2 both favour the (tetragonal) ThSi2 phase over the (orthorhombic) AlB2 phase. The potential lines intersect, as marked by black dotted lines. Because Δ𝜇T−ARE is, according to amount, smaller in ErSi2 (PBE: −15 meV, PBEsol: −12 meV) than in TbSi2 (PBE: −28 meV, PBE- sol: −32 meV), the intersection is closer to the Si-rich boundary for ErSi2 – x than for TbSi2 – x. This permits the conclusion that ErSi2 – x needs a higher Si content to form the ThSi2 phase than TbSi2 – x does. Furthermore, particularly within PBEsol, the sta- bility range of ThSi2-ErSi2 – x so narrow that it is possible that more sophisticated cal- culations shift the intersection beyond 𝑥 = 0, which would exclude the formation of ThSi2-ErSi2 – x at all. Of course, the linear interpolation of Δ𝜇RE is a crude approxim- ation as the real curves are convex6. Nevertheless, if similar shapes are assumed, 6It is expectable that 𝐺RE, defined by Eq. (4.34), has a minimum at a certain 𝑥 for a fixed phase. 56 4.2 Structure optimisation 300 (a) PBE (b) PBEsol 250 AlB2-ErSi2−x AlB2-ErSi2−x ThSi2-ErSi2−x ThSi2-ErSi2−x 200 AlB2-TbSi2−x AlB2-TbSi2−x ThSi2-TbSi2−x ThSi2-TbSi2−x 150 100 50 0 0.30 0.25 0.20 0.15 0.10 0.05 0 0.30 0.25 0.20 0.15 0.10 0.05 0 x x Figure 4.7: Linear interpolation of Δ𝜇RE for TbSi2 – x (yellow) and ErSi2 – x (green) between 𝑥 = 1 3 and 𝑥 = 0. (a) PBE and (b) PBEsol. The line types correspond to the phases. Black dotted lines mark the intersections (𝜇T−ARE = 0). the real curves for the AlB2 and the ThSi2 phase will still be horizontally displaced from each other and this displacement will be higher for ErSi2 – x than for TbSi2 – x, reflecting the different 𝜇T−ARE at the boundaries. As a last test concerning the vacancies, a further Si atom i√s rem√oved from the most stable hex-AlB2-RESi2 – b structure. For this pur√pose, t√he ( 3 × 3 × 1) supercell is doubled in both basal directions to obtain a (2 3 × 2 3 × 1) supercell, in which an additional vacancy is created. The optimised RE12Si19 structures (RESi1.58) turn out to be highly unstable against hex-AlB2-RESi2 – b (PBE: Δ𝜇Tb = +97 meV, Δ𝜇Er = +98 meV; PBEsol: Δ𝜇Tb = +135 meV, Δ𝜇Er = +134 meV). Consequently, the optimal composition under Si-rich conditions, at least for the AlB2 phase, indeed seems to be near the RESi2 – b stoichiometry, in agreement with the phase diagrams. Silicon bond length analysis To complete the study of the RESi2 – x structures, the binding mechanisms are illu- minated by analysing the geometry of the Si sublattices. Since the structures differ in the arrangement of the Si–Si nearest neighbours, it is instructive to classify the Si–Si bonds with respect to their environment by the following notation: • Each Si atom is either twofold coordinated and adjacent to a vacancy, denoted (v)-type, or threefold coordinated, denoted (m)-type. • The nearest-neighbours pairs of Si atoms, the Si–Si dumb-bells, are classified with respect to the types of the constituents, e.g. (mv)-type if the one Si atom is (m)-type and the other (v)-type. In hex-AlB2-RESi2 – b (Fig. 4.6 (c)), each (v)-type Si atom is coordinated by (m)-type Si atoms and vice versa. Thus, all Si–Si dumb-bells are all equivalent and of (mv)-type 57 ΔμRE (meV) 4 The rare-earth silicide bulk phases (length denoted 𝑑a). In ort-ThSi2-RESi2 – b (Fig. 4.6 (a) and (b)), the Si–Si dumb-bells point into three different directions and are unequal. Their lengths are termed after the lattice constant towards which they incline: 𝑑a, 𝑑b and 𝑑c. The stoichiometric hex-AlB2-TbSi2, ort-AlB2-TbSi2 and tet-ThSi2-TbSi2 structures provide only (mm)-type Si–Si dumb-bells, which have two different lengths for the latter two. The Si–Si bond lengths of all PBE TbSi2 – x structures are compiled in Tab. 4.7. Those optimised with PBEsol and those of the ErSi2 – x structures may be found in the appendix (Tab. B.2 and Tab. B.3). The diamond-Si structure serves as reference for the length of Si–Si single bonds (PBE: 2.37 Å, PBEsol: 2.35 Å, tetrahedron angle: 109.5◦). In stoichiometric hex-AlB2-TbSi2, the Si–Si bond length amounts to 𝑑a = 2.37 Å, ex- actly the PBE bond length in diamond-Si. This clearly indicates the presence of single bonds. The other two stoichiometric TbSi2 structures (ort-AlB2 and tet-ThSi2) have (per formula unit) two Si–Si dumb-bells with lengths near the single bond (2.36 Å), while the third dumb-bell is considerably longer (2.42 Å and 2.43 Å). In terms of the zigzag planes of the Si sublattice (Fig. 4.1), the shorter bond length is that of the Si zigzag chains (in-plane), while the larger is the connection between the planes. This structural similarity underlines the above results that ort-AlB2-TbSi2 and tet-ThSi2- TbSi2 merely differ in the way how the planes stack (unidirectional vs. alternately perpendicular), but not in the local geometries. The explanation for the unequal bond lengths may lie in the charge transfer from the RE ions to the Si sublattice: When the zigzag planes move apart, the zigzag chains can accept more charge. The bond lengths and angles in the vacancy-populated TbSi2 – b structures accord with the charge transfer model as well. In hex-AlB2-TbSi2 – b, the (v)-type Si – II atom form- ally has two binding and two free electron pairs in tetragonal coordination. However, the tetrahedron is distorted because the two free pairs are localised closer to the Si atom and push the two binding pairs away. This is reflected by the bond length of 𝑑a = 2.43 Å, which is longer than a single bond, and the bond angle of 𝛾v = 103.8◦, which is smaller than the tetrahedron angle. On the other hand, the (m)-type Si – I atom has three equivalent binding electron pairs in symmetric trigonal-planar geo- metry, while the formal free electron pair occupies the out-of-plane p𝑧 orbital. Cor- respondingly, the respective M-star has inner angles of exactly 120◦. The same mech- anisms apply to the two ort-ThSi2-TbSi2 – b structures as well. All (mm)-type Si dumb- PBE 𝑑a(Å) 𝑑b(Å) 𝑑c(Å) hex-AlB2-TbSi2 2.37 (mm) ort-AlB2-TbSi2 2.36/2.42 (mm) hex-AlB2-TbSi2 – b 2.43 (mv) tet-ThSi2-TbSi2 2.36 (mm) 2.36 (mm) 2.43 (mm) ort-ThSi2-TbSi2 – b (1) 2.49 (vv) 2.44 (mv) 2.36 (mm) 2.39 (mv) ort-ThSi2-TbSi2 – b (2) 2.50 (vv) 2.47 (vv) 2.37 (mm) 2.37 (mv) Table 4.7: Si–Si nearest-neighbour distances in TbSi2 – x optimised with PBE. The notation ac- cords with Fig. 4.6. Letters in parentheses indicate the types of the Si–Si dumb-bell. PBEsol: Tab. B.2; ErSi2 – x: Tab. B.3. 58 4.2 Structure optimisation bells have lengths near the single bond (2.36 Å to 2.37 Å). All (mv)-type Si dumb-bells have larger bond lengths of 2.37 Å to 2.44 Å. If the Si dumb-bell connects two (v)-type Si atoms, the bond length is even larger (2.47 Å to 2.50 Å), as expected from the double repulsion from the free electron pairs at each end. The average bond angles at the (v)-type Si atoms are clearly below the tetrahedron angle (𝛾 ◦ ◦v = 106.6 and 105.3 for ort-ThSi2-TbSi2 – b (1) and ort-ThSi2-TbSi2 – b (2), respectively). The Si–Si bond lengths of the ErSi2 – x structures (Tab. B.3) are quantitatively very similar to those of TbSi2 – x, deviating by less than 1 %. This is surprising as the lattice constants show larger differences of 1 % to 2 %. For instance, 𝑑a is exactly equal in both hex-AlB2-TbSi2 – b and hex-AlB2-ErSi2 – b (2.43 Å). On the other hand, the basal lat- tice constant 𝑎 of hex-AlB2-ErSi2 – b is smaller than that of hex-AlB2-TbSi2 – b by −1.4 % (Tab. 4.6 and Tab. B.1). The lanthanoid contraction is thus not carried out by the Si– Si bond distances, but rather by the bond angles at the vacancies, which are more acute in hex-AlB2-ErSi2 – b (𝛾v = 101.4◦ vs. 𝛾v = 103.8◦). Similar considerations hold for the ThSi2-ErSi2 – b structures. The analysis of the PBEsol structures (Tab. B.2 and Tab. B.3) yields very similar results. All bond lengths are a bit shorter, as well as the bond distance in diamond-Si (2.35 Å, experimental reference: 2.352 Å [17, p. 1065]). The bond angles at the vacancies 𝛾v are more acute in PBEsol than in PBE. The effects from the vacancies can be linked to the M-stars, which both structure prototypes have in common. If vacancies are introduced into the AlB2 structure, the remaining complete M-stars rotate and move closer together whereupon the basis contracts. As a consequence, the RE atoms are squeezed out of the Si honeycombs, which dilates the axis. Both deformations combined invert the 𝑐/𝑎 ratio. The ThSi2 structure behaves differently as its axis locally resembles the basal [1̄1.0] direction of the AlB2 structure, while its basis is a mixture between the basal [11.0] and the axial [00.1] direction of the AlB2 structure. Therefore, the basis of the ThSi2 structure responds only weakly to the vacancies, while the axis considerably contracts. 4.2.6 The CaSi2 phases The ZKB concept (p. 33) was introduced by means of the exemplary alkali and al- kaline earth metal silicides (MmSin). In the particular case of CaIISi – I2 , the Si atoms formally accept an electron and arrange themselves in buckled honeycombs between which the Ca ions are embedded. Such a geometry reminds of the AlB2 phase and raises the question whether the stoichiometric RE silicides might crystallise in one of the CaSi2 phases. As CaSi2 is polymorphic, several structures are discussed for this compound. Three of them comprise alternating hexagonal Ca layers and buckled Si honeycombs. The first structure is the simplest way of layer stacking as the Ca atoms arrange them- selves in the “holes” of the buckled Si honeycombs. Denoted h1-CaSi2, this structure is obviously analogous to the hex-AlB2-RESi2 structure, except that the Si honeycomb is buckled in the former and flat in the latter. CaSi2 does not occur in the h1-CaSi2 struc- ture, but CaGe2 does [121]. The second structure results from the following stacking: The Ca atoms are placed above the lower Si atoms of the buckled honeycomb (T4 59 4 The rare-earth silicide bulk phases (a) ort-AlB2-RESi2 (b) tr3-CaSi2-RESi2 (c) tr6-CaSi2-RESi2 c c c a aa Figure 4.8: Models of TbSi2 in the (a) ort-AlB2, (b) tr3-CaSi2 and (c) tr6-CaSi2 structures. Blue circles are Si atoms; Yellow circles are RE atoms. Red lines mark the unit cells. position), like eggs in a (hexagonal) egg box. The next Si honeycomb covers the Ca layer so that the upper Si atoms are above the Ca atoms, like a turned egg box cover- ing a filled egg box. Denoted tr3-CaSi2 with respect to the (ABC)-type sequence of the Ca layers, this structure is an artificial phase of CaSi2 [122]. The third structure is a mixture between h1-CaSi2 and tr3-CaSi2: 1) Ca atoms above the lower Si atoms of the buckled honeycomb (egg box). 2) Holes of the next Si honeycomb above the Ca atoms. 3) Ca atoms above the holes of the Si honeycomb. 4) Upper Si atoms of the next honeycomb above the Ca atoms (turned egg box). Denoted tr6-CaSi2 with respect to the (AABBCC)-type sequence of the Ca layers, this structure is one of the natural phases of CaSi2 [121]. To answer the above question, the h1-CaSi2, tr3-CaSi2 and tr6-CaSi2 structures are set up with RE (Tb and Er) replacing Ca. The unit cells are optimised at code level with the same parameters as above (Sec. 4.2.5), for PBE and PBEsol. The h1-CaSi2 structure proved to transition into the ort-AlB2-RESi2 structure, which is chosen as reference. Fig. 4.8 shows the optimised structure models. The PBE results for TbSi2 (lattice para- meters and relative RE chemical potentials Δ𝜇RE) are compiled in Tab. 4.8, all other results are in the appendix (Tab. B.4). For both RESi2, at least one of the two CaSi2 structures is more stable than ort-AlB2-RESi2 and even tet-ThSi2-RESi2. This is a very surprising result, particularly because the differences in the RE chemical potentials are considerable. They depend, however, strongly on the xc-functional: Switching from PBE to PBEsol affects Δ𝜇RE of tr3-CaSi2-RESi2 to a large extent (+60 meV and +77 meV for TbSi2 and ErSi2, respectively), and Δ𝜇RE of ort-AlB2-RESi2 (−18 meV and −8 meV) and tr6-CaSi2-RESi2 (+15 meV and +22 meV) to an intermediate extent. The basal lattice constants of TbSi2 and ErSi2 are very similar in the respective struc- tures (deviations below 1 %). The axial lattice constant 𝑐 contracts from TbSi2 to ErSi2 by−1.5 % to−2.3 % (lanthanoid contraction). A similar result has already been found for the hex-AlB2 structures (Sec. 4.2.2). In accordance with the bases, the Si–Si bond lengths and angles of the respective TbSi2 and ErSi2 structures are very similar as 60 4.2 Structure optimisation 𝑎 𝑐 𝑉/𝑁RE 𝛾 𝑑a Δ𝜇RE (Å) (Å) (Å3) (Å) (meV) TbSi2, PBE ort-AlB2 4.23 (3.88)𝑎 4.00 58.3 124.7◦ (110.6◦)𝑎 2.36 (2.42)𝑎 0 tr3-CaSi2 3.78 5.56 63.4 103.6◦ 2.41 −30 tr6-CaSi 3.94 9.34 60.9 104.9◦2 , 118.9◦ 2.49, 2.29 −67 Table 4.8: Lattice parameters and relative Tb chemical potentials of stoichiometric TbSi2 in the AlB2 and CaSi2 phases. The unit cells were optimised by code-level relaxations with PBE. tr6-CaSi2-RESi2 has two independent M-star angles 𝛾 and Si–Si bond lengths 𝑑a. 𝑎orthorhombic distortion. Other results: Tab. B.4 well. Since determined by buckled Si honeycombs, the basis of tr3-CaSi2-RESi2 is smaller than that of ort-AlB2-RESi2. On the other hand, the axial lattice constant 𝑐 is longer. In summa, the total volume per RE atom 𝑉/𝑁RE of tr3-CaSi2-RESi2 is larger than that of ort-AlB2-RESi2 by ≈+9 %. Following the ZKB concept, the Si atoms of the buckled honeycombs are sp3-hybridised and use three orbitals for inter-se bonding, while the fourth accepts a free electron pair. This is reflected by the bond lengths of 2.38 Å to 2.41 Å, which are longer than that of diamond-Si (within the respect- ive xc-functional), and the bond angles, which are more acute than the tetrahedron angle. The tr6-structure shows unexpected peculiarities: On the one hand, the upper Si honeycomb of the unit cell (the tr3-like part) has very long bond lengths of 2.46 Å to 2.49 Å, which is near the length of a (vv)-type Si dumb-bell (vide supra). However, the charge accumulation in a (vv)-type dumb-bell (formal Si2 – ion at each end) is much larger than that in the bonds of the honeycomb (formal Si – ion at each end). On the other hand, the lower Si honeycomb (the AlB2-like part) is almost flat (𝛾 ≈ 119◦) and the bond lengths (2.26 Å to 2.28 Å) are considerably shorter than that in diamond-Si. Since the AlB2-like part seems to be squeezed and the tr3-like part to be stretched, it is even more surprising that tr6-CaSi2 is the most stable structure for TbSi2. In conclusion, the AlB2 and ThSi2 prototypes of stoichiometric RESi2 are unstable against at least one of the CaSi2 phases. This underlines that RESi2 without vacan- cies are far away from the real structures. Furthermore, the stability of the buckled- honeycomb geometries confirms that Si tends to avoid sp2 hybridisation, as already discussed in Sec. 3.2. Since they seem to be concurring phases for Si-rich RESi2 – x, the CaSi2 structures should be taken into account when RE–Si phase diagrams are calculated by means of DFT. From the experimental phase diagrams (pp. 38ff), it is expectable that the vacancies stabilise the ThSi2 and the AlB2 phase more than the CaSi2 phases, so the latter do not occur. That the relative stabilities between the four structures (tet-ThSi2, ort-AlB2, tr-3-CaSi2 and tr6-CaSi2) seem to strongly depend on the xc-functional should be verified by Murnaghan relaxations. Moreover, the CaSi2 phases will be relevant for the monolayer-Tb@Si(111) system (Chap. 5) as they share several structural properties with the latter. 61 4 The rare-earth silicide bulk phases 4.3 Electronic properties The band structure 𝜀𝑛(𝒌) and the DOS are calculated non-self-consistently, i.e. the ei- genvalues are calculated for a fixed ground-state density, which was self-consistently determined on a (16 × 16 × 16) k-point mesh. The k-paths for the band structures comprise densely sampled line segments between high-symmetry points of the Bril- louin zones (according to [123, A.5. and A.10.]). The DOS is calculated on a dense, Γ-centred (32 × 32 × 32) mesh, which produces very smooth curves while the non- self-consistency considerably saves computation time. All other parameters are ana- logous to those in Sec. 4.2 The band structures 𝜀𝑛(𝒌) become rather complicated even for the smallest RESi2 cells. Therefore, they are disentangled by projecting the wavefunctions |Ψ𝑛(𝒌)⟩ onto the spherical harmonics |𝑌 𝑙𝑚,𝐼⟩ centred at ion 𝐼 , where 𝑙 and 𝑚 are the orbital and magnetic quantum numbers. The square modulus of the projection is then a measure for how strong a state is localised in the r 〈espect ive ato〉m ic orbitals. 𝑃𝑙𝑚,𝐼 ( 2𝑛, 𝒌) = 𝑌 𝑙𝑚,𝐼 Ψ (𝒌) 𝑛 (4.41) The projections are very convenient in the PAW method as they equal the PAW pro- jections. Once determined, they define the following scheme for plotting the band structures: Each band 𝜀𝑛(𝒌) is a function of 𝒌, which is plotted against the x-axis. The line width is scaled locally in 𝒌 by the sum of the projections 𝑃𝑙𝑚,𝐼 (𝑛, 𝒌) of a certain group {𝑙, 𝑚, 𝐼}. The portion of a subgroup in this group determines the line colour according to a colour bar. The Brillouin-zone integration for obtaining the DOS can be weighted by the p∑︁rojection∫s as well. This results in the partial DOS (PDOS).𝑁Band 1 DOS(𝐸) = 1∑︁|C | d𝑘 𝛿(𝐸∫− 𝜀𝑛(𝒌)) (4.42a)𝑛= G 𝑁∑︁CGBand PDOS(𝐸) 1= |C | d𝑘 𝛿(𝐸 − 𝜀𝑛(𝒌)) 𝑃 𝑙𝑚,𝐼 (𝑛, 𝒌) (4.42b) 𝑙,𝑚,𝐼∈group 𝑛=1 G CG where 𝐶G is the reciprocal unit cell (Sec. 2.2). The PDOS is plotted against the energy axis (y-axis) next to the band structures. Throughout this section, Si portions (PDOS and bands) are coloured blue, Tb por- tions yellow and Er portions green. As expected, the band structures of TbSi2 – x and ErSi2 – x turned out to be almost identical. Therefore, only TbSi2 – x band structures are analysed, while ErSi2 band structures are compared with those of TbSi2 when the f-valent structures are considered. Moreover, the analysis concentrates on the AlB2 structures since their bands are clearer than those of the ThSi2 structures. The band structures of the RESi2 prototypes Fig. 4.9 shows the band structures of the stoichiometric RESi2 prototypes (hex-AlB2- TbSi2 and tet-ThSi2-TbSi2), each plotted along a k-path connecting all high-symmetry 62 4.3 Electronic properties (a) (i) total (ii) s-orbitals (iii) p-orbitals (iv) d-orbitals 0 -2 -4 -6 -8 -10 -12 Γ XY Σ ΓZ Y1Σ1 Z Γ XY Σ ΓZ Y1Σ1 Z Γ XY Σ ΓZ Y1Σ1 Z Γ XY Σ ΓZ Y1Σ1 Z Tb Z Y1 Σ tet-ThSi2-TbSi A 2 H 1 Γ L Σ Y X Γ Si M K hex-AlB2-TbSi(b) 2 0 -2 -4 -6 -8 -10 -12 Γ MK Γ A L H A Γ MK Γ A L H A Γ MK Γ A L H A Γ MK Γ A L H A Figure 4.9: Band structures of (a) tet-ThSi2-TbSi2 and (b) hex-AlB2-TbSi2. The line width cor- responds to the PAW projections of (i) all orbitals, (ii) the s orbitals, (iii) the p orbitals and (iv) the d orbitals. The line colour indicates the Tb/Si portions, according to the colour bar (yellow → Tb, blue→ Si). The Brillouin zones and k-paths are shown in the middle. The respective PDOSs are plotted against the y-axis to right of the band structures in arbitrary units. points of the primitive Brillouin zone. The DOSs and PDOSs of tet-ThSi2-TbSi2 are scaled by 1/2 so that they match those of hex-AlB 72-TbSi2 by scale . Panel (i) shows the summed s, p and d projections of all atoms (line width, constant). Panels (ii), (iii), and (iv) show the separate s, p and d portions in the PAW projections of all atoms. The line colour indicates Tb/Si portions. On the right hand of each band structure, the PDOS of the respective orbitals is plotted against the y-axis, where yellow and blue portions correspond to Tb and Si, respectively (stack diagram). Thus, the sum of the yellow and blue portions in panel (i) equals the total DOS. The band structures of tet-ThSi2-TbSi2 and hex-AlB2-TbSi2 show strongly dispersing bands. They look rather different from each other at first glance, but have several common properties. The bands belonging to Si have s character at high binding ener- gies8, gain p character at lower binding energies and are mostly p-like near the Fermi level. Above−5 eV, strongly dispersive d bands associated with Tb arise, which finally dominate the region between −3 eV and the Fermi level (and the empty conduction bands above). The d contributions from Si are zero (since the Si PAW potential lacks 7The body-centred ThSi2 cell has two formula units RESi2. All DOSs and PDOSs have arbitrary units. 8The binding energy is exactly the opposite of the energy scale in Fig. 4.9. 63 E − EF (eV) E − EF (eV) 4 The rare-earth silicide bulk phases d projectors) and the p and s contributions from Tb are small throughout the band structure. The RESi2 prototypes furthermore have in common that the Tb-d states mix with the Si-p states, so pure Tb states are hardly present. On the contrary, (al- most) pure Si states exist in both structures. The band structures agree with the findings of the structure analysis. The Si bands be- low −3 eV are sp-like, do not interfere with the Tb states and amount to 3𝑁Tb bands (6𝑁Tb electrons, more clearly visible in hex-AlB2-TbSi2). Hence, they represent the Si–Si bonds and confirm that the bonding within the Si sublattice is single and elec- tronically decoupled from the rest of the cell. The fourth electrons of the Si atoms and the Tb valence electrons form rather complicated Si-Tb-hybrid orbitals, which will be analysed in detail when the orthorhombic distortion is considered (vide in- fra). Since these bands cut the Fermi level, the RESi2 prototypes are clearly metallic. The band structure of hex-AlB2-TbSi2 has a peculiarity which is missing in that of tet- ThSi2-TbSi2: Exactly at the Fermi level, the DOS has a rather sharp peak, which stems from Si-sp and Tb-d states, as revealed by the PDOSs. This peak has already been observed in hex-AlB2-YSi2 [95, 100]. Contrarily, the DOS of tet-ThSi2-TbSi2 is indented right beneath the Fermi level. Orthorhombic distortion The band structures of hex-AlB2-TbSi2 and ort-AlB2-TbSi2 can be found in Fig. 4.10. The k-path is restricted to the three spatial directions (ΓK ∥ e𝑥 , ΓM ∥ e𝑦 and ΓA ∥ e𝑧 (inset in Fig. 4.10 (b.iv)). The changes in the bands due to orthorhombic distortion consist in a handful of features (highlighted by red markings) which agree with the results from the structure analysis. Both structures have in common that the lowest four bands (labelled 1 to 4) majorly belong to the Si honeycomb. Band 1 is Si-s-like and decoupled from Tb. Bands 2 and 3 are each strongly dispersive in one of the basal directions and less dispersive in the perpendicular directions. They are Si-s-like at lower energies and gain Si-p character towards higher energies. Because having only weak Tb contributions, bands 1 to 3 represent exclusive Si–Si bonds (as already found above). The 𝑙𝑚-resolved projec- tions (Fig. B.1, labels in Fig. 4.10) reveal that the p component of bands 2 and 3 points into the direction of the more dispersive branch. In detail, the band being steep along ΓM (the y-direction) has only p𝑦 projections (beside s projections at low energies). The other band being steep along ΓK (the x-direction) has only p𝑥 projections. Both bands are localised along ΓA (the z-direction). This accords with the expectation that the electrons are mobile parallel to the bonds and less mobile perpendicular to them. Band 4 is almost flat in the basal plane (the xy-plane), but steep along the crystal axis (the z-direction). Accordingly, the Si projections are exclusively p𝑧. Differently from band 2 and 3, the Si-s contributions are zero and, instead, Tb-d contributions arise at the zone edges, more precisely Tb-d𝑥𝑧 and Tb-d𝑦𝑧. As these d orbitals are crosses whose lobes point diagonally out of the basal plane, band 4 seems to represent a bond between the Si-p𝑧 and the Tb-d orbitals. The orthorhombic distortion affects only two of the four Si bands: band 2 and 3, whose upper parts shift into opposite directions (feature C in Fig. 4.10). This is clearly 64 4.3 Electronic properties (i) total (ii) s-orbitals (iii) p-orbitals (iv) d-orbitals (a) 7,8 A B 0 2A A31 6 -2 54 -4 2,3 C -6 -8 -10 1 1Å−1-12 hex-AlB2-TbSi2 M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A (b) 0 p dz z 2 -2 -4 px -6 py -8 M y -10 x 1Å−1 K-12 ort-AlB2-TbSi Γ2 Figure 4.10: Band structures of (a) hex-AlB2-TbSi2 and (b) ort-AlB2-TbSi2. The panels and the plotting mode are analogous to Fig. 4.9. Inset in (b,iv): k-path. Red markings highlight differ- ences between (a) and (b). Green numbers enumerate the bands at the Γ-point. linked to the distortion of the Si honeycomb and the concomitant splitting of the Si– Si bond distances (Tab. 4.7). The dilation of the bonds in the y-direction gives rise to a gain in binding energy, which becomes manifest in the downward movement of the Si-p𝑦 band. The other two bonds, which form the zigzag chains and have their major component in the x-direction, shorten a bit and lift the Si-p𝑥 band to lower binding energies. The position of the Si-p𝑧 band, on the contrary, is insensitive to the orthorhombic distortion, agreeing with the constancy of the axial lattice constant 𝑐. Bands 5 and 6 cross the Fermi level and are, thus, partially occupied. Band 5 stems from the d𝑧2 orbitals of the Tb atom, mixed with small in-plane Si-p contributions. While it is shallow near the Γ-point on the in-plane k-paths, it is a very steep and straight line crossing the Fermi level along the out-of-plane k-path. Band 6 is Si-s- like below the Fermi level and its in-plane branch follows a quite narrow electronic parabola with an effective mass of 0.15𝑚e. It abruptly bends into less steep branches beyond the Fermi level, while the character simultaneously changes into a mixture of in-plane Si-p and Tb-d. The out-of-plane branch is Si-s-like and completely below the Fermi level. It follows a wider parabola with an effective mass of >1.5𝑚e, the tenfold of the in-plane mass. The orthorhombic distortion changes several features near the Fermi level. The Tb- d band (band 5) slightly shifts at the K-point and the M-point, which flattens the in- plane dispersion, but leaves the out-of-plane dispersion unchanged. In contrast, band 6 undergoes fundamental changes. The ΓM branch (y-direction) closes a gap with a conduction band above (band 7) so that the light-electron parabola continues its shape across the Fermi level. After the gap closing, it has an almost pure Si-sp char- acter and crosses the second, Si-Tb-hybridised band without mixing (feature A1). The ΓA branch (z-direction) opens a gap at the intersection with a double band (bands 7 65 E − EF (eV) E − EF (eV) 4 The rare-earth silicide bulk phases and 8) dipping below the Fermi level halfway to the A-point. Thereby, band 6 hybrid- ises with the band having Si-p𝑦-Tb-d𝑦𝑧 character and splits apart (feature A3), while the other band having Si-p𝑥-Tb-d𝑥𝑧 character remains unchanged. The ΓK branch (x-direction) retains its shape, but is pushed below the Fermi level (feature A2). The band structures can be summarised as follows. The Si atoms are indeed sp2- hybridised and bind to each other via the trigonal-planar hybrid orbitals, forming three bands and, thus, hosting six electrons. The remaining p𝑧 orbitals are out of plane and hybridise with out-of-plane Tb-d orbitals, forming one band (two elec- trons). The remaining three electrons distribute over conduction bands near the Fermi level. One of these bands stems from the inter-se hybridisation of the d𝑧2 orbitals of the Tb atoms, being rather flat in-plane, but strongly dispersive out-of- plane. A second band is the s-like conduction band of the Si honeycomb, which is strongly dispersive in-plane, and less dispersive out-of-plane. As long as the struc- ture is hexagonal, this band transitions into Si-p-Tb-d states right at the Fermi level along with an abrupt change in curvature. This is the origin of the DOS peak. The or- thorhombic distortion rehybridises this band whereupon the DOS peak flattens out (feature B). Since tet-ThSi2-TbSi2 is very similar to ort-AlB2-TbSi2 concerning the bond lengths and angles, it lacks the DOS peak as well. The orbital-resolved partial atomic charges reveal that the orthorhombic distortion transfers 0.1 electrons per unit cell from the Si-p𝑦 orbitals to the Si-p𝑥 orbitals, in accordance with the shifting of band 2 and 3. This confirms the assumption that the orthorhombic distortion allows more excess charge to accumulate in the zigzag chains, which stabilises the structure. A last point to be emphasized is the incorrectness of the analogy between the Si hon- eycomb in AlB2-RESi2 and a hypothetical graphite-like (or even graphene-like) Si allo- trope, sometimes stated in the literature. In order to clarify this, the band structures are calculated for the AlB2-TbSi2 structures (hexagonal and orthorhombic) after the removal of the Tb atoms, which results in a layered “silicene” system (Si2, vide ap- pendicem, Fig. B.1). The three lowest bands, representing the sp2 single bonds, are very similar in TbSi2 and Si2, as well as the s-like conduction band right below the Fermi level. The major difference consists in the p𝑧 band, whose in-plane branch is flat in TbSi2 (bound to Tb-d→ localised in the basal plane) and dispersive in Si2 (de- localised 𝜋-electron system). Thus, the decisive property of silicene, the delocalised 𝜋-electron system, is clearly missing in TbSi2. Interestingly, but not unexpectedly, the effects on the Si bands from the orthorhombic distortion are similar in AlB2-TbSi2 and AlB2-Si2. Furthermore, the Si2 structures demonstrate the better charge storage of the orthorhombically distorted structures. The total energy difference per Si atom between ort-AlB2-Si2 and hex-AlB2-Si2 is Δ𝜇Si = +65 meV in the neutral state, so the hexagonal version is more stable. The stability relations change when excess elec- trons are added (Δ𝜇Si = −78 meV for +1 electron, Δ𝜇Si = −187 meV for +2 electrons). This underlines that the orthorhombic distortion of AlB2-TbSi2 is driven by charge balance issues due to the mismatch between the formal valences of Tb and Si. Explicit f-electron treatment Fig. 4.11 shows the band structures for f-valent fmo-ort-AlB2-TbSi2 and fmo-ort-AlB2- ErSi2, beside trivalent ort-AlB2-TbSi2, which serves as reference. The plotting mode 66 4.3 Electronic properties (i) ort-AlB2-TbSi2 (tri.) (ii) TbSi2 (f-up) (iii) TbSi2 (f-down) (iv) ErSi2 (f-up) (v) ErSi2 (f-down) 0 -2 -4 -6 -8 -10 -12 M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A Figure 4.11: Band structures of trivalent and f-valent, FMO ort-AlB2-RESi2. (i) trivalent TbSi2, (ii) spin-up and (iii) spin-down of f-valent TbSi2, and (iv) spin-up and (v) spin-down of f-valent ErSi2. The plotting mode is analogous to Fig. 4.9 (b,i). Colour: yellow/green→ Tb/Er, blue→ Si. Red arrows mark band shifts between spin-up and spin-down. ThSi2 structures: Fig. B.2. and k-paths are analogous to Fig. 4.10 (i) (line width: sum of s, p, d and f contributions → constant; line colour: RE/Si portions). Because of the spin polarisation, each f- valent RESi2 structure yields two band structures, one per spin component. The most prominent features in the f-valent band structures are bold, horizontal lines representing the 4f states. As expected, they are localised and occupy seven states in the spin-up component and one (TbSi2) and four (ErSi2) states in the spin-down component. Thus, the net spin polarisation is six (TbSi2) and three (ErSi2) electrons per RE atom. The 4f levels of ErSi2 are deeper than those of TbSi2, reflecting the higher atomic number of Er. Furthermore, the distance between the topmost spin-up and spin-down levels is smaller in ErSi2 than that in TbSi2 as a consequence of the higher exchange energy in the 4f shell of Er. The only non-f bands indirectly affected by the 4f electrons are the Tb-d𝑧2 states (band 5 in the trivalent case), which split apart by 0.41 eV in TbSi2 and 0.16 eV in ErSi2 (red arrows in Fig. 4.11). Being larger in TbSi2 than in ErSi2, this splitting is obviously linked to the net magnetic moment. All other bands are only slightly affected by the presence of the 4f states (<0.1 eV). Apart from the 4f levels, the band structures of ort-AlB2-TbSi2 and ort-AlB2-ErSi2 look very similar. They merely differ by small band shifts of −80 meV to 170 meV, which concentrate more in the RE bands than in the Si bands. This suggests that the band shifts are caused by the differences between the TbSi2 and ErSi2 structures due to lanthanoid contraction. The band structures of tet-ThSi2-TbSi2 and tet-ThSi2-ErSi2 differ by higher band shifts of −230 meV to 230 meV, in accordance with the higher sensitivity of the ThSi2 structure to the cation size (vide appendicem, Fig. B.2). Vacancy-populated structures The band structure of hex-AlB2-TbSi2 – b optimised with PBEsol is shown in Fig. 4.12 (a). For comparison, the band structure of ort-AlB2-TbSi2 optimised with PBEsol is ad- ded (Fig. 4.12 (b)). It shows merely tiny differences to the band structure of the PBE- optimised unit cell (Fig. 4.10 (b)). The plotting mode is the same as that in Fig. 4.10 (line width → orbitals, colour → Tb/Si portion). In order to make potential d con- tributions from the Si atoms visible, explicit projections on the spherical harmonics are em√plo√yed instead of the PAW projections. As hex-AlB2-TbSi2 – b is embedded in a R30◦( 3× 3×1) supercell of the primitive AlB2 cell, the k-path is chosen in such a way 67 E − EF (eV) 4 The rare-earth silicide bulk phases (i) total (ii) s-orbitals (iii) p-orbitals (iv) d-orbitals (a) 0 -2 -4 -6 -8 -10 -12 1Å −1 M K' Γ M' K|Γ A M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A (b) 0 -2 -4 -6 -8 -10 −1 -12 1Å Figure 4.12: Band structures of (a) hex-AlB2-TbSi2 – b and (b) ort-AlB2-TbSi2. The unit cells were optimised with PBEsol. The panels and the plotting mode are analogous to Fig. 4.9. The k-path connects the high-symmetry points of the primitive Brillouin zone of the AlB2 structure. The high-symmetry points of the supercell Brillouin zone are marked red. that it connects the high-symmetry points of the primitive Brillouin zone identified in the reciprocal space of the supercell. The high-symmetry points of the supercell Bril- louin zone M′ and K′ are marked red. All DOS and PDOS of the supercell are scaled by 1/3 to reflect the different sizes of the unit cells. Although the bands of hex-AlB2-TbSi2 – b are rather complicated due to band folding, some differences to ort-AlB2-TbSi2 are apparent. The bands and the PDOS reveal that the Si-s-like conduction bands disappear. The explicit projection scheme results in small Si-d contributions in ort-AlB2-TbSi2, which are zero in hex-AlB2-TbSi2 – b. Fur- thermore, the Fermi energy is lowered, so the binding energies of all bands are smal- ler. Consequently, the Fermi level cuts the DOS in a region with a particularly low density. The last difference consists in sharper peaks in the DOS along with flat bands of Tb-Si character. All these differences accord well with the preliminary considera- tions that the removal of a Si atom reduces excess charge. The strong Si-p-Tb-d con- tributions to the DOS in the range from −1 eV to −4 eV, as well as the binding energy reduction upon lower Si content agrees with experimental photoemission data [94, 97]. Although the DOS near the Fermi level is considerably reduced, hex-AlB2-RESi2 – b is metallic with strongly dispersive bands cutting the Fermi level. Charge transfer The electronic properties are completed by a proof that RESi2 is at least partially het- eropolar. For this purpose, the superposition of atomic charge densities is subtracted from the ground state density of the unit cell, which yields the density of transferred charge. The result is plotted as isosurfaces and tomographies in Fig. 4.13 for stoi- chiometric ort-AlB2-TbSi2 and vacancy-populated hex-AlB2-TbSi2 – b, both optimised 68 E − EF (eV) E − EF (eV) 4.4 Discussion and summary (a) ort-AlB2-TbSi2 (b) hex-AlB2-TbSi2−x Figure 4.13: Charge transfer in (a) ort-AlB2- TbSi2 and (b) hex-AlB2-RESi2 – b. Red→ elec- tron gain; Blue → electron loss. The view is along 𝑐. The bulbs are 3D isosurfaces of the electron transfer. Tomographies across (0001)-planes are inserted for three differ- ence heights (relatively to 𝑐). Green mark- ings highlight the zigzag chains in (a) and the (v)-type Si atoms in (b). Black dotted circles: vacancies. with PBE. The tomographies are taken at three different heights along the axial lat- tice constant 𝑐: the cut through the Si atomic plane at 𝑧 = 0.5𝑐, the cut through the middle plane between the Si atoms and the Tb atoms at 𝑧 = 0.75𝑐, and the cut through the Tb atomic plane at 𝑧 = 1.0𝑐 ≡ 0.0𝑐. The isosurfaces of the ort-AlB2-TbSi2 structure show that charge accumulates in the trigonal-planar bonds between the Si atoms, indicating sp2 hybridisation. In the hex- AlB2-TbSi2 – b structure, the (m)-type Si atoms show three charge bulbs in trigonal- planar geometry, indicating sp2 hybridisation. The (v)-type Si atoms show four bulbs, indicating sp3 hybridisation. Two of them are placed on the bonds to the (m)-type Si atoms and the other two are attached to the (v)-type Si atom, pointing towards the va- cancy and tilted out of the plane by a large angle. According to the tomographies, the Tb layer (𝑧 = 1.0) loses charge to the region between the Tb and the Si layer (𝑧 = 0.75). The charge transfer is as expected and confirms the partial heteropolarity of RESi2 – x. It is remarkable that the electrons accumulate mostly in sp2- or sp3-hybrid orbitals around the Si atoms, and not in defined bonds between the Si and Tb atoms. Integrat- ing the plane average of the transfer reveals that the Tb layer loses 0.14 electrons per Tb atom in ort-AlB2-TbSi2 and 0.11 electron per Tb atom in hex-AlB2-TbSi2 – b. These values have to be interpreted with caution as averaging over planes bears assign- ment problems, in particular because the Tb planes and the Si honeycombs are quite close together. 4.4 Discussion and summary The treatment of TbSi2 – x and ErSi2 – x within DFT provided a detailed picture of the chemistry and the physics of the RE silicides. The novel systematisation of the RESi2 prototypes revealed that the AlB2 and the ThSi2 structure are very similar to each other and merely differ in the orientations of the characteristic planes of zigzag- chain-Si (unidirectional→ AlB2, alternately perpendicular→ ThSi2). The structures are determined by the Si networks, while the RE atoms are embedded in the inter- stices. The Si–Si bond lengths are near the single-bond length of diamond-Si, dilated 69 z=1.00 z=0.75 z=0.50 4 The rare-earth silicide bulk phases by a few percent if charge accumulates. Structure variations are thus carried out mainly by the Si–Si bond angles. That the Si sublattice behaves like a linkage of rigid rods proved to be very helpful for understanding how Si vacancies and the RE ionic radius alter the RESi2 – x structures. The inspection of various RESi2 – x structures lead to the conclusion that both RESi2 prototypes are stabilised by vacant Si sites. Hence, stoichiometric RESi2 is unstable against segregation of diamond-Si from a RESi2 – x remainder. The stability analysis found that the AlB2 phase is stable for Si-poorer conditions, while the ThSi2 phase is preferred at Si-rich conditions. Furthermore, the stability of the AlB2 phase seems to be independent of the RE atomic number. On the contrary, the ThSi2 phase seems to destabilise when switching from Tb to Er. This agrees with expected different sensitivities of the Si sublattices towards the RE radius (layered for AlB2 → indif- ferent, 3D network for ThSi2 → anisotropic deformation). The theoretical stability analysis agrees with the experimentally determined temperature-composition phase diagrams of the RE–Si systems. Moreover, the vacancies are crucial for the agreement between the DFT-optimised lattice parameters and the experiment. They can also change the symmetry group of the structures by an orthorhombic distortion, which was traced back to an unequal vacancy concentration in the two inequivalent zig- zag planes. The lattice parameters of the herein investigated structures match the experimental reference up to 2 %, in spite of the approximated vacancy ordering. The structural and electronic properties agree with the ZKB concept. As proven by the band structures, the Si atoms bind to each other via bonds which are rather in- dependent of the presence of the Tb atoms. The formal free electron pairs of the Si anions occupy p𝑧 (m-type) or sp3-hybrid orbitals (v-type) and bind to the d orbitals of the Tb atom in a predictable manner. All spare electrons occupy conduction bands, which dominate the DOS in the Fermi-level region and consist of Tb-d states mixed with Si-sp states. Since several bands cross the Fermi level, the RESi2 – x band struc- tures are clearly metallic. The ZKB concept explains the inevitable presence of Si vacancies by an improved charge balance, as confirmed by the electronic properties. However, it fails to predict the exact stoichiometry as 1/3 electron per RE atom is still spare in the most stable RESi2 – b structure (the saturation of all valence electrons re- quires a stoichiometry of RESi1.5, which does not exist). The partial heteropolarity of RESi2 – x was confirmed by the calculated charge transfer and the contraction of the volumes upon compound formation. The findings on the bulk RE silicides have important consequences for the RESi2 nanostructures on silicon. Firstly, the peritectic nature of the Si-rich RE silicides sug- gests that also the RESi2 nanostructures might underlie a peritectic phase diagram, which would provoke kinetically inhibited structures. Secondly, the vacancy-related lattice distortions imply that one has to be very careful when employing lattice strain to explain nanostructures which lack vacancies. In particular, the inverted 𝑐/𝑎 ra- tio of the AlB2 phase is an issue in the RESi2 nanowires on Si(001) [P3, P6]. Sup- posed strain-induced growth is discussed for the monolayer-Tb@Si(111) system as well, which is the matter of the next chapter. 70 5 Monolayer films on silicon(111) A silicon(111) substrate can be regarded as a stack of buckled Si honeycombs, some- times referred to as Si bilayers, which also terminate the unreconstructed Si(111) sur- face. Hex-AlB2-RESi2 – b, on the other hand, consists of alternating layers of hexagon- ally arranged RE atoms and (defective) Si honeycombs. Hence, attaching a RESi2 – b (0001) sheet to the Si(111) surface would be a natural continuation of the substrate. As, in addition, the lateral lattice constants match up to a few percent, the silicides of many REs are suitable for epitaxy on Si(111), among these the REs from Gd to Lu and Y (lattice mismatch between −2.55 % and 0.83 % [33, 108]). The multilayer-RESi2 films on Si(111) are too thick for electronic quantum confinement, s√o thei√r electronic properties are similar to those of bulk hex-AlB2-RESi2 – b. The R30◦( 3 × 3) surface diffraction patterns indicate that the vacancies of the RESi2 films are ordered, dif- ferently from the bulk phases [36, 37, 39, 46, 50–52]. In contrast, a RESi2 film with a thickness of one bulk lattice constant, the monolayer-RE@Si(111) system, seems to lack Si vacancies, as indicated by (1 × 1) surface diffraction patterns. More pecu- liarly, the film is so thin that it confines the electronic states to a two-dimensional (semi-)metal [P8, P12, 34, 35, 38–53]. In view of the nanowire-Tb@Si(557) system, this chapter recovers the properties of the monolayer-Tb@Si(111) system. Tb can be considered a prototypical RE and the results are expected to be transferable to the other monolayer-RE@Si(111) sys- tems, at least if the RE is trivalent. This assumption is based on the knowledge about the bulk-RESi2 phases (Chap. 4) and on the extensive, synoptic, joint-experimental- theoretical work by Sanna et al., covering RE@Si(111) systems with thicknesses from sub-monolayer to multilayer [53]. Although the structure model of the monolayer- Tb@Si(111) system is well established, it bears peculiarities which have not been understood yet. Therefore, the chapter begins with the analysis of the structure. A systematic inspection of the configuration space of reasonable structure models proves that the established structure model is the most stable. Moreover, satisfact- ory microscopic explanations are found for all its peculiarities, e.g. the tendency of the covering Si honeycomb to buckle into the opposite direction of the Si honey- combs of the substrate. All findings on the structure are supported by the analysis of the electronic properties. The band structures and Fermi surfaces are compared to experimental ARPES measurements. The characteristic features are explained by relating the bands of the monolayer to those of the structurally related CaSi2-TbSi2 phases according to a simple particle-in-a-box model. The charge transfer is determ- ined as well. The chapter concludes with a critical revision of the popular statement that surface-induced mechanical strain determines the morphological details of the monolayer. 71 5 Monolayer films on silicon(111) 5.1 Structure optimisation The slab method Surfaces are problematic for plane-wave codes because they break the translation symmetry. A solution for this consists in the slab method which employs a supercell with several atomic layers modelling the substrate and empty space representing the vacuum. The two interfaces between the vacuum layer and the substrate layer then simulate the surfaces of the substrate. It is convenient to embed the slab into a right-prismatic supercell so that the axis is parallel to the surface normal and the basis equals the surface unit cell. Both the bulk and the vacuum region have to be thick enough to prevent the states of opposite surfaces from interfering. The slab method has two flavours: The asymmetric approach, which models the surface of interest on the top side, while the bottom side is passivated with hydrogen; and the symmetric approach, which models two identical copies of the surface on the top and the bottom side of the slab. The latter is advantageous if passivation is not possible or if the computational demand can be reduced by inversion symmetry. The monolayer structures are set up on an asymmetric Si(111) slab with a thick- ness of twelve atomic layers (six bilayers). The basis of the supercell represents the trigonal (1 × 1) unit cell of the unreconstructed Si(111) surface. A 31 Å thick vacuum layer separates the periodic replica of the slab in the axial direction (the (111)-direction or z-direction). It reduces to a still sufficient thickness of 24 Å when the slab bears the monolayer on the top side. The dangling bonds from the broken sp3-hybrid orbitals on the bottom side are passivated by H atoms. The clean, unrecon- structed slab is sketched in Fig. 5.1 (a). The lattice parameters of the substrate were determined by a Murnaghan relaxation of the primitive unit cell of diamond-Si (PBE, 16 × 16 × 16 Monkhorst-Pack mesh). The relaxation yielded a conventional lattice constant of 𝑎d-S√i = 5.469 Å, which translates into an Si(111) surface lat√tice constant of 𝑎Si(111) = 𝑎d-Si/ 2 = 3.867 Å and a Si–Si bond length of 𝑑d−Si = 𝑎d-Si · 6/4 = 2.368 Å. These parameters slightly overestimate the corresponding experimental values by +0.7 % (𝑎d-Si = 5.432 Å, 𝑎Si(111) = 3.841 Å, 𝑑d-Si = 2.352 Å [17, p. 1065]). Thermodynamical framework Following the experimental preparation, the monolayer forms in the annealing stage, i.e. during heating the sample at a constant temperature after depositing a certain amount of Tb. This translates into the thermodynamic boundary conditions of con- stant temperature, pressure, Si chemical potential (determined by the presence of the substrate) and amount of Tb (controlled by the Tb coverage). Similarly to the case of bulk RESi2 – x (Sec. 4.2.1), the thermodynamic stability is determined by the Tb chemical potential, defined an(alogously to Eq. (4.36): ) ≈ 1 cell − d-Si − H-Sat − hcp-Tb𝜇Tb 𝐸DFT 𝜇Si 𝑁Si 𝜇H 𝑁H 𝜇𝑁Tb Tb (5.43) where 𝐸cell d−SiDFT is the DFT total energy of the supercell and 𝜇Si the Si chemical potential in diamond-Si. Structures which minimise 𝜇Tb are stable. The H chemical potential 72 5.1 Structure optimisation 𝜇H-SatH refers to the H atoms, whose sole purpose is the saturation of the bottom side of the slab. It is clear that they have no physical meaning for anything happening on the top side, but their presence could have undesired effects on the thermodynamic relations if they were ignored. Therefore, the H chemical potential is calculated from a Si(111) slab which is symmetrically sa(turated by H atoms)at both sides. 𝜇H-Sat 1 = 𝐸H-Sat − 𝜇d−Si𝑁H-SatH 𝑁H-Sat DFT Si Si (5.44) H By this means, all energetic effects from the H-saturated bottom side are stored in 𝜇H-SatH and thus removed from the thermodynamics. Of course, the H term cancels out in differences between the Tb chemical potentials of structures having the same bottom side. This is the case in all structures considered in this chapter. Computational parameters Many parameters are equal to those of the previous chapter (p. 45). The DFT calcula- tions are carried out with VASP [2, 3] using PAW potentials and the PBE xc-functional [10, 21, 22]. The Tb atoms are trivalent (5p65d16s2, 4f electrons frozen in the core). Si and H have PAW valences of 3s23p2 and 1s1, respectively. The kinetic energy cutoff is increased to 𝐸cut = 450 eV to meet the increased requirements of the H atoms. The Brillouin zone is sampled by a Γ-centred surface Monkhorst-Pack mesh [117] at a density of 15 × 15 × 1 k-points. The Brillouin-zone integration is carried out by a Gaussian occupation with a smearing of 20 meV for the ionic relaxations of the metal- lic systems and by the tetrahedron method with Blöchl corrections [118] for all other calculations (static and relaxations of insulators). A convergence test proves that in- creasing the energy cutoff or the sampling density alters the total energy by ≈1 meV per atom. The ionic forces are calculated according to the Hellmann-Feynman the- orem [119]. Differently from the relaxation of the bulk structures, the unit cell is fixed. The lowest four atomic layers of Si are held fixed at the bulk positions during all relaxations. The H atoms on the bottom side are held fixed at the equilibrium positions of the symmetrically H-saturated slab. All other ions move within the unit cell along the forces towards the equilibrium positions and stop to relax if the forces acting on each ion are smaller than 0.005 eV/Å. The Tb@Si(111) monolayer The structure model of the monolayer-RE@Si(111) system (Fig. 5.1) is well established by experimental analysis as well as by theoretical calculations. It is a sandwich struc- ture of a hexagonal RE layer which is placed between the unreconstructed (1 × 1)- Si(111) surface and a vacancy-free, buckled Si honeycomb. The monolayers of all REs from Gd to Er and Y adopt this structure [39–41, 43, 45–48, 53]. The following no- menclature is used in this work: terminating Si honeycomb or substrate termination denotes the top Si bilayer of the substrate (Si3 and Si4) and covering Si honeycomb or cover denotes the Si bilayer above the RE atoms (Si1 and Si2). In the established struc- ture, the three valence electrons from each RE atom complement all incomplete Si 73 5 Monolayer films on silicon(111) (a) vacuum (b) (c)A-type B-type [112‾] 31 Å [112‾] ⊙ [111] ⊙ [11‾0] t h Si1 Si2 Si3 (d) Si4 fixed in position S T4 H3 T1 Figure 5.1: (a) Sketch of the Si(111) slab and (b) – (d) the TbSi2 monolayer structures. Blue circles are Si atoms; Yellow circles are Tb atoms; Red circles are H atoms. The light red rect- angle in (a) encircles the subset of atoms fixed in position during the relaxations. Red lines mark the unit cell. (b) shows the (11̄0) view and (c) the (111) view of the T4-h-B structure. Orange markings indicate the symmetry positions of the Tb atom relative to the substrate termination (T4, H3, T1 and S [39]) and to the cover (t-type and h-type). Small green arrows indicate the buckling direction of the cover (A-type or B-type [39]). (d) shows the trigonal- antiprismatic/ octahedral coordination of the Tb atom by the adjacent Si2 and Si3 layers. octets, i.e. they pair the dangling bonds of Si1, Si2 and Si3. This closed-shell configura- tion is reasonable a priori and holds for all structure models which are derived from the established one by layer translation [39]. Although many publications concerning different structure models exist, a rigorous and complete evaluation of the configur- ation space of reasonable structures has not yet been performed. Since it helps to understand why the established structure is stable in spite of some peculiarities, the configuration space is systematically explored in the following paragraphs. At first, the degrees of freedom to modify the established structure are identified (Fig. 5.1 (b)). The first degree of freedom has already been addressed in many works, e.g. Stauffer et al. [39]. It concerns the placement of the RE layer relatively to the substrate. In reasonable structure models, the RE atom is located at one of the three symmetry points of the unreconstructed Si(111)-(1 × 1) surface [39]: • Above the lower Si atom (egg-box position), denoted T4 • Above the upper Si atom, denoted T1 • Above the hole (hole position), denoted H3 In addition, Stauffer et al. considered a structure where the RE atom substitutes the upper Si atom, denoted S. This structure does not have a closed-shell configuration, of course. The second degree of freedom consists in the placement of the covering 74 [111] [11‾0] 5.1 Structure optimisation Si honeycomb relative to the RE layer: holes over RE atoms (h-type, hole position) or upper Si atoms over RE atoms (t-type, egg-box position). The symmetric placement analogous to T1, i.e. lower Si atoms over RE atoms, is discarded due to the low num- ber of Si atoms coordinating the RE atoms1. The cover placement has apparently not been attended to so far as the literature concerns only h-type structure mod- els. This probably originates in the premise that the monolayer-RE@Si(111) struc- ture is based on the AlB2 structure. However, because the AlB2 phase is only stable in vacancy-populated form, deriving the vacancy-free monolayer structure from the AlB2 structure is not reasonable. Moreover, the CaSi2 structures, which are the stable phases for (hypothetical) vacancy-free RESi2, incorporate h-type and t-type buckled honeycomb-Si (Sec. 4.2.6). The monolayer structure will prove to be more related to the CaSi2 structures than to the AlB2 structure, which underlines the need for invest- igating t-type structure models. Finally, the third – and important – degree of freedom is the direction into which the cover buckles: Conform with the underlying Si sub- strate (A-type) or mirrored/rotated by 180◦ (B-type). According to this nomenclature, the established structure is denoted T4-h-B. The configuration space of the monolayer-Tb@Si(111) system is sampled by each four T4 and H3 structure models (h-type/t-type × A-type/B-type) and each two T1 and S structure models (h-type × A-type/B-type)2. All structure models are optimised with respect to the atomic positions. The resulting Tb chemical potentials are plotted in Fig. 5.2 (a). The T1 and S structures are highly unstable (𝜇Tb > 0). In the other struc- tures, T4 and H3, the covering Si honeycomb prefers the h-type to the t-type position. Interestingly, the favoured buckling direction of the cover depends on the relative positions between the substrate, the Tb layer and the cover in an unobvious manner. The most stable structure model is T4-h-B with 𝜇Tb = −806 meV, which agrees well with the findings of Sanna et al. [53]. In order to quantify how varying the degrees of freedom affects 𝜇Tb, the respective penalties are reckoned back in the order: buckling direction, cover placement and Tb placement. The results are compiled as a decision tree in Fig. 5.2 (b), in which sum- ming the values along a branch returns 𝜇Tb of the respective structure. Flicking the buckling direction (lowest level) puts a penalty of+135 meV to+253 meV on the T4 and H3 structures. The established T4-h-B structure lies amid the range with +189 meV, matching the findings of [53] very well. Since the flicking penalties are much higher than 𝑘B𝑇 at room temperature, the orientation of the cover is definite and experi- mentally observable. Concerning the position of the cover, the T4 and H3 structures are rather h-type than t-type (Δ𝜇Tb = +382 meV and +274 meV, respectively). Con- cerning the position of the Tb layer, the T4 position is more favourable than the H3 position by +232 meV. The values agree very well with the difference of 225 meV in the local minima of the potential energy surface (PES) of Si(111) for the adsorption of a Dy atom [53]. The T1 site has a penalty of +1158 meV, also in agreement with the PES in [53] and similar to the penalty of the substitutional S site (+1164 meV). 1With respect to the findings on bulk RESi2 – x (Chap. 4), the RE atom tends to favour Si coordination numbers of about 8 to 10. However, placing the cover in a T1 manner would contribute only one Si atom to the coordination polyhedron, leading to a maximum Si coordination number of five. For the same reason, the T1 placement relatively to the substrate is strongly discouraged as well. 2The t-type structure models are ignored as T1 and S are per se strongly discouraged. 75 5 Monolayer films on silicon(111) (a) (b) 400 425 −806 meV366 352 358 200 +232 +1164 0 +1158 0 -47 H3 T4 T1 S -200 0 +274 0 +382 -289 -400 -300-353 -424 h t h t -600 -617 -574 +253 0 0 +135 0 +66 -800 A B A B A B -806 0 +221 +189 0 +14 0 A B A B A B Figure 5.2: (a) Tb chemical potentials of the monolayer-Tb@Si(111) structures and (b) de- cision tree for the energy penalties upon structure variations. Green boxes in (b) mark the branch of the stable T4-h-B structure. The tendency of the cover to buckle into a certain direction can be summarised as a simple rule: The buckling direction is favourable if the upper Si atom of the substrate termination (Si3) is not above the lower Si atom of the cover (Si2) (oblique, orange ar- row in Fig. 5.1 (b)). This is the case in the stable T4-h-B structure, as well as in H3-h-A, T4-t-A and H3-t-B. Conversely, the structure is penalised if Si2 is above Si3, as in the case in T4-h-A, H3-h-B, T4-t-B and H3-t-A. The physical reason for this will be illumin- ated by the band structures (Sec. 5.2). The stability relations of the other two degrees of freedom depend on how the Tb atom interacts with the Si honeycombs. In order to analyse this, the Si coordination numbers 𝑁C of the Tb atoms are compiled for all structures in Tab. 5.1, as well as the Tb–Si bond distances and the bond distances and angles of the two Si honeycombs Si1–Si2 and Si3–Si4. The vertical atomic-layer distances, which immediately follow from the bond distances, are plotted in Fig. 5.3. The energetically most stable T4-h-B structure is highlighted in each case. “Clean” de- notes the clean, unreconstructed Si(111) surface and “H-Sat” the H-saturated Si(111) surface (the same slab used for the calculation of 𝜇H-SatH , vide supra). For comparison, the respective parameters of the bulk phases of TbSi2 are inserted as well. Shifting the layers against each other changes the number of Si atoms coordinating the Tb atom. In the hole position (cover: h-type, substrate: H3), the Tb atom is co- ordinated only by Si2/Si3 in a threefold manner. The egg-box position (cover: t-type, substrate: T4) adds the Si1/Si4 atom to the coordination polyhedron, incrementing the coordination number by one. The coordinating Si–Tb bond distances range from 2.92 Å to 3.11 Å in the T4 and H3 structures. They follow the expectable trend that they dilate for higher 𝑁C. In detail, their averages are 2.98 Å, 3.00 Å and 3.03 Å for 𝑁C = 6, 7 and 8, respectively. Depending on the relative positions, the Tb atom modifies the Si honeycombs in the following manner: In the hole position, both the substrate ter- mination and the cover (if buckled into the favourable direction) assume buckling 76 Δμ − Δμ hcp-TbTb Tb (meV) T 4 - T h-A 4 -h T -B 4 - T t-A H4 -t-- B3H h- 3 - A H h-B 3 H -t-A T3 -t-B 1 - T h- 1 - Ah S -- Bh S -- Ah-B 5.1 Structure optimisation 6.0 5.54 5.57 Si1 5.01 5.05 4.97 5.0 4.71 4.74 4.68 4.67 4.89 0.79 0.80 4.78 Si2 0.83 0.67 0.80 0.88 0.65 0.86 0.92 0.88 Tb 4.0 0.79 1.87 1.88 Si3 3.0 1.95 1.89 2.10 2.17 1.90 1.96 2.11 2.05 1.80 Si4 2.0 Si d 4 2.88 2.89 1.0 2.09 2.05 2.03 2.05 1.98 2.06 2.00 1.92 2.10 0.0 0.76 0.63 0.89 0.91 0.90 0.89 0.78 0.81 0.76 0.74 0.81 0.81 0.90 -1.0 Figure 5.3: Vertical positions and distances of the atomic layers in the monolayer-Tb@Si(111) structures. All vertical positions are relative to that of Si3. The numbers between two symbols quantify the vertical distance between the respective layers. The numbers above the up- triangles quantify the vertical distance between Si1 and Si3 layers and, thus, the height of the monolayer. Sid4 marks the atomic layer distance in diamond-Si. Experimental reference for the monolayer-Ho@Si(111) system: [43]. Si1–Tb Si2–Tb Si3–Tb Si4–Tb 𝑁C Si1–Si2 Si3–Si4 Clean 2.32 (113.0◦) H-Sat 2.36 (110.1◦) T4-h-A 3.44 2.96 3.06 2.98 7 2.33 (112.1◦) 2.40 (107.1◦) T4-h-B 3.49 2.93 3.03 2.96 7 2.37 (109.3◦) 2.41 (106.7◦) T4-t-A 2.98 3.07 3.02 2.92 8 2.40 (107.4◦) 2.41 (107.0◦) T4-t-B 3.00 3.11 3.03 2.95 8 2.38 (108.5◦) 2.40 (107.1◦) H3-h-A 3.50 2.93 2.99 3.55 6 2.37 (109.4◦) 2.37 (109.7◦) H3-h-B 3.44 2.97 3.04 3.63 6 2.33 (112.5◦) 2.37 (109.1◦) H -t-A 2.97 3.07 3.00 3.55 7 2.39 (107.9◦) 2.36 (110.2◦3 ) H3-t-B 2.97 3.03 2.94 3.47 7 2.41 (106.5◦) 2.35 (110.6◦) T1-h-A 3.47 2.91 2.88 4.31 4 2.37 (109.5◦) 2.37 (109.0◦) T1-h-B 3.49 2.92 2.89 4.32 4 2.37 (109.3◦) 2.37 (109.1◦) diamond-Si 2.37 (109.5◦) ort-AlB2-TbSi2 3.04 8 2.42, 2.36 (124.7◦, 110.6◦) tr6-CaSi2-TbSi2 2.94 3.06 3.05 7 2.29 (118.9◦) 2.49 (104.9◦) tr3-CaSi2-TbSi2 3.00 3.06 8 2.41 (103.6◦) Table 5.1: Bond distances and angles in the monolayer-Tb@Si(111) structures. The distances are given in Å; The angles are parenthesised. 𝑁C is the number of Si atoms coordinating each Tb atom. 77 z (Å) H-Sat Clean T 4 -h-A T 4 -h-B T 4 -t-A T 4 -t-B H 3 -h-A H 3 -h-B H 3 -t-A H 3 -t-B T 1 -h-A T 1 -h-B E H xo pS .i2 5 Monolayer films on silicon(111) parameters similar to those of diamond-Si (bond length: 2.37 Å, tetrahedron angle: 109.5◦). In the egg-box position, the Si–Si bond length is longer, while the bond angle is more acute. This is expectable as a higher coordination enhances the bonding between the Tb atom and the Si honeycomb at the expense of the Si–Si intra-bonds. If the buckling flicks into the unfavoured direction, the cover flattens out a bit and, thus, has more obtuse bond angles and shorter bond lengths. The H-saturated surface has a bulk-like substrate termination, while that of the clean surface is considerably flattened due to the low electron density of the dangling bond of Si3. The layer distances follow the bond lengths by construction. The extending effect of egg-box Tb atoms on the bonds of adjacent Si honeycombs is clearly visible. Particu- larly in the stable T4-h-B structure, the cover is bulk-like, while the substrate termin- ation is expanded. If the buckling direction is unfavourable, the cover as a whole is elevated compared to the favourable case. In the t-type structures, the height of the monolayer (Si1–Si3) is larger than that of the respective h-type structures by∼0.3 Å. The calculated parameters of the most stable T4-h-B structure equal the results of Sanna et al. [53] up to 0.1 Å, which is not surprising as the theoretical approaches are very similar. Experimental data on the vertical layer distances are not available for the monolayer-Tb@Si(111) system. Therefore, the results are compared to the homologue monolayer-Ho@Si(111) system, for which Spence et al. measured com- plete stratification data set by MEIS [43], plotted in the rightmost column in Fig. 5.3. The calculated expansion of the substrate termination matches the experimental res- ults of Spence et al. very well (0.91 Å vs. 0.90 Å). The distance between the cover and the Tb layer (Si2–Tb) is greater than that of Spence et al. (1.89 Å vs. 1.80 Å). This not a disagreement, tough, if the lanthanoid contraction is taken into account: It makes the Si–RE layer distances contract by −0.2 Å to −0.3 Å per atomic-number increment [46, 53], hence adding up to −0.4 Å to −0.6 Å from Tb to Ho. The cover thickness (Si1–Si2) and the distance between the Tb layer and the substrate (Tb–Si3) are smaller than those of Spence et al. (0.80 Å vs. 0.88 Å and 2.05 Å vs. 2.10 Å, respectively). However, they match the results of another study, a LEED 𝐼-𝑉 analysis on the same system [45] (0.82 Å and 2.03 Å, respectively). In conclusion, the T4-h-B structure model optimised with PBE agrees with experimental evidence within the error bars. As the comparison between the Si-Tb coordination polyhedrons in the (T4 and H3) monolayer structures and those in the vacancy-free bulk TbSi2 structures reveals, that the monolayer is much more related with the CaSi2 structures than with the AlB2 structure. In the bulk, the coordination polyhedrons are cuboids in ort-AlB2- TbSi2, hexagonal prisms in hex-AlB2-TbSi2, trigonal antiprisms (distorted octahedra) with both bases capped in tr3-CaSi2-TbSi2 and trigonal antiprisms with one basis capped in tr6-CaSi2-TbSi2. In the monolayer, the Si2/Si3 coordination polyhedron is a trigonal antiprism (octahedron) if the buckling direction of the cover is favour- able and a regular trigonal prism otherwise. That the Tb atom seems to prefer the trigonal-antiprismatic (octahedral) Si coordination hints at the microscopic explan- ation for the buckling tendency of the cover (vide infra). The distinction between T4 and H3 and, analogously, between t-type and h-type consists in the presence of a cap at the respective basis. Accordingly, the coordination polyhedron in T4-t-A is a doubly capped trigonal antiprism, exactly like that in tr3-CaSi2-TbSi2, and that in 78 5.2 Electronic properties T4-h-B is a trigonal antiprism with capped bottom face (highlighted in Fig. 5.1 (d)), similar to that in tr6-CaSi2-TbSi2. Thus, the T4-h-B structure agrees with the energetic favourability of tr6-CaSi2-TbSi2 over tr3-CaSi2-TbSi2 and hex-AlB2-RESi2 within PBE (Tab. 4.8). The analogy is underlined by an almost perfect lattice matching between the basis of tr6-CaSi2-TbSi2 and the Si(111) surface (3.94 Å vs. 3.867 Å, +1.9 %). As the same holds also for tr3-CaSi2-TbSi2 (3.78 Å, −2.2 %), it may be concluded that layered structures of Tb atoms sandwiched between buckled Si honeycombs in general are unstrained. In contrast to the CaSi2 phases, the basis of hex-AlB2-TbSi2 (4.10 Å) is too large by +6.0 %, so this structure is even farther away from the monolayer structure as already given by the geometry and the thermodynamical instability. This point will be discussed in more detail in Sec. 5.3.1. 5.2 Electronic properties 5.2.1 Band structures The band structure calculations are carried out within the same approach as that in Sec. 4.3. The k-path in the surface Brillouin zone consists only of two line segments: MΓ parallel to the [112̄] direction and ΓK parallel to the [11̄0] direction. A sketch of it may be found in the inset of Fig. 5.4 (d,vi). For the calculation of accurate DOSs, the sampling density of the Brillouin zone is increased to a (40 × 40 × 1) mesh. The stable T4-h-B structure Fig. 5.4 shows the band structures of (a) – (c) the most stable T4-h-B structure and (d) the clean Si(111) surface. The line width corresponds to the portion of certain groups in the PAW projections. The first row (a) shows the portions of the Tb portion, the top four Si atoms Si1 – Si4 and bulk-Si3. The second row (b) shows the orbital-resolved projections of the Tb atom, in which angular momenta with the same z-component are combined (e.g. p𝑥 and p𝑦). The third row (c) shows the orbital-resolved projec- tions of the cover (Si1 and Si2) and the top Si atom of the substrate Si3. The last row (d) concerns the clean Si(111) surface, its panels showing the total portion of Si3, Si4 and bulk-Si, and the orbital-resolved projections of Si3. The line colour is fixed in each panel, but indicates the displayed atom group. The energies are reset by the bulk VBM 𝐸0 and the Fermi level of the slab is inserted as a red horizontal line. By this means, the band structures of T4-h-B and clean Si(111) become comparable. All Si bands in Fig. 5.4 have a characteristic set of three bands in common which stem from the three in-plane sp3-hybrid bonds: a U-shaped band between −12 eV and −8 eV and two bands between −8 eV and 0 eV which are shaped like a lying K and intersect at the Γ-point. The U and K bands of bulk-Si (panels (a,vi) and (d,iii)) are rather concentrated shadows resulting from the slab projection combined with 3The average of the upper two fixed Si layers in the red box in Fig. 5.1 (a). 79 5 Monolayer films on silicon(111) (a) T4-h-B (i) Tb (ii) Si1 (iii) Si2 (iv) Si3 (v) Si4 (vi) bulk-Si 2 0 -2 -4 -6 -8 -10 (b) (i) Tb-s (ii) px+py (iii) pz (iv) dxy+dx2−y2 (v) dxz+dyz (vi) dz2 2 0 -2 -4 (c) (i) (Si1+Si2)-s (ii) px+py (iii) pz (iv) Si3-s (v) px+py (vi) pz 2 0 -2 -4 -6 -8 -10 (d) Clean (i) Si3 (ii) Si4 (iii) bulk-Si (iv) Si3-s (v) px+py (vi) pz 2 0 -2 -4 -6 M y -8 x -10 1Å−1 Γ K M Γ K M Γ K M Γ K M Γ K M Γ K M Γ K Figure 5.4: Band structures of (a) – (c) the T4-h-B structure and (d) the clean Si(111) surface. The line colour is fixed in each panel. The line width corresponds to: (a) the total PAW projec- tions on (i) Tb, (ii) Si1, (iii) Si2, (iv) Si3, (v) Si4 and (vi) bulk Si; (b) the (i) s, (ii) (p𝑥 + p𝑦), (iii) p𝑧, (iv) (d𝑥𝑦 + d𝑥2+𝑦2), (v) (d𝑥𝑧 + d𝑦𝑧) and (vi) d𝑧2 orbitals of the Tb atom; (c) the (i) s, (ii) (p𝑥 + p𝑦) and (iii) p𝑧 orbitals of Si1 and Si2, and the (iv) s, (v) (p𝑥 + p𝑦) and (vi) p𝑧 orbitals of Si3; (d) the total PAW projections on (i) Si3, (ii) Si4 and (iii) bulk Si, and the (iv) s, (v) (p𝑥 + p𝑦) and (vi) p𝑧 orbitals of Si3. All energies are reset by the VBM of the bulk-Si bands. The red horizontal lines indicate the Fermi levels of the slabs. The inset in the bottom right panel shows the surface Brillouin zone. Mind that the energy scale of (b) is different from that of the other rows as there are no Tb states below −4 eV. a low dispersion in the (111) direction. In contrast, the bands from the vertical sp3- hybrid bonds are separated, diffuse, flat lines because they are less dispersive in- plane, but strongly dispersive out-of-plane. The similarity between the bulk-Si bands of the clean surface and T4-h-B proves that the slab is thick enough for the purposes of this work. The difference between the VBM and the CBM of the bulk-Si bands, the substrate band gap, amounts to 0.9 eV, which is smaller than the experimental band 80 E − E0 (eV) E − E0 (eV) E − E0 (eV) E − E0 (eV) 5.2 Electronic properties gap of bulk silicon (1.1 eV), but higher than the DFT band gap of bulk silicon (0.7 eV). Thus, the DFT-related underestimation of the band gap and the gap opening due to the finiteness of the slab partially compensate each other. Concerning the Si3 and Si4 bands of the clean Si(111) surface (panels (d,i) and (d,ii)), the in-plane U and K bands look similar to the bulk-Si bands, which indicates a strong coupling between the substrate termination the rest of the slab. In contrast, the dif- fuse out-of-plane states of Si3 are much weaker than those of Si4. Instead, a surface state of exclusive p𝑧 character appears as a single, isolated flat band just above the VBM of the slab, but completely within the substrate band gap (panel (d,vi)). As ob- served earlier [53], this state originates in the broken vertical sp3 bonds of Si3. That the band is half-filled pins the Fermi level at +0.3 eV above the VBM. The Si3 and Si4 bands of the T4-h-B structure (panels (a,iv) and (a,v)) share several characteristics with those of the clean surface. They are almost unaffected by the presence of the TbSi2 monolayer. The only major difference consist in a splitting of the Si3 surface state into an occupied part at the zone edges and unoccupied part at the centre. The bands of Si1 and Si2 (panels (c,i) and (c,ii)) are similar to those the substrate ter- mination with respect to the in-plane U and K bands. However, they are sharp in the sense that the majority of their projections is concentrated in one copy. The out-of- plane bands are sharp as well and resemble an O between −4 eV and +1 eV (panel (c,iii)). Hence, the cover exclusively hosts surface states and is completely decoupled from the bulk. The Tb states dominate the region a few eV below the Fermi level, similarly to bulk TbSi2. They have strong Tb-d contributions at lower binding ener- gies and hybridise with the dangling p𝑧 states of Si1, Si2 and Si3. The O-shaped double band is the most prominent band. Its lower branch is majorly of Tb-Si2 character with smaller Si1 portions. Its upper branch is isolated from the lower branch and lies com- pletely within the band gap. Since the Fermi level is lifted to +0.7 eV, right below the substrate CBM, the upper O band is almost completely filled. Below the Fermi level, its character is a hybrid of Tb-d𝑥𝑦, Tb-d𝑥2− 𝑦2 (both in-plane), Si1-p𝑧 and (to a lesser extent) Si2-p𝑧. At and above the Fermi level, the character changes to out-of-plane Tb-d𝑧2 mixed with Si3-p𝑧. The upper O band is hole-like as it cuts the Fermi level with a negative curvature. Near the M-points, a second, electron-like band crosses the Fermi level. Its character is a hybrid of Tb-d𝑧2 and p𝑧 of Si1 and Si2. Since all states crossing the Fermi level are confined to the surface and completely decoupled from the rest of the substrate, the TbSi2 film gives rise to a clearly two-dimensional, (semi-)metallic system. Although the DFT eigenenergies are only crude approximations of real excitation energies, the theoretical band structure in Fig. 5.4 compares well with experimental ARPES images of the monolayer-Tb@Si(111) system [52] and the homologous systems monolayer-Gd@Si(111) [47] and monolayer-Dy@Si(111) [49]. In all three works, the O-shaped double band between −5 eV and 0 eV is clearly visible. Its branches are separated by a smaller gap at the K-point and by a larger gap at the M-point so that the hole-like upper branch is isolated from the substrate bands. In this work, the binding energy of the upper O band at the zone boundary is ≈1.4 eV, which is close to the experimental reference. Also the electron-like pockets at the M-points perfectly match the experimental observations. 81 5 Monolayer films on silicon(111) Comparison between the monolayer-Tb@Si(111) structures In order to explain why T4-h-B is the most stable structure model for the monolayer- Tb@Si(111) system, its band structure is compared with those of five other structure models in Fig. 5.5. The plotting mode is the same as that in Fig. 5.4 (a). Compared to T4-h-B (Fig. 5.5 (c)), all other structure models show characteristic changes in the upper O band, which are marked orange. The Fermi energy depends on the relative positions between the Tb layer and the adjacent Si honeycombs in a quite predictable manner. As expected from bonding efficacy, the Fermi energy falls if the Tb atom is in the egg-box position (T4 and t-type structures) and rises if the Tb atom is in the hole position (H3 and h-type structures). Quantitatively, the Fermi levels relative to the bulk VBM and averaged over the two buckling directions are +0.58 eV for the T4-t structures, +0.76 eV for the T4-h and the H3-t structures, and +0.81 eV for the H3-h structures (band structures not shown). Hence, the T4-t structures minimise the Fermi energy. Each variation from the stable T4-h-B structure gives rise to a characteristic band changes near the Fermi level. From T4 to H3, the upper O band closes a gap with the CBs at the Γ-point (feature A in Fig. 5.5 (a) and (b)). Furthermore, as indicated by the sharpening, the in-plane bands of the substrate termination dehybridise from the bulk states and concentrate at the surface, along with higher binding energies. This accords with the contraction, ergo intra-bond strengthening, of the terminating Si honeycomb upon switching from T4 to H3 (vide supra). From h-type to t-type, the upper O band flattens and shows an electron-like furrow in the apex at the Γ-point (feature B in Fig. 5.5 (e) and (f)). Furthermore, the bands of the cover shift to lower binding energies, which accords with the expansion, ergo intra-bond weakening, of the covering Si honeycomb upon switching from h-type to t-type. Flicking the buck- ling direction from favoured to unfavoured entails a gap closing between the upper O band and the CBs along ΓK (feature C in Fig. 5.5 (a), (d) and (f)). On closer inspec- tion, both involved bands are characterised by strong Tb-d contributions beside p𝑧 contributions from Si1, Si2 and Si3. After the gap closing, the steeper band of the new crossing is majorly of in-plane d𝑥𝑦 and d𝑥2− 𝑦2 character and the shallower band of out-of-plane d𝑧2 character. This confirms that the steric impact of the Tb-d states is the driving force behind the tendency of the cover to buckle into a certain direction. All band variations increase the DOS at the Fermi level for different reasons, which finally explains why T4-h-B is the most stable structure, despite not having the lowest Fermi energy. Relation to the bulk TbSi2 structures That the TbSi2 monolayer seems to be electronically decoupled from the Si(111) sub- strate suggests that the surface band structure might be derivable from the bands of bulk TbSi2 by means of a simplified particle-in-box approach. For example, consider a monolayer slice of the hex-AlB2-TbSi2 structure with a thickness of one axial lattice constant 𝑐, which would be a layer of Tb atoms sandwiched between two Si honey- combs. The lateral motion of the electrons may be undisturbed by the presence of the boundary surfaces. The vertical distribution of the wave functions then determines 82 5.2 Electronic properties (i) Tb (ii) Si1 (iii) Si2 (iv) Si3 (v) Si4 (a) 2 A C 0 -2 -4 H3-h-B (b) 2 A 0 -2 -4 H3-h-A (c) 2 0 -2 -4 T4-h-B (d) 2 C 0 -2 -4 T4-h-A (e) 2 B 0 -2 -4 T4-t-A (f) 2 B C 0 -2 M y -4 x T4-t-B 1Å −1 Γ K M Γ K M Γ K M Γ K M Γ K M Γ K Figure 5.5: Band structures of the monolayer-Tb@Si(111) structures. (a) H3-h-B, (b) H3-h-A, (c) T4-h-B, (d) T4-h-A, (e) T4-t-A, (f) T4-t-B. The line width corresponds to the PAW projections of (i) Tb, (ii) Si1, (iii) Si2, (iv) Si3 and (v) Si4. All energies are reset by the VBM of the bulk-Si bands. The red horizontal lines indicate the Fermi levels. Orange markings highlight changes in the upper O band relatively to the T4-h-B structure. The inset in the bottom right panel shows the surface Brillouin zone. 83 E − E0 (eV) E − E0 (eV) E − E0 (eV) E − E0 (eV) E − E0 (eV) E − E0 (eV) 5 Monolayer films on silicon(111) how the bulk bands have to be projected onto the surface Brillouin zone of the slice. A reasonable ansatz is a first-order, fixed-end standing wave of wavelength 2𝑐 and corresponding wavenumber π𝑐 . Such a standing wave can be constructed from the solutions for the 3D crystal by superposing two countercurrent waves with equal lat- eral and opposite axial wavenumbers π π𝑐 and − 𝑐 . Since these lie in the opposite bases of the prismatic 3D Brillouin zone, the 2D band structure of the slice can be estim- ated by calculating the bands of the 3D crystal along paths in one of these (identical) bases. In the case of hexagonal unit cells, the surface k-path M–Γ–K corresponds to the k-path L–A–H in the bulk Brillouin zone. Out of the bulk TbSi2 structures in Sec. 4.2, three have a hexagonal basis. While the band structure of hex-AlB2-TbSi2, has already been analysed (Fig. 4.10), those of tr3- CaSi2-TbSi2 and tr6-CaSi2-TbSi2 are calculated from the PBE-optimised unit cells in the same manner. Fig. 5.6 shows the bands of all three structures along (a) the main path M–Γ–K|Γ–A and (b) the path for the slice projection L–A–H. In the bands of (iii) tr6-CaSi2-TbSi2, the blue colour refers only to the buckled Si honeycomb, while the contributions from the other, almost flat Si honeycomb are blended out. The Tb con- tributions are divided by two so as to retain the Tb/Si ratio. In order to facilitate the comparison, the slab band structures of (c) T4-h-B (related with tr6-CaSi2-TbSi2) and (d) T4-t-A (related with tr3-CaSi2-TbSi2) are shown as well. The L–A–H bands of hex-AlB2-TbSi2 (Fig. 5.6 (b,i)) have some features with the surface bands of T4-h-B in common (Fig. 5.6 (c)). The in-plane U and K bands of the Si atoms (i) hex-AlB2 (ii) tr3-CaSi2 (iii) tr6-CaSi2 (c) T4-h-B (a) 0 -2 -4 -6 -8 -10 -12 M Γ K⏐Γ A M Γ K⏐Γ A M Γ K⏐Γ A (d) T4-t-A (b) 0 -2 -4 -6 -8 -10 -12 L A H L A H L A H M Γ K Figure 5.6: Band folding of the bulk TbSi2 structures: (i) ort-AlB2-TbSi2, (ii) tr3-CaSi2-TbSi2, and (iii) tr6-CaSi2-TbSi2. The plotting mode is analogous to Fig. 4.9 (yellow→ Tb, blue→ Si). K-paths: (a) M–Γ–K|Γ–A; (b) L–A–H. The two Si atoms of the almost flat honeycomb in tr6- CaSi2-TbSi2 are suppressed. The bulk bands are reset by the Fermi level. The band structures of (c) T4-h-B and (d) T4-t-A are plotted in an analogous manner, considering only Tb, Si1 and Si2 (summed PAW projections→ line width, Tb/Si portion→ line colour). The slab bands are reset by the VBM of the bulk-Si bands and red horizontal lines indicate the Fermi levels. 84 E − E0 (eV) E − E0 (eV) 5.2 Electronic properties qualitatively match. Also the O-shaped double band of Tb-d and Si-p𝑧 character is clearly visible. However, the band widths of hex-AlB2-TbSi2 differ considerably from those of T4-h-B. The Si valence bands range over ≈8 eV (too small by ≈2 eV) and the O-shaped double band ranges over ≈7 eV (too large by ≈2 eV). The L–A–H bands of tr3-CaSi2-TbSi2 (Fig. 5.6 (b,ii)) resemble the surface bands of T4-h-B much closer. Not only the band widths match, but also the shapes of the U and K bands. In particular, the latter show a gap at ≈−5 eV due to hybridisation with the lower O band, which missing in the bands of hex-AlB2-TbSi2. The upper O band is almost isolated, apart from gap closings at the Γ-point and along ΓK. If these were relaxed, the upper O band would resemble that in T4-t-A quite closely, including the furrow at the Γ-point. The band structure of T4-h-B is derivable from the bulk bands without changing the band order. Interestingly, also the electron pocket at the M-point is present, an important feature missing in the bands of hex-AlB2-TbSi2. Concerning the band structure of tr6-CaSi2-TbSi2, it has to be taken into account that the cell consists of two Tb layers, so one unit cell would be a quasi-double slice. Con- sequently, the surface band structure of a one-Tb-layer slice can be found in the middle of the Brillouin zone, not at the base. The focus is thus on the M–Γ–K path (Fig. 5.6 (a,iii)) instead of the L–A–H path. The bands are difficult to read because the Tb atoms hybridise with two types of Si honeycombs: a buckled one in egg-box po- sition and a rather flat one in hole position. Although the plotting mode suppresses the contributions of the flat Si honeycomb, the respective Tb-Si hybrid states are still visible as yellow lines. If these are ignored, i.e. only blue and greenish bands are considered, several features become manifest: the U and K bands of the Si atoms having a gap and the correct shape, the O-shaped double band whose upper branch is deformed and dips into the Si valence bands, and the electron pocket at the M- point. However, the projected bands of tr6-CaSi2-TbSi2 do not agree better with the monolayers than the bands of tr3-CaSi2-TbSi2 do. 5.2.2 Charge transfer The charge transfer in the T4-h-B structure is calculated analogously to pp. 68f and shown in Fig. 5.7. Panel (a) shows the averages over Si(111) planes, where red por- tions indicate electron gain and blue portions electron loss. Panel (b) shows tomo- graphies of across three (01̄1)-planes: the cuts through Si1, Si2 and the Tb atom. Fig. 5.7 (a) reveals that the plane-averaged charge transfer within the monolayer is much smaller than that within the substrate upon formation of the covalent bonds. The overall transfer into the covering Si honeycomb amounts to 0.01 electrons per surface unit cell, if the boundary is set to the middle plane between the Tb layer and the Si2 layer. The tomographies give insight into the three-dimensional redistribu- tion. Most of the contrast is due to the expected charge accumulation in the Si–Si intra-bonds. The Si atoms adjacent to the Tb atom (Si2 and Si3) show a pronounced charge bulb at their dangling p𝑧 orbitals, whereas the charge above Si1 is almost like that of the atomic state, as indicated by the faint colour. This leads to the conclusion that the Tb atom transfers charge only to the adjacent Si atoms, whereas all other Si 85 5 Monolayer films on silicon(111) (a) (b) Si1-cut Si2-cut Tb-cut Si1 Si2 Tb Si3 Si4 Si Si56 Si7 Si8 Figure 5.7: Charge transfer in T4-h-B. Red→ electron gain; Blue→ electron loss. (a) average over (111)-planes; (b) tomographies across (01̄1)-planes (parallel to a face of the surface unit cell in Fig. 5.1 (d), which is titled away from the plane of projection by 30◦). Red ellipses mark the charge bulbs of the Si atoms near the surface. atoms do not accept additional charge. Since the charge bulbs of Si2 and Si3 clearly penetrate the Tb layer, the plane average suffers from false charge assignments, ana- logous to the problems discussed for the bulk TbSi2 – x structures (pp. 68f). A more sophisticated method would be worth applying on this system in order to correctly quantity the charge transfer. 5.2.3 Fermi surfaces The analysis of the electronic properties concludes with the Fermi surfaces, which correspond to the k-resolved DOS at the Fermi energy 𝐸F. They can thus be calculated from the DFT eigenvalues at a dense s∑︁et of k-points with the following formula:𝑁Band 𝐷F(𝒌) = 𝛿(𝐸F − 𝜀𝑛(𝒌)) (5.45) 𝑛=1 The delta function is approximated as a Lorentzian with width of 𝛾 = 0.1 eV. The Fermi surfaces of different structure models may be found in Fig. 5.8. Looking very different from each other, they are a fingerprint for the respective structure. All Fermi surfaces show the hole-pocket of the upper O band at the Γ-point and the elec- tron pockets of the conduction band at the M-points. They have different shapes and sizes, though, which is linked to the stability of the structures. The most stable T4-h-B structure (Fig. 5.8 (a)) has the smallest pockets in accordance with the tendency to reduce the DOS at the Fermi level. Experimentally measured Fermi surfaces exist for the monolayer-Tb@Si(111) system [52] as well as for the monolayer-Gd@Si(111) [47] and the monolayer-Dy@Si(111) [49] systems. Only the T4-h-B structure reproduces the experimental reference almost 86 5.2 Electronic properties (a) T4-h-B (b) H3-h-A (c) T4-t-A K 1 0 Γ M -1 (d) T4-h-A (e) H3-h-B (f) T4-t-B 1 0 -1 -1 0 1 -1 0 1 -1 0 1 k[112̄] (Å−1) k[112̄] (Å−1) k −1[112̄] (Å ) Figure 5.8: Fermi surfaces of the monolayer-Tb@Si(111) structures: (a) T4-h-B, (b) H3-h-A, (c) T4-t-A, (d) T4-h-A, (e) H3-h-B, (f) T4-t-B. The axes point into the crystallographic directions [112̄] (x-axis) and [1̄10] (y-axis). The Brillouin zone is marked white in (a). The colouring corresponds to the colour bar on the right (a.u.). (a) EF + 50 meV (b) EF (c) EF − 100 meV (d) EF − 200 meV K 1 0 Γ M -1 -1 0 1 -1 0 1 -1 0 1 -1 0 1 k −1[112̄] (Å ) k[112̄] (Å−1) k[112̄] (Å−1) k[112̄] (Å−1) Figure 5.9: Energy surfaces of T4-h-B at different isoenergies: (a) 𝐸 = 𝐸F + 50 meV, (b) 𝐸 = 𝐸F, (c) 𝐸 = 𝐸F − 100 meV and (d) 𝐸 = 𝐸F − 200 meV. perfectly. In particular the experimental Fermi surface of the monolayer-Dy@Si(111) system by Wanke et al. [49] looks like a copy of Fig. 5.8 (a), apart from the colouring. Beside the Fermi surface, Wanke et al. measured also a series of energy surfaces at different binding energies (0 meV, 100 meV and 200 meV). These measurements are simulated by inserting corresponding isoenergies in Eq. (5.45). The result (Fig. 5.9) shows that the electron pockets shrink to faint lines while the hole pocket grows and acuminates for increasing binding energies, exactly as in the respective images of Wanke et al. The theoretical energy surface at −50 meV binding energy (Fig. 5.9 (a)) worsens the agreement with the experiment. In detail, the ratio between the length of the electron pockets and the width of the hole pocket is greater than 1 in Fig. 5.9 (a), whereas it is nearly 1 in Fig. 5.9 (b) and in [49]. This leads to the conclusion that the Fermi levels of the simulated slab and the experiment are at the same position. 87 k (Å−1[1̄10] ) k −1 −1[1̄10] (Å ) k[1̄10] (Å ) 0 max 0 max 5 Monolayer films on silicon(111) 5.3 Discussion and summary By means of a systematic inspection of the configuration space of reasonable struc- ture models, the established T4-h-B model was found to be the most stable. Further- more, all its peculiarities were explained. The inverse buckling direction of the cov- ering Si honeycomb is due to a steric effect from the Tb-d orbitals, which clearly favour an octahedral Si coordination irrespectively of caps (>100 meV per surface unit cell). The preference of the Tb atoms to occupy the egg-carton positions of the substrate termination (T4) and the hole positions of the cover (h-type) is based on two countercurrent effects. On the one hand, the egg-carton position enhances the bond- ing between the Tb layer and the respective Si honeycomb, which lowers the Fermi level. On the other hand, this weakens the intra-bonding of the Si honeycomb. In the case of the substrate termination, the second effect is compensated by the substrate in that the weakened in-plane states hybridise with the bulk. On the contrary, the va- cuum above the cover negates such a compensation, so the t-type structures cannot profit from the Fermi level drop. The band structures revealed that the Si atoms primarily bind to each other and the Tb atoms bind via their d-orbitals to the dangling bonds the Si honeycombs. This is remarkably analogous to the bulk TbSi2 – x structures and accords with the ZKB concept. Moreover, the ZKB concept predicts a perfect charge balance in the mono- layer structure as all valence electrons of the trivalent Tb atoms are compensated by the incomplete shells of Si1, Si2 and Si3. The closed-shell configuration has already been noted by Stauffer et al. [39]. However, the strength of the ZKB concept in this case is that it explains in a comprehensible way why the monolayer lacks vacan- cies, contrarily to bulk TbSi2. The Si3 plane sharply separates the electronic states into two entities: All states below Si3 (including the in-plane sp3-hybrid bonds of Si3) are three-dimensional substrate states and all states above (including the vertical dangling bond of Si3) are vertically confined to the thin RESi2 monolayer. Therefore, the monolayer is a clearly two-dimensional (semi-)metal. The hole pocket at the Γ- point of the Fermi surface is due to hybrid states of out-of-plane Tb-d and Si3-p𝑧 The electron pockets at the M-points are due to hybrid states of out-of-plane Tb-d, Si1-p𝑧 and Si2-p𝑧. The band structures and the Fermi surfaces very well reproduce experi- mental ARPES images. The structure analysis showed that the TbSi2 monolayer is much more related with the CaSi2 phases than with the AlB2 phase concerning several morphological prop- erties. Not only does the monolayer contain buckled honeycomb-Si instead of flat honeycomb-Si, but also the coordination polyhedrons are very similar to those of the CaSi2 phases. The lattice matching underlines the relatedness as the bases of the CaSi2 structures match the Si(111) surface very well, while that of hex-AlB2-TbSi2 is con- siderably too large. In accordance with the structural relatedness, the surface band structure of T4-h-B can be derived surprisingly well from the bulk band structures of CaSi2-RESi2 by means of a simple particle-in-a-box consideration. The respective projection of the hex-AlB2-TbSi2 band structure disagrees with the former. The struc- tural and electronic properties agree very well with experimental data. Therefore, the approach can be considered safe for the nanowire-Tb@Si(557) system, which is 88 5.3 Discussion and summary the matter of the next chapter. Though, before treating these, the argument of strain- induced growth will be critically revised in the following section. 5.3.1 Strain induced growth? Many previous works argued on the mechanisms which determine the structures of the monolayer- and multilayer-RE@Si(111) systems. The present findings that the monolayer-Tb@Si(111) structure resembles the CaSi2-TbSi2 structures much closer than the AlB2-TbSi2 structure is a novel point, which casts a different light on the previous, heuristic explanations. In order to appraise their validity, the main points of these explanations are summarised, in particular those stating that a strain release of the Si honeycomb leads to the morphological details. Epitaxists often consider the lattice matching between the substrate and the layer to be grown so as to determine whether epitaxy is possible. If the lattice constants of the respective planes match, the layer can reach a high thickness without structural defects, and, otherwise, defect-free growth is limited. For the RE@Si(111) systems, the following points are taken into account: 1) The surface lattice constant of the unreconstructed, trigonal Si(111) surface is 𝑎Lit.Si(111) = 3.841 Å [17, p. 1065]. 2) The basis of the AlB2 phase of RESi2 – b (RE ∈ [Y,Gd − Lu]) matches the Si(111) surface by the lattice constants, e.g. TbSi2 – b: 𝑎Lit.AlB -RESi = 3.847 Å [33].2 2 – b 3) Hypothetical flat honeycomb-Si would have a lattice constant of 4.074 Å if it ad- opted the Si–Si bond distance of diamond-Si. From 1) and 2), it follows that epitaxial growth of thick hex-AlB2-RESi2 – b layers on Si(111) is possible. While the growth of thick RESi2 – b layers can be embedded into simple geometrical considerations, major misunderstanding concerns the formation of the monolayer and thin-layer structures. Not few publications claim that the monolayer-RE@Si(111) system is a one-unit-cell slice of the bulk hex-AlB2-RESi2 – b structure, which is at- tached to the Si(111) surface. Since the covering Si honeycomb lacks vacancies, they further argue that it buckles so as to release compressive strain exerted by the Si(111) surface, following 1) and 3). More venturesome explications concern the hex-AlB2- RESi2 – b structure, which, inserted between the monolayer and the substrate, leads to multilayer-RE@Si(111) structures. Its Si layers supposedly contain vacant Si sites because they are squeezed into the surface lattice constant. Instead of buckling, they release strain by expelling every sixth Si atom so that the remaining Si atoms are al- lowed to increase their bond distances. These argumentations, to read e.g. in [38, 39] and from then inherited by subsequent publications, are problematic since they pos- tulate that the AlB2 prototype is an “ideal” structure and that buckling and vacancies are reconstructions or defects due to lattice mismatch. However, following this work so far, it is already clear that both the buckling and the vacancies are of chemical 89 5 Monolayer films on silicon(111) nature and that mechanical strain is not the reason for their occurrence. The ma- jor mistake originates in the ignorance that AlB2-RESi2 incorporating flat, vacancy- free honeycomb-Si is per se highly unstable. In order to rectify the physical picture about the RE@Si(111) systems, the findings on the monolayer-RE@Si(111) and the bulk RESi2 – x structures are brought together into the following conclusive theory. The trivalent RE atoms in the monolayer-RE@Si(111) system each formally donate one electron to the dangling bonds of the substrate termination and the other two to a buckled, covering Si honeycomb. This structure is very convenient because, firstly, the Si shells are all closed and, secondly, the buckled Si honeycomb is similar to those composing the substrate. The reason for the buckling (sp3 hybridisation) of the cover consists in the nature of Si to strongly avoid sp2 hybridisation and in a better charge storage. This becomes manifest also in the high stability of CaSi2-RESi2 against AlB2-RESi2 (constrained RESi2 stoichiometry). The multilayer structures can be de- rived from the monolayer in an inductive manner. As further investigated in [53], the (0001)-slices of bulk hex-AlB2-RESi2 – b, comprising one RE layer and a vacancy- populated Si honeycomb, represent bulk building elements. Inserting these between the substrate and the lowermost RE layer of the precedent structure increases the thickness of the RESi2 – x film by one layer. The charge balance shows that the elec- trons donated by one RE layer are exactly compensated by the substrate and the vacancy-free, buckled covering Si honeycomb. Those of the other RE layers are com- pensated by the other, vacancy-populated, flat Si honeycombs to an extent of 8/9. Since the lateral lattice constants of the bulk building element and the Si(111) sub- strate match, the stacking procedure can be repeated several times without running the risk of strain-induced stacking faults. The reason for the presence of Si vacan- cies is the same as that discussed for the bulk RESi2 – x structures, i.e. 1/6 vacancies increase the electron capacity of the Si honeycomb from 2 to 5/3 per RE atom. In summary, the monolayer- and multiplayer-RE@Si(111) systems can be general- ised to a stack of building elements. The top building element (RE layer + vacancy- free, buckled Si honeycomb) is always present and equals the monolayer if being the only one. The bulk building elements (RE layer + vacancy-populated, flat Si hon- eycomb) are facultative and increase the thickness of the RESi2 – x film, eventually rendering it bulk-like. The morphologies of both building elements are solely due to chemistry and independently match the Si(111) surface. Consequently, the statement that surface-induced strain causes the buckling of the covering Si honeycomb or the presence of Si vacancies is incorrect. 90 6 Nanowires on silicon(557) After the detailed analysis of the monolayer-Tb@Si(111) system, the next step consists in the investigation of the nanowire-Tb@Si(557) system. Derived from the Si(111) sur- face by inclination towards the (001) plane, Si(ℎℎ𝑘) surfaces where ℎ < 𝑘 provide Si(111)-like terraces which are separated by single Si(001)-like steps. The terrace width is controlled by the inclination angle in that smaller angles leads to wider terraces. If a Si(ℎℎ𝑘) substrate is prepared with RE at a coverage of approximately one monolayer and successively annealed at a certain temperature, the monolayer- RESi2 structure grows on the terraces, but is interrupted by the steps. In this manner, strips of monolayer-RESi2 form by self-organisation and resemble nanowires from a structural point of view. Experimental investigations of the structural and electronic properties of nanowire-RE@Si(ℎℎ𝑘) systems can be found for Tb [70], Dy [68, 69] and Er [68]. One of the key findings is that the electronic structure of densely covered terraces is similar to that of the monolayer-RE@Si(111) system. In particular, it is of two-dimensional character, in spite of the stripy morphology of the film. This chapter covers the theoretical part of a joint-experimental-theoretical work on the nanowire-Tb@Si(ℎℎ𝑘) system which goes a step further [P8]. It investigates how the electronic dimensionality of the system alters with the Tb coverage, subjected to the idea that lower coverages thin out the occupied terraces and separate the TbSi2 strips by semiconducting barriers. While the above stepped film, the system of dense nanowires, is a two-dimensional electronic system, sparsely occupied terraces, the sparse nanowires, are expected to reduce the dimensionality of the individual strips so that they resemble a quasi-one-dimensional electronic system. Indeed, it turns out that separating the TbSi2 strips by empty terraces induces anisotropy into the ef- fective masses at the Fermi level, in accordance with the Heisenberg principle. In addition, a new state arises at the edge of TbSi2 strips neighbouring an empty ter- race. This edge state is of pure one-dimensional character and completely decoupled from the rest of the strip. The corresponding features in ARPES images are more pronounced, the narrower and thinner the nanowires are, as achieved by higher tilt angles and/or lower Tb coverages. Since the agreement between the experimental and theoretical findings is excellent, [P8] is a firm documentation of a fascinating and unique dimensional crossover. In addition to the results already published in [P8], this chapter provides a deeper insight into the methodology and addresses the stability of the nanowire structures under certain thermodynamic conditions. Furthermore, it is proven that the band structures and Fermi surfaces of the nanowires can be derived from those of the monolayer-Tb@Si(111) system by means of a simple particle-in-a-box model. This highlights the differences between dense and sparse nanowires and illuminates the origin of the one-dimensional edge state. 91 6 Nanowires on silicon(557) 6.1 Structure optimisation Methodological details The Si(557) substrate is modelled by tilted (5×1) (sketched in Fig. 6.1 (a)) and doubled (10× 1) surface supercells of the Si(111) slab (Fig. 5.1 (a)). The slabs provide terraces having a widths of five projected surface lattice constants (“5a”), which are separated by Si(001)-like single steps. Similarly to the Si(111) slab, the Si(557) slab is twelve Si layers thick, out of which the lowest four are fixed in position to simulate the bulk of the substrate. The dangling bonds on the bottom side are saturated by H atoms, which assume their equilibrium positions for a symmetrically saturated slab (exactly that depicted in Fig. 6.1 (a)) and are then fixed in position for all further calculations. The periodic replica in the (557)-direction (the z-direction) are separated by 27 Å of vacuum, which reduces to 24 Å when the slab bears the nanowires. Most computational parameters are equal to those of the monolayer-Tb@Si(111) cal- culations (p. 73). The Brillouin-zone sampling is adapted to the (5× 1) supercell by a Γ-centred (3 × 15 × 1) Monkhorst-Pack mesh so that 15 k-points are placed along the wires and 3 k-points across. For the calculations of the (10 × 1) surfaces, the mesh is adapted to (2 × 20 × 1). A convergence test proves that increasing the sampling density alters the total energy by ≈1 meV per atom. The dense nanowires At first, the dense nanowires are analysed. For this purpose, the (5×1) slab is covered with monolayer-TbSi2 so that each terrace bears a 5𝑎wide strip. Since it is a priori not clear that the strips adopt the T4-h-B structure as the monolayer does, two structure models are set up: the one derived from T4-h-B (Fig. 6.1 (b)) and the other derived from H3-h-A (Fig. 6.1 (c)). They are named after the corresponding monolayer struc- ture, prefixed by the width, “5a” in this case. The H chemical potential is recalibrated with the symmetrically H-saturated slab according to Eq. (5.44). The Tb chemical potential is calculated with Eq. (5.43) and determines the stability of the structure during the annealing stage, i.e. when the Tb coverage and the temper- ature are constant. The 5a-T4-h-B structure yields 𝜇Tb = −327 meV and the 5a-H3-h-A structure 𝜇Tb = −53 meV, so the former is more stable than the latter. The 𝜇Tb dif- ference between the nanowire structure models (274 meV) is a bit higher than that between the respective monolayer structure models (232 meV). Concerning the geo- metric details of the optimised structure models, the mid of the TbSi2 strip ranging from Tb2 to Tb4, resembles the respective monolayer structure (Tab. 5.1). In particu- lar, the Si–Si bond lengths of the Si honeycombs adjacent to the Tb layer are similar to that of bulk Si and stretched if the Tb atom is in the egg-box position. Also the bond distances between the Tb atoms and the coordinating Si atoms accord with those of the respective monolayer structures, amounting to ⟨𝑑Tb{2,3,4}–Si⟩ ≈ 3.0 Å on average. In contrast, the edges of the TbSi2 strips near Tb1 and Tb5 are considerably distorted. The covering Si honeycomb at the left edge near Tb5 is flattened and, interestingly, it adopts bond angles similar to the orthorhombically distorted Si honeycomb of the 92 6.1 Structure optimisation (a) Si(557) slab (d) 5a-T4-h-B (Sparse NW) [0 ⊙ [11‾0]01] [771‾0‾] [112‾] 5 4 3 2 1 [771‾0‾] ⊙ [557] fixed in position (e) (b) 5a-T4-h-B (Dense NW) 40 D 5 4 20 en3 s2 e1 NW Sparse NW 0 -20 Monol (c) 5a-H -h-A -40 aye 3 r -60 5 4 3 2 1 -80 AlB2-TbSi2−x -2.5 -2.0 -1.5 -1.0 μ − μ hcp-TbTb Tb (eV) Figure 6.1: (a) Sketch of the Si(557) slab and (b) – (d) the optimised TbSi2 nanowire struc- tures. Blue circles are Si atoms; Yellow circles are Tb atoms; Red circles are H atoms. The light red polygon in (a) encircles the subset of atoms fixed in position during the relaxations. Red lines mark the unit cells. Orange M-stars in (b) – (d) indicate mutually threefold coordin- ated Si atoms near Tb. The green boxes and orange circles in (b) and (c) highlight the con- nection between the covering Si honeycomb and the step. The orange circle in (d) marks the stabilising Si atom. (e) shows the grand-potential-chemical-potential phase diagram of the 5a-T4-h-B nanowire@Si(557) systems (solid, red → dense, orange → sparse). The T4-h-B monolayer@Si(111) system is inserted as well (dashed, red) as it will be used for explaining the details of the growth mechanism (vide infra). The dotted blue line sketches a hypothetical phase with lower Tb density. All 𝛾(𝜇Tb) are reset to that of the clean Si(111) slab. stoichiometric bulk ort-AlB2-TbSi2 structure, i.e. one angle is smaller and the other two are larger than 120◦ (Tab. 4.3). In both structure models, the Tb5 edge is pinched in that the coordinating Tb5–Si bond distances are shorter than those in the mid of the strip (⟨𝑑Tb5–Si⟩ = 2.9 Å). In the 5a-H3-h-A structure, the other edge near Tb1 is pinched as well (⟨𝑑Tb1–Si⟩ = 2.9 Å), while in the 5a-T4-h-B structure, it is similar to the mid of the strip (⟨𝑑Tb1–Si⟩ = 3.0 Å). Whether the Tb1 edge is pinched or not is clearly linked to how the covering Si honeycomb is connected to the step (green boxes in Fig. 6.1 93 [557] [111] γ − γ c-Si(111) (meV/Å2) [11‾0] 6 Nanowires on silicon(557) (b) and (c)). In the 5a-T4-h-B structure, the Tb atom lies in the same (11̄0)-plane as the Si(001)-like Si bridge of the step. Hence, the first Si atom of the covering Si honey- comb lies in the adjacent (11̄0)-plane and can bind to the twofold coordinated tip of the step (orange circle) in such a way that the tetrahedron of the latter completes. In particular, the beginning of the cover and the tip of the step remain distinct Si atoms. On the other hand, in the 5a-H3-h-A structure, the Tb and Si planes of the cover are interchanged. Therefore, the tip of the step has to be the first atom of the covering Si honeycomb at the same time, so the Tb1 edge is pinched. The different Tb1 edges are the reason why the 𝜇Tb difference between the 5a-H3-h-A and the 5a-T4-h-B struc- ture models is even larger than that between the respective monolayer structures. In other words, the geometry of the stepped Si(557) surface per se favours the T4 position over the H3 position. The balance of the valence electrons is facilitated by orange M-stars marking mutu- ally threefold coordinated Si atoms in the structure models in Fig. 6.1. In 5a-T4-h-B, all 15 valence electrons from the 5 Tb atoms are absorbed by 15 Si acceptors, so all Si shells are closed. Although the same holds for 5a-H3-h-A, there is an imbalance as the cover has 11 acceptors and the substrate termination 4, differently from the 2:1 ratio in the monolayer structures and the 5a-T4-h-B nanowires. The sparse nanowires The next step consists in the investigation of the sparse nanowires, which are mod- elled on the doubled (10 × 1) slab providing two terraces per unit cell. Every second terrace is occupied by a nanowire strip, while the others are saturated by H atoms so as to remove artificial, metallic bands from the unreconstructed Si(111) planes. The following paragraphs deal only with nanowires of the 5a-T4-h-B structure. After emptying a terrace, a dangling bond remains at the covering Si honeycomb of the nanowire on the right. Such a structure would be unstable against hcp-Tb se- gregation (𝜇Tb = +100 meV) if not treated with an appropriate saturation. Therefore, a stabilising Si atom is added t the structure so that the cover can smoothly link to the Si bridge of the adjacent surface step. This lowers the Tb chemical potential to −135 meV. The relaxed structure model of the sparse nanowires is depicted in Fig. 6.1 (d), where the stabilising Si atom is marked by an orange circle. The averaged co- ordinating Tb–Si bond distances are equal to those in the dense nanowires for Tb1 to Tb4 (⟨𝑑Tb{1,2,3,4}–Si⟩ ≈ 3.0 Å). The coordination polyhedron of Tb5 has an average distance of 3.0 Å as well, so it is more voluminous than that of the dense nanowires. Hence, the Tb5-edge pinching of the dense nanowires is absent in the sparse nano- wires, which is advantageous. The balance of the valence electrons shows that the sparse nanowires have 5 Si acceptors in the substrate termination, similarly to the dense ones. However, the cover has 12 Si acceptors and thus two in excess. The phase diagram Obviously, given a fixed number of Tb atoms, the dense arrangement of nanowires is more advantageous than the sparse one as the Tb chemical potential of the former 94 6.1 Structure optimisation is smaller than that of the latter (−327 meV vs. −135 meV). However, the thermo- dynamic boundary condition of a constant Tb coverage is an idealised situation in which the mobility of the Tb atoms on the substrate is infinitely high during the an- nealing stage. Therefore, also the converse situation is considered, in which the Tb atoms are in equilibrium with an infinite reservoir which fixes 𝜇Tb. This is described by a thermodynamical potential Ω(𝑉,𝑇, 𝜇•) which results from the Legendre trans- formation of the free energy with respect to the nu∑︁mber of atoms of all species •. Ω(𝑉,𝑇, 𝜇•) = 𝐹 (𝑉,𝑇, 𝑁•) − 𝜇•𝑁• (6.46)• Ω(𝑉,𝑇, 𝜇•) is also referred to as the grand potential. Dividing by𝑉 makes this quant- ity intensive and returns the mechanical tension 𝑝 according to Euler’s theorem. 𝑝(𝑇, 𝜇•) 1 = Ω(𝑉,𝑇, 𝜇•) (6.47) 𝑉 The potential has to be adapted to the reduced dimensionality of surfaces by repla- cing the volume𝑉 with the surface area 𝐴 and changing the symbol 𝑝 to 𝛾. Inserting the bulk Si chemical potential and neglecting the lattice dynamics returns the follow- ing ready-to-use f(orm: ( ) ≈ 1 ) 𝛾 𝑇, 𝜇 𝐸cell − 𝜇d-Si• DFT Si 𝑁Si − bot top top𝜇H 𝑁 bot c-Si(111) H − 𝜇H 𝑁H − 𝜇Tb𝑁Tb − 𝛾 (6.48)𝐴 The discrimination between the H atoms of the bottom and the top side of the slab is necessary because they play a different role. Setting 𝜇bot to 𝜇H-satH H (vide supra) guaran- tees the cancellation of any effects from the bottom side. The H atoms of the top side, only present in the sparse nanowires, have a more physical meaning as they interact with the TbSi2 strips. If their chemical potential was set to 𝜇H-satH , i.e. if the H avail- ability was so high that the empty Si terraces completely saturate, the slab carrying the sparse nanowires would be highly preferable due to the H-saturated terraces, ir- respectively of the (in-)stability of the TbSi2 strips. Though, this situation contradicts the experiment because much effort is made to remove as much H from the vacuum chamber as possible. Thus, the H atoms of the top side deserve a separate H chem- ical potential top𝜇H . As they sit solely on the vertical dangling bonds of the Si(111)-like terraces, top𝜇H is gauged in such a way that the top side of the H-saturated Si(557) slab is in chemical equilibrium with the clean Si(111) surface. This approximates that the availability of H is so small that the empty terraces remain clean. Of course, the grand potential 𝛾 has to be referred to a reference system, which is chosen to be the clean Si(111) surface (𝛾c-Si(111)) or, equivalently, the unsaturated Si(557) surface, ascertained by the gauge of top𝜇H . Since the Tb chemical potential 𝜇Tb is unknown, it has to be treated as a variable. 𝛾(𝜇Tb) then becomes a linear func- tion, which can be plotted into phase diagrams. In doing so, the abscissa 𝜇Tb tunes the experimental situation: higher and lower values for 𝜇Tb correspond to Tb-rich and Tb-poor conditions, respectively. The slope of 𝛾(𝜇Tb) equals the areal Tb dens- ity 𝑁Tb/𝐴. Consequently, phases with high Tb densities have steeper lines and are principally favoured for Tb-rich conditions, and vice versa. By means of the phase 95 6 Nanowires on silicon(557) diagrams, structure models with different Tb densities become comparable. The sta- bility at a certain 𝜇Tb given by those which minimise 𝛾(𝜇 1Tb) . Fig. 6.1 (e) shows the phase diagram of the sparse nanowires and the dense nano- wires on Si(557) and the T4-h-B monolayer on Si(111), denoted simply monolayer in the following paragraphs. The domain of 𝛾(𝜇Tb) is confined to Tb chemical poten- tials smaller than that of hcp-Tb since higher values would precipitate elemental Tb. The abscissa is reset by hcp-Tb𝜇Tb so that the domain is 𝜇Tb < 0. Concentrating on the solid lines (red→ dense NW, orange→ sparse NW and black→ clean Si(111) / clean Si(557)), there are stability regions for each phase, which are shaded in the respect- ive colour. Very small values for 𝜇Tb make the Si substrate remain clean in the sense that neither the sparse nor the dense nanowires form. Above ≈−2.0 eV, the sparse nanowires are stable, superseded by the dense nanowires at ≈−1.5 eV. Of course, the phase diagram is incomplete. There are probably other phases which are more stable on certain regions, e.g. sparse nanowires in which the TbSi2 strips are sep- arated by more than one empty terrace. An exemplary pathway of such a phase is sketched in the phase diagram as a dotted blue line. Although not checked by cal- culation, these phases would match the expectable trend that lower 𝜇Tb results in sparser nanowires. In the other limit, there are nanowires of heights corresponding to more than one monolayer which become stable at high 𝜇Tb. For very high 𝜇Tb, a thick film assuming the hex-AlB2-TbSi2 – b structure would cover the substrate (vide supra, Sec. 5.3.1), which manifests itself as a vertical line at 𝜇 = 𝜇hex-AlB2-TbSi2 – bTb Tb in the phase diagram. A green arrow marks the position in Fig. 6.1 (e), employing the value of the approximated bulk structure (−1.827 eV, Tab. 4.6). At first glance, this would imply that dense nanowires would not occur at all. However, the thick film is kinetically hindered as the Tb deposition is limited by experimental controlling and because the Tb atoms are not infinitely mobile. That the silicide aggregation is locally and temporally limited effectively constrains the height of the structures, one layer in this case. The phase diagram moreover reveals that the monolayer on Si(111) is considerably more stable than the dense nanowires on Si(557). In terms of a constant Tb amount, the difference between the Tb chemical potentials of the monolayer and the dense nanowires amounts to: −806 meV + 327 meV = −479 meV. The nanowire formation is thus solely due to presence of the steps of the Si(ℎℎ𝑘) surface, which hinder the formation of a smooth monolayer. A second consequence of the high stability of the monolayer with respect to the dense nanowires is that surfaces with wider terraces and hence wider nanowires should be more stable than those with narrow terraces. For a fixed tilt angle, a substrate can widen its terraces by step bunching, i.e. the 1The main difference between the grand potential in Eq. (6.48) and the Tb chemical potential in Eq. (5.43) consists in the change of the thermodynamic boundary conditions from the constancy of the Tb amount 𝑁Tb and the surface tension 𝛾 to the constancy of the Tb chemical potential 𝜇Tb and the surface area 𝐴. In experimental terms, this corresponds to switching from the annealing stage, where an infinite substrate with a fixed Tb coverage is held at constant temperature, to the de- position stage, where a substrate of fixed area is exposed to vaporised Tb, the chemical potential of which is controlled by the pressure of the Tb vapour. According to the Gibbs-Duhem relation, in- creasing the pressure at constant temperature generally increases the chemical potential of a single- species thermodynamic subsystem. 96 6.2 Electronic properties formation of double or triple steps to the benefit of the terrace widths. The formation of nanowires which are wider than the terraces of the single-stepped surface (>18 Å for Si(557)) is indeed observed in the experiment, in particular, if nanowire samples are “cleaned” and reused [P8, 70]. 6.2 Electronic properties 6.2.1 Band structures The band structures are calculated analogously to Sec. 5.2. Since the nanowires are embedded in tilted supercells of the hexagonal (1 × 1)-Si(111) surface unit cell, the supercell Brillouin zones (sBZs) of the nanowires are folded versions of the prmitive Brillouin zone (pBZ) of the monolayer. In order to make the band structures compar- able, the Γ-point, K-points and M-points of the pBZ are identified in the sBZs. How- ever, it is impossible to find an exact mapping because the Si(557) and Si(111) sur- faces are tilted against each other by 9.4◦ and the steps misalign the (557)-projected (1 × 1) building elements of the terraces. Therefore, the M- and K-points of the pBZ are approximately addressed by their Cartesian coordinates in the sBZs, although they are not high-symmetry points in the latter. The band structures are calculated along two k-paths: ΓK parallel to the nanowires (x-direction) and ΓM perpendicular to the nanowires (y-direction). The points where the paths cross the boundaries of the sBZs are labelled X and Y, respectively. Approximate sketches of the sBZs may be found in Fig. 6.2 (c,iv). Accurate DOS calculations are carried out on dense Γ-centred (40 × 8 × 1) and (40 × 4 × 1) meshes for the dense and sparse nanowires, respect- ively. Fig. 6.2 shows the band structures of (a) the dense nanowires, (b) the sparse nanowires and (c) the monolayer. The line width corresponds to the portions of (i) all Tb atoms, (ii) all Si atoms of the cover (Si1 and Si2 in the monolayer), (iii) all top Si atoms of the substrate termination (Si3 in the monolayer) and (iv) bulk-Si in the PAW projections. The line colour is fixed in panels (ii) – (iv). In panel (i), it indicates the Tb5 portion in the PAW projections of all Tb atoms according to the colour bar. The energies are reset by the respective bulk VBMs 𝐸0. The Fermi level of the slab is inserted as a red horizontal line. Before the analysis can start, the band folding between Si(111) and Si(557) has to be elaborated in more detail since the Si(557) supercells and the (1 × 1)-Si(111) unit cell are incommensurable. It proves to be helpful to consider the unit cell of the dense/sparse nanowires a (6 × 1)/(12 × 1) supercell of the (1 × 1) unit cell of the ter- races. The increment of the symbolic notations (5× 1)/(10× 1) stems from the steps, which approximately add a 1𝑎 advance to the 5𝑎 wide terraces. The new reconstruc- tion symbols suggest that the ΓM path contains the ΓY path six and twelve times, respectively. The band structures in Fig. 6.2 (a) and (b) confirm this by the periodic features along ΓM having period lengths of 1/3 and 1/6, respectively. Conversely, the substrate bands along the Si(557)-ΓY path are a superposition of respectively six and twelve differently shifted images of the bands along the Si(111)-ΓM path. As a consequence, the substrate bands along ΓM in Fig. 6.2 (a) and (b) are horizontally 97 6 Nanowires on silicon(557) (i) Tb (ii) Si1 + Si2 (iii) Si3 (iv) Sibulk (a) 1 4 5 3 0 2 1 -1 Dense NW -2 M YΓ X K M YΓ X K M YΓ X K M YΓ X K (b) 1 4 5 3 2 0 1 -1 Sparse NW -2 M YΓ X K M YΓ X K M YΓ X K M YΓ X K (c) 1 Tb5 portion 20 % 50 % 0 M Y Γ -1 K X Monolayer -2 y M Γ K M Γ K M Γ K x Figure 6.2: Band structures of (a) dense and (b) sparse nanowires on Si(557) and (c) the mono- layer on Si(111) (all based on the T4-h-B structure). The line width corresponds to the PAW projections of (i) the Tb atoms, (ii) the Si atoms of the cover, (iii) the top Si atoms of the sub- strate termination, and (iv) bulk-Si. All projections are normalised by the number of atoms of the respective group. The line colour in (i) indicates the portion of Tb5 in all Tb projections according to the colour bar: yellow→ 20 %, green→ 50 %. All energies are reset by the VBM of the bulk-Si bands. The red horizontal lines indicate the Fermi levels of the slabs. (c,iv) Brillouin zones: black→ML on (1× 1)-Si(111), red→ dense NW on (5× 1)-Si(557), orange→ sparse NW on (10 × 1)-Si(557). blurred, particularly in the case of the dense nanowires. On the contrary, the perpen- dicular Si(557)-ΓK direction shows projections of bands along different, equidistant lines parallel to Si(111)-ΓK. After reduction by time-reversal symmetry, the multipli- city is 6/2+1 = 4 for the dense and 12/2+1 = 7 for the sparse nanowires. As a result, the substrate bands along ΓK in Fig. 6.2 (a) and (b) are vertically diffuse and, thus, still distinguishable, in contrast to those along ΓM. 98 E − E0 (eV) E − E0 (eV) E − E0 (eV) 6.2 Electronic properties The bands related to the nanowires, of which the Tb bands are the most important, look different from the substrate bands. For both the dense and the sparse nano- wires, they include five prominent, thick Tb bands, enumerated in Fig. 6.2 (a,i) and (b,i). Along ΓK (parallel to the nanowires) they point downwards with a steepness similar to that of the upper O band of the monolayer. Near the Γ-point and the K-point, they bend oppositely, so they resemble each an inverted S. Along ΓM (perpendicular to the nanowires) each Tb band is confined into a narrow energy range. Therein, the dense and the sparse nanowires show an important difference: While the Tb bands of the former are still wavy with strong dispersion, those of the latter are strictly flat and, hence, completely localised in that direction. All thick Tb bands are hybridised with the covering Si honeycomb on the entire Brillouin zone and with the substrate termination near the Γ-point, which is remarkably similar to the upper O band of the monolayer. While the projections of the four lowest thick Tb bands are distributed over all Tb atoms, the topmost band 5 majorly belongs to Tb5 at the open edge of the TbSi2 strips. Beside this band, there are further Tb5-associated bands at higher bind- ing energies. They are thinner than band 5 and complemented by PAW projections of the adjacent Si atoms, thus indicating strong Tb–Si hybridisation. Relation to the monolayer Tb@Si(111) system The analogy between the Tb bands of the nanowires and the upper O band of the monolayer suggests that the former may possibly be related to the latter by projec- tion. However, ordinary folding considerations like those above hold only for deloc- alised states, e.g. the substrate bands. For the bands related to the nanowires, it has to be taken into account that the TbSi2 strips extend to five projected (1 × 1) surface lattice constants per terrace, while the sixth is the step. Therefore, it is likely that the nanowire states originate in the monolayer states by (partial) localisation perpen- dicular to the growth direction, i.e. by mixing countercurrent monolayer states with wave vectors 𝒌 and −𝒌 to form standing waves. In order to test this, the hexagonal Brillouin zone of the monolayer is segmented by equidistant parallel translations of the ΓK path. Beginning at the x-axis, the path is shifted in the y-direction until it reaches the edge of the Brillouin zone defined by KMK. As the path crosses the zone y 1 2 3 4 5 6 7 8 9 10 11 x 1.0 0.5 0.0 -0.5 -1.0 Γ K M M K Figure 6.3: Segmentation of the Brillouin zone of the monolayer. The band structure is plotted along eleven equidistant paths parallel to ΓK, the sketches of which are shown above the plots (red lines). The upper O band and the electron pocket, both associated with Tb, are highlighted thick and yellow. All energies are reset to the VBM of bulk-Si. The red horizontal lines indicate the Fermi level. 99 E − E0 (eV) 6 Nanowires on silicon(557) boundary, it reappears in the lower half of the Brillouin zone after zone reduction. The bands along 11 segments are plotted in Fig. 6.3 and the Brillouin zones with the k-paths may be found above the respective panels. The upper O band and the con- duction band forming the M-point electron pocket are highlighted thick and yellow. From ΓK to MK+, the upper O band turns into an S shape similar to the Tb bands of the nanowires. The band width decreases. If the path approaches an M-point, the electron pocket appears (segments 6 and 11 in Fig. 6.3). Sparse Dense Project. Fig. 6.4 shows an overlay of the equidistant seg- NW NW ML 1 ments 1, 3, 5, 7 and 9 (right panel) and compares it with the ΓK part of the band structures of the dense nanowires (middle panel) and the sparse nanowires 0 (left panel), taken from Fig. 6.2 (a,i) and (b,i). The right and middle panels look very similar to each other. The lowest four Tb bands of the dense nano- -1 Γ K Γ K Γ K wires follow the segments 3, 5, 7 and 9 of the upper O band almost congruently, merely differing by smal- Figure 6.4: Band structures of the ler shifts. Segment 1, the topmost band (coloured nanowires and projected bands of green in Fig. 6.4), undergoes more shifting and re- the monolayer. hybridisation to become the Tb5 band of the dense nanowires. Hence, the band structure of the dense nanowires is related to that of the monolayer by a simple zone projection in large part. It is astonishing that the discontinuities from the steps only slightly affect the intercoupling between adja- cent Tb strains, even at the edges of the TbSi2 strips. The close electronic related- ness between the dense nanowires and the monolayer proves that the former are a two-dimensional electronic system, in spite of the stripy structure. The situation is different for the sparse nanowires (left panel). The bands related to the inner Tb atoms are shifted to higher binding energies compared to the dense nanowires and the monolayer projection, but they retain their morphology. Thus, they are still subjected to the physics of the monolayer, concerning the intercoup- ling of the Tb strains. However, the total localisation perpendicular to the nanowires (vide supra) shows that these states are genuine standing waves in that direction. Consequently, the TbSi2 strip can be considered a quantum well derived from the TbSi2 monolayer. In contrast to the inner states, the green Tb5 band rehybridises so strongly that it completely loses the comparability with segment 1 of the monolayer. In other words, it represents a new state which cannot be derived from the mono- layer any more. Because of its localisation at Tb5, it can be considered an edge state and, thus, resembles a one-dimensional system within the sparse nanowires. 6.2.2 Fermi surfaces The hypotheses on the dimensionality of the sparse and the dense nanowires are tested by means of the Fermi surfaces, which are calculated analogously to Sec. 5.2.3. Fig. 6.5 shows the Fermi surfaces of the monolayer and the nanowires. The Fermi level of the sparse nanowires is aligned to that of the dense nanowires (+0.1 eV). 100 E − E0 (eV) 6.2 Electronic properties The Fermi surface of the dense nanowires has stripy features, which are clearly linked to a folding procedure from the hexagonal pBZ into the smaller sBZ, marked white in Fig. 6.5 (a) and (b). Following the light blue dashed lines, the electron pock- ets at the M-points generate the thin, bright stripe at the Γ-point as well as the broad, weak stripes at the X-point. The thin, weak lines accompanying the middle bright stripe (indicated by a light green arrow) stem from the points of the star-shaped hole pocket, as clarified by the light green dashed line. These lines are the Fermi break- throughs of band 5 (green arrow in Fig. 6.2 (a,i)). The relationship between them and the hole pocket agrees with the above observation that band 5 originates from the upper O band of the monolayer. Since all features are wavy or corrugated in the (vicinal) [112̄] direction (perpendicular to the nanowires), the Fermi surface of the dense nanowires has a two-dimensional character. At first glance, the Fermi surface of the sparse nanowires looks similar to that of the dense nanowires. The major dif- ference consists in a lack of corrugation, so all stripes are straight. Also the lines related to band 5 straighten (green arrow), so they are completely separated from the other stripes. Hence, the these states gain a purely one-dimensional character, confirming the results from the band structure analysis. In Fermi surface measurements, the electrons expelled from the sample carry a par- allel momentum equal to the crystal momentum of the initial state. However, the latter does not accord with the multiply reduced zone scheme of the supercells, but with the Brillouin zone of the local periodicity of the primitive building elements, the (1 × 1)-Si(111) terraces in the case. The Fermi surface in the reduced zone scheme of any supercell can be unfolded into the extended zone scheme by weighting the k-resolved DOS (Eq. (5.45)) with the Fourier components of the wave functions and integrating it over the sBZ. In a second step, the unfolded Fermi surface can be folded back into the pBZ [124]. In detail, the unfolded Fermi surface in terms of the primitive Brillouin zone pBZ𝐷F (𝒌) holds: 𝑁∑︁Band ∫ pBZ 𝐷F (𝒌) = ∑︁d𝐾∑︁𝑊𝑛𝒌𝑲 𝛿(𝐸F − 𝜀𝑛(𝑲))𝑛=1 sBZ 𝑲 ∈ sBZ, 𝒌 ∈ pBZ, 𝑊 = 𝛿(𝑲 +𝑮 − (𝒌 + 𝒈)) 𝑢 (𝑲) 2 (6.49)𝑛𝒌𝑲 𝑛𝑮 𝒈 𝑮 where 𝑲 and 𝒌 are the k-vectors of the sBZ and pBZ, respectively, 𝑮 and 𝒈 are the direct lattice vectors of the supercell and the primitive unit cell, respectively, and 𝑢𝑛𝑮 (𝑲) are the Bloch factors of the supercell (Eq. (2.27)). In practice, this procedure works only if the super lattice vectors are linear combinations of the primitive ones with integral coefficients. Therefore, the above Fermi surface of the dense nanowires cannot be unfolded into the (1× 1) Brillouin zone since the terraces advance by half a period in the parallel direction when passing a step. This issue could be resolved by calculating the dense nanowires in a doubled (10×1) supercell, which is not done in this work, though. The sparse nanowires, on the other hand, are already modelled in a (10 × 1) super- cell. Their unfolded Fermi surface is plotted in Fig. 6.5 (c). The unfolding procedure recovers the M-point electron pockets from the thin stripe though the Γ-point and the 101 6 Nanowires on silicon(557) (a) Monolayer (b) Dense nanowires 1.5 K (111) → (557) 1.0 + X folding 0.5 M 0.0 Γ YΓ -0.5 -1.0 -1.5 (c) Sparse NW (unfolded) (d) Sparse nanowires 1.5 K 1.0 X 0.5 M 0.0 Γ Γ Y -0.5 -1.0 unfold -1.5 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 k −1 −1[112̄] (Å ) k[112̄] (Å ) (e) Edge state Figure 6.5: Fermi surfaces of the Tb@Si(ℎℎ𝑘) systems: (a) monolayer, (b) dense nanowires, (c) and (d) sparse nanowires (all based on T4-h-B). The axes point into the directions [112̄] (parallel to the NW, vicinal for NW) and [1̄10] (perpendicular to the NW). (c) is derived from (d) by unfolding the sBZ to the local hexagonal pBZ of the Si(111) terraces. The Brillouin zones are marked white. The colouring corresponds to the colour bar (arbitrary units, the ranges are comparable for (b), (c) and (d), but not linked to that of (a)). Ellipses highlight the electron pockets at the M-point. Green arrows indicate the new edge state. Dashed lines are guides for the eyes. (e) Real-space distribution of the edge state. Reproduced from [P8]. broad stripes trough the X-points. However, the pocket shapes differ from those of the monolayer: The two horizontal pockets have a considerably stretched aspect ra- tio, which indicates higher effective masses perpendicular to the nanowires. Also the electron pockets at the other four M-points are stretched and effectively rotate away from the growth direction of the nanowires. Both effects indicate that the electronic mobility perpendicular to the nanowires decreases. In contrast to the electron pock- ets, the hole pocket is not recovered. Instead, the state linked to band 5 embraces the 102 k −1 −1[1̄10] (Å ) k[1̄10] (Å ) 0 max dense → sparse 6.3 Discussion and summary region around Γ as two straight lines (green arrow). This confirms the above find- ings that the band-5 state has undergone so many changes that the relationship with the Tb states of the monolayer is lost. In order to clarify that this state is indeed an edge state, as inferred from the PAW projections, the corresponding real-space par- tial charge at the Fermi level is plotted as isosurfaces in Fig. 6.5 (e). Confirming all expectations, this state is strongly confined to a narrow channel near Tb5. 6.3 Discussion and summary The nanowire-Tb@Si(557) surface provided the opportunity to explore a fascinating lower-dimensional quantum system. The terraces of the substrate are indeed a tem- plate for a stripped TbSi2 film which resembles nanowires from a structural point of view. The strips follow the stability relations of the monolayer-Tb@Si(111) film in that they locally adopt the T4-h-B structure. Only the edges of the strips are subjected to distortions. The geometric relations between the Si(001)-like steps and the Si(111)- like terraces was shown to be very convenient for the T4-h-B structure since the TbSi2 strip can connect with the broken sp3 bonds of the step. This may be the reason why these types of TbSi2 nanowires grow only on Si(ℎℎ𝑘) surfaces where ℎ < 𝑘 [70]. The phase diagram simulating the deposition stage proved that the density of terraces bearing a TbSi2 strip is smaller/higher for Tb-poor/Tb-rich conditions. It furthermore hints that step-bunching leading to wider TbSi2 strips is favourable. As far as visible from experimental STM images [P8], the narrow 5𝑎 wide TbSi2 strips do occur in Tb@Si(557), but the surface is dominated by wider strips. TbSi2 nanowires on step- bunched surfaces are certainly worth investigating in future work. The dense nanowires are a two-dimensional electronic system, irrespectively of the discontinuities from the steps. Their band structures and Fermi surfaces can be de- rived from the monolayer by a simple zone projection. The electronic relatedness between the dense nanowires and the monolayer agrees with experimental ARPES and Fermi surface measurements [P8]. The sparse nanowires differ very much from the dense nanowires regarding the electronic properties. The states associated with the inner Tb atoms become genuine standing waves according to a simple particle-in- a-box model. In other words, the isolated TbSi2 strips act as a quantum wells which laterally trap the electrons. In doing so, the coupling within the strips remains unaf- fected. The effective masses of the electron pockets agree with the Heisenberg prin- ciple: The lateral confinement of the positions broadens the lateral crystal momenta in the Fermi surface. Very similar effects were also observed in the experiment [P8]. In contrast, the edge of each isolated strip gives rise to an edge state which cannot be derived from the monolayer states. This state is laterally confined to a narrow region around the outermost Tb atom and has a clear one-dimensional character, as proven by the unfolded Fermi surface. The edge state was also observed in the measure- ments on the nanowire-Tb@Si(335) system [P8], whose terrace widths correspond to the Si(557) substrate of this work (step bunching). In conclusion, the Tb@Si(557) surface permits the observation of a fascinating dimensional crossover from a two- dimensional film to a system of quasi-one-dimensional nanowires. 103 Part II: Thin antimony layers on bismuth selenide The discovery of topological order in solid-state physics was the door opener for a new class of materials whose state underlies the mathematical concept of topo- logy. Topologically non-trivial materials are band insulators with intrinsic properties which are linked to the topological nature of the electronic wavefunctions. These spe- cial properties show an exceptionally high stability against many perturbations since topological non-triviality is much stronger than a plain symmetry protection. The research of topology in materials science began in the 1980s when von Klitzing et al. discovered the integer Hall effect (IHE) in the two-dimensional electron gas of MOSFETs [125]. Their work was the first documentation of topological order in a solid-state system. Because of the topological non-triviality, the IHE is insensitive to deviations from the idealised assumption of a homogeneous, two-dimensional elec- tron gas. Hence, it even works in the presence of impurities, the finiteness of the sample and electron-electron interaction [126]. Topologically ordered solids soon at- tracted the attention of the condensed-matter community, who had a great deal of work published on different systems. In doing so, several types of topological order were discovered, which are based on different topological invariants. The research was so groundbreaking that is was awarded with three Nobel Prizes [W4]: • Klaus von Klitzing “for the discovery of the quantized Hall effect” (1985) • Robert B. Laughlin, Horst L. Störmer and Daniel C. Tsui “for their discovery of a new form of quantum fluid with fractionally charged excitations” (1998) • David J. Thouless, F. Duncan M. Haldane and J. Michael Kosterlitz “for theoret- ical discoveries of topological phase transitions and topological phases of mat- ter” (2016) As von Klitzing pointed out in his Nobel Prize Lecture, nobody expected at that time that semiconductor physics could produce any fundamental discoveries: Semiconductor research and the Nobel Prize in physics seem to be contra- dictory, since one may come to the conclusion that a complicated system like a semiconductor is not useful for very fundamental discoveries. In- deed, most of the experimental data in solid-state physics are analyzed on the basis of simplified theories, and very often the properties of a semicon- ductor device are described by empirical formulas since the microscopic details are too complicated. Up to 1980 nobody expected that there ex- ists an effect like the quantized Hall effect, which depends exclusively on 105 Thin antimony layers on bismuth selenide fundamental constants and is not affected by irregularities in the semicon- ductor like impurities. [126] The topological class of materials is linked to certain quantities, the topological in- variants. These quantities are fundamental properties of topological spaces and in- dicate whether they can be transformed into each other or not. In detail, if two topo- logical spaces have different topological invariants, they are not homeomorphic to each other, i.e. they cannot be mapped onto each other in a continuous and bijective manner. The reverse statement does not necessarily hold. The simple example of Euler’s theorem of polyhedrons provides an easy approach towards the matter of topology. As taught in many geometry lessons, it is well known that all convex polyhedrons obey the rule that the number of vertices minus the number of edges plus the number of faces equals 2. This holds, e.g., for the cube (8−12+6 = 2), the tetrahedron (4−6+4 = 2) and the octahedron (6−12+8 = 2). The theorem can even be generalised to closed surfaces which are not necessarily con- vex or polyhedral, e.g. the smooth 𝕊2 sphere. In this case, a triangulation produces countable vertices, edges and faces. The alternating sum of these is the Euler charac- teristic, which defines a topological invariant of the surface. It is thus independent of the details of the triangulation because different triangulations are homeomorphic to each other. Since Euler’s theorem for polyhedrons is a special case of the Euler characteristic, it is obvious that also the sphere has an Euler characteristic of 2 as it can be mapped onto a cube, a tetrahedron or an octahedron. The ring-shaped 𝕋2 torus is another example for a closed surface which clarifies the usefulness of the Euler characteristic. A possible triangulation for this surface con- sists in a right-angled, closed arc, similar to the shape of l’Arche de la Défense, Paris, France2. Comprising 16 vertices, 32 edges and 16 faces, it yields an Euler character- istic of 0, differently from the sphere. Thus, the torus and the sphere are so funda- mentally different from each other that a homeomorphism between them does not exist. In this case, the hole of the torus is the central property which discriminates the topological classes. A consequence of the different topological classes which is indeed relevant for materials scientists is the existence of a global map for the torus, while a global map for the sphere does not exist3. This is the topological reason for why a 2D crystal can be mapped onto a square (more generally a parallelogram), despite being a closed torus due to the periodicity in two directions. The same holds for the Brillouin zone, whose toroidal nature will be relevant at several points in Chap. 7. In the case of solids, topological invariants can be calculated from the electronic wavefunctions, provided that they can be separated into well-defined occupied and unoccupied manifolds, i.e. the material of interest is a band insulator. Those insulat- ors whose topological invariants are non-trivial are called topologically non-trivial insulators or simply topological insulators (TIs). In terms of solid-state physics, the 2La Grande Arche de La Défense, https://fr.wikipedia.org/wiki/Arche_de_la_D%C3%A9fense#/media/ Fichier:Grande_Arche_F%C3%A9vrier_2020.jpeg (visited on 24/03/2022). 3For instance, the Earth has no single, global map. Instead, it has to be described by a set of local maps which partially overlap: an atlas. More precisely, at least two local maps are necessary in order to find any geographic point on at least one of the two maps. For example, the UNO logo lacks Antarctica for this reason. 106 Thin antimony layers on bismuth selenide above theorem of conserved topological invariants tells that the special properties of TIs are retained as long as the band gap does not close. For this reason, the topological order of a TI is insensitive to imperfections like disorder or defects. The practical usefulness of TIs does not necessarily consist in the intrinsic proper- ties. If insulators of different topological classes border each other, in particular if a TI is connected to a topologically trivial insulator (conventional insulator, CI), the band gap closes so as to allow the bands to reorder. In other words, there are metallic edge states somewhere at the interface. Since the vacuum is also a CI, the surface of every TI bears topologically protected surface states (TSSs), which are indestructible by any perturbation, provided that the topological classes of the involved bulk ma- terials are conserved. Of course, metallic surface states which exists regardless of the morphological details are very attractive for surface scientists. This part of the thesis is dedicated to the Sb@Bi2Se3 system, a van der Waals (vdW) heterostructure consisting of thin layers of antimony (𝛽-Sb) which are adsorbed on bismuth selenide substrates (Bi2Se3). This heterostructure is a particularly interest- ing system because, firstly, the 𝛽-Sb adlayer adsorbs to the Bi2Se3 substrate by means of vdW interactions and, secondly, the 𝛽-Sb adlayer is a CI, while bulk Bi2Se3 is a TI, more precisely a ℤ2 topological insulator. Of course, TSSs emerge as the non-trivial class of the substrate has to transition into the trivial class somewhere. However, to- pology does not specify any details about the transition, e.g. the location of the TSSs. A priori, the TSSs can be located anywhere, from being buried in the substrate to rising up to the surface of the Sb film. That Bi2Se3 and 𝛽-Sb are vdW materials does not simplify this issue as it is unclear how the adlayer and the substrate mutually affect their electronic properties. The aim of this part is to analyse the nature of the TSSs of the Sb@Bi2Se3 hetero- structure. Since the interplay between the topological classes of the Bi2Se3 substrate, the 𝛽-Sb adlayer and the vacuum is complex, elaborating the topological invariants is inevitable for analysing the formation of the TSSs and for understanding topolo- gical phase transitions. Therefore, Chap. 7 introduces the matter of geometric phases, which finally leads to the definition of the ℤ2 invariant for time-reversal-symmetric solids. Chap. 8 follows as a preparatory chapter which treats the constituents of the heterostructure Bi2Se3 and 𝛽-Sb separately. The investigations serve two purposes: testing the approach, which employs methods beyond DFT, in particular vdW and SOC, and calculating the band structures and the ℤ2 invariants. The actual hetero- structure is the matter of Chap. 9. At first, a stable structure model is derived, where the focus is on the inversions of the Sb adlayer. After that the electronic properties are calculated for the most stable structure models. The chapter concludes with the analysis of the complex series of topological phase transitions, which occurs upon a simulated adsorption process of the Sb adlayer. 107 7 Geometric phases The concept of geometric phases is unfamiliar to many scientists, but fundamentally important for understanding modern methods in condensed-matter theory. It is the mathematical basis of the topological invariants in solid-state systems. The aim of this chapter is thus to introduce the concept of geometric phases and then to derive theℤ2 invariant step by step. The Aharonov-Bohm effect marks the beginning as it is a manifestation of geometric phases, granting a more or less easy access to the matter. After that, the geometric phases along with the respective geometric connections and geometric potentials are formally defined, according to the approach of Berry [127]. The modern theory of polarisation according to the review of Resta [128] follows as it is the basis of topological transport. This defines the IHE, which is introduced in a generalised manner. Based on the IHE, the chapter concludes with the related ℤ2 topological invariant in two and three dimensions, the objective of this chapter. 7.1 The Aharonov-Bohm effect According to the principle of minimal coupling (Sec. A.2), a gauge transformation of the electromagnetic fields introduces a phase in the wavefunctions |Ψ⟩ of charged particles. Consequently, the absolute phase of a quantum state is not a measurable quantity since gauge transformations must not change the physical observations. On the other hand, phase differences can be measured, e.g. in interference experiments. This raises the question how the observability of phase differences accords with the non-observability of gauge transformations. A phenomenon which is taught in every quantum-mechanics class, but whose interpretation is still the matter of vivid discus- sions, is the Aharonov-Bohm effect (ABE) [129–134]. After first hints by Franz [135] in 1939 and a more complete description by Ehrenberg and Siday [136] in 1949, Ahar- onov and Bohm [129] developed a general gedankenexperiment in 1959 which clari- fies the geometrical nature of gauge potentials. The space-dependent, magnetic ABE is based on the magnetic vector potential and can conceptually be realised by the model of an infinitely long solenoid with finite radius 𝑅. The magnetic field may be constant inside the solenoid and point into the z-direction. Outside, the magnetic field may vanish. In polar coordinates and in the Coulomb g{auge, the vector potential 𝑨 holds: 𝐵𝜌 { 2 e𝜑 , 𝜌 < 𝑅( ) ⇒ ∇ × 𝐵e , 𝜌 < 𝑅𝑨 𝒙 = 𝐵𝑅2 𝑨 = 𝑩 = 𝑧 ∧ ∇ · 𝑨 = 0 (7.50) 2𝜌 e𝜑 , 𝜌 > 𝑅 0 , 𝜌 > 𝑅 108 7.1 The Aharonov-Bohm effect Consider a particle with charge 𝑞 in the hollow-cylindrical field-free region Ω = {𝒙 | 𝜌 > 𝑅}. It may be in the quantum state |Ψ⟩ when the solenoid is switched off. Switch- ing the solenoid on transforms |Ψ⟩ by a phase shift according to the principle of min- imal coupling in combination with ∇ × 𝑨 = 0. This phase is proportional to the line integral of 𝑨 〉along a〉𝐶1 curve 𝛾 ⊂ Ωs B Ω〉\ {𝒙 | 𝜑〉= 𝜑s ∈ [ 0, 2〉π)}. ( ) 〉𝐻 (𝒑, 𝒙) Ψ = 𝐸 Ψ → 𝐻 (𝒑 − 𝑨 𝑞𝑞 , 𝒙) Ψ̃ = 𝐸 Ψ̃ ⇒ Ψ̃ = exp i ℏ 𝜒(𝒙) Ψ (7.51a) with ∫𝛾 : [0, 1] → Ωs, 𝛾(0) = 𝒙0, 𝛾(1) = 𝒙 𝜒(𝒙) = 𝑨 · d𝛾 = 1 22𝐵𝑅 (𝜑 − 𝜑0) ⇔ 𝑨(𝒙) = ∇𝑥𝜒(𝒙) (7.51b) 𝛾 where 𝜑 and 𝜑0 denote the polar angles of 𝒙 and 𝒙0, respectively. The constraint on Ω for integrating 𝑨 is necessary for the validity of Eq. (7.51b), as will be explained below. In other words, 𝛾 must stay in a sector around the solenoid. The phases in Eq. (7.51a) can be observed as an interference between two particle beams running clockwise and anticlockwise around the solenoid. Both beams ori- ginate in 𝒙0 ∈ Ω, the source, and meet again in 𝒙1 ∈ Ω, where the interference measurement is performed. The points divide Ω into two parts Ω+ and Ω− in which the paths of the beams are denoted 𝛾+ ⊂ Ω+ and 𝛾− ⊂ Ω− (Fig. 7.1). The superposed probability amp lit〉ude ho+〉lds: 〉Ψ̃ = Ψ̃ + Ψ̃− [= exp(i 𝑞 + +〉 𝜒 ) Ψ + e〉xp( i 𝑞 𝜒〉−]) 〉ℏ ℏ Ψ− = exp(i 𝑞ℏ 𝜒 −) [exp(i 𝑞ℏ (𝜒+ − 𝜒〉−)) Ψ+ 〉+] Ψ− (7.52)∫ = exp∮(i 𝑞 𝜒−) exp(i 𝑞∬𝜒 ) Ψ+ + Ψ−ℏ ℏ 𝛾with ∬ · d · d Stokes𝜒 = 𝑨 𝛾 = 𝑨 𝛾 = (∇ × 𝑨) · d𝑺 = 𝑩 · d𝑺 = 𝐵π𝑅2𝛾 = Φ0 𝛾+−𝛾− 𝛾 𝑆 𝑆 where Φ0 is the total magnetic flux through the area 𝑆 encircled by 𝛾+ and 𝛾− . Eq. (7.52) leads to an astonishing conclusion: A magnetic field can influence the phases of particles which are strictly prevented from crossing it. The phase differ- ence 𝑞ℏ 𝜒𝛾 depends neither on the point 𝒙1 where the measurement is performed, nor on the details of the paths, but it is proportional to the total magnetic flux through the solenoid. The indirect effect of a magnetic field on charged particles via its vector potential is the central result of Aharonov and Bohm [129]. As the field-free interac- tion sounds spooky at first glance, the ABE causes arguments about its interpretation until today [131–133]. The probably most important source of misunderstanding is the seeming observability of the gauge transformation in Eq. (7.51b). The misunderstandings arise if one infers the integrability of 𝑨 from its vanishing curl ∇× 𝑨= 0. Since it would then have a scalar potential, all loop integrals of 𝑨 would be zero. Moreover, 𝑨 could be “gauged away” to 𝑨≡ 0, which would negate any 109 7 Geometric phases action. However, curl-freeness is not sufficient for the integrability of a vector field, but the domain has to be simply connected in addition [137, p. 264]. As long as simply connected sectors of Ω, e.g. Ω+ or Ω− , are considered, 𝑨 is integrable and has a scalar potential 𝜒, as explicitly shown in Eq. (7.51b). As a consequence, particles moving in the same simply connected region do not show the ABE and the phase in Eq. (7.51a) is indeed an unobservable gauge. This changes if the line integral runs around the solenoid, as effectively realised by the two countercurrent beams. Since the solenoid pierces a cylindrical hole into the field-free region, Ω is multiply connected and a scalar potential for 𝑨 does not exist, despite ∇× 𝑨 = 0 1. Consequently, line integrals in general depend on the integration curve and loop integrals can yield non-zero values. Moreover, 𝑨 cannot be gauged away any more, but it encodes the magnetic field inside the solenoid in any gauge. In mathematical terms: the equivalence class [𝑨] defined by 𝑨 ∼ 𝑨 + ∇𝜒 is non-zero. While the phases in (7.51a) depend on the gauge, the ABE phase in (7.52) is gauge invariant since additional gradient fields ∇𝜒 vanish in loop integrals. Therefore, it is a physical observable and can be measured in principle. The ABE is a fascinating illustration of the role of Ω− x1 Ω+ fields and potentials in physics. Although it is not obvious in this case, the ABE phase shift can be explained solely by the magnetic field 𝑩, as well as solely by its vector potential 𝑨, whose equi- γ+ valence class is bijectively connected to 𝑩. Thus, γ− the popular statement that 𝑨 contains “more in-R formation” than 𝑩 is not meaningful, as well as the question whether the one or the other is more important in physics. The ABE is a pure quantum effect, for which a classical analogon does not ex- x0 ist. For this reason, imagining a quantum particle moving in a force field leads to confusions about Figure 7.1: Sketch of the ABE setting. + − the ABE. Aharonov and Bohm pointed out that the𝛾 and 𝛾 connect 𝒙0 with 𝒙1 anti- clockwise and clockwise around the idea of local force fields like 𝑩 determining the solenoid (grey). physics of a quantum system applies nowhere in quantum physics. Instead, the quantum state is determined by a Hamilton operator which is constructed by locally acting potentials (local in position and momentum operators) [130]. Non-local effects naturally arise since the differential equations are solved for the entire domain of the operator. An example for the absurdity of force fields acting locally on quantum particles is the simple double-slit experiment: It is very confusing to imagine that a particle “going” through the one slit “feels” whether the other slit is covered. Hence, the non-locality of wavefunctions and the importance of path possibilities, as emerging in the ABE, are not really exotic phenomena in quantum mechanics. 1One could have the idea to consider 𝜒(𝒓) = 𝜑 𝐵𝑅2/2 a scalar potential. Its gradient indeed yields the correct vector field 𝑨(𝒓) of Eq. (7.50). However, the domain of 𝜒 is a simply connected sector of Ω, e.g. the area {𝒓 | 𝜌 > 𝑅, 𝜑 ∈ [0, 2π)}. It is impossible to define 𝜒 continuously on Ω. More precisely, at least two local definitions of 𝜒 are necessary in order to globally calculate 𝑨. One example for these definitions are 𝜒+ and 𝜒− on the sectors Ω+ and Ω− above (with an appropriate overlap). 110 7.2 The Berry phase Since the ABE phase depends only on the enclosed magnetic flux Φ0, and neither on the point of measurement, nor on the details of the paths of the particle beams, the ABE has already been observed in 1962: By means of the ABE, Möllenstedt and Bayh determined the ratio ℎ/𝑒 with a precision of 14 % [138]. Many years followed, in which experimental evidence for the ABE was still argued to be due to leaking magnetic fields, until Tonomura et al. brought a convincing experimental proof in 1986 [139]. Obviously, the ABE phase is a fundamental property of the setup and the wavefunction domains in that it is insensitive to continuous deformations of the solenoid and of the beams. This phenomenon can be generalised by the concept of geometrical phases, which is the matter of the next section. 7.2 The Berry phase Geometric phases are also called Berry phases, as Berry introduced them in 1984 by the following procedure [127]. Let 𝐻 be the Hamilton operator for a quantum system which depends parametric- ally on a multidimensional parameter 𝝃 = (𝜉1, 𝜉 𝑁2, . . . , 𝜉𝑛) ∈ ℝ . The corresponding stationary Schrödinger equation is then〉 〉 𝐻 (𝝃) 𝑛(𝝃) = 𝐸 (𝝃) 𝑛 𝑛(𝝃) . (7.53) Furthermore, let 𝛾 : [0, 1] → 𝝃(𝑡) ⊂ ℝ𝑁 be a 𝐶1 curve which describes a continuous parametric variation. It may be slow enough that the system stays in the eigenstate |𝑛(𝝃)⟩ on the entire curve 𝛾, i.e. for every parameter 𝝃(𝑡), the state |𝑛(𝝃(𝑡))⟩ satis- fies Eq. (7.53) and the eigenvalue 𝐸𝑛(𝝃) has the same index 𝑛. In other words, level crossing during the parametric variation is explicitly forbidden. The phase factor for |𝑛(𝝃(𝑡))⟩ at a given parameter 𝝃(𝑡) ≠ 𝝃(0) is not arbitrary like in single-shot calculations, but it depends on the initial state |𝑛(𝝃(0))⟩. The connection between the phases follows from considering 𝑡 as time and solving the Schrödinger equation with〉 the initial condition |Ψ(0)⟩ B |𝑛(𝝃(0))⟩ and the following ansatz:Ψ(𝑡) = ︸︷1︷︸ e︸x p ( − i 1ℏ ∫ 0 𝑡 ︷𝐸︷𝑛 ( 𝝃 ( 𝑡 ′ ) ) ′︸ ︸ 〉d𝑡 ) exp(︷i𝜒︷𝑛 ( 𝑡 )︸) 𝑛(𝝃(𝑡)) (7.54) arb. phase dynamical phase geometric phase The ansatz wavefunction |Ψ(𝑡)⟩ has three phase factors: the arbitrary phase (≡1), the general dynamical phase from the temporal evolution, and a third term 𝜒𝑛(𝑡) which bears all other phase shifts due to the paramet ric vari𝜕 ∫ 𝜕𝜒𝑛(𝑡) ∫〉 ation. 𝜕𝝃(𝑡) 〉 iℏ |Ψ(𝑡)⟩ = 𝐻 (𝝃) |Ψ(𝑡)⟩ 〈 ⇒ ℏ 〉 𝑛(𝝃(𝑡)) = iℏ · ∇𝜉 𝑛(𝝃(𝑡))𝜕𝑡 𝜕𝑡 𝜕𝑡⇔ 𝜒𝑛 = i 𝑛(𝝃) ∇𝜉 𝑛(𝝃) · d𝝃 C A𝑛(𝝃) · d𝝃 𝛾 𝛾 111 7 Geometric phases with 〈 〉 A𝑛(𝝃) = i 𝑛(𝝃) ∇ 𝜉 𝑛(𝝃) (7.55) As it is evident, the parametric variation does not only produce the normal dynamical phase shift. There is an additional phase shift 𝜒𝑛 which originates in the parametric variation itself: the geometric phase or Berry phase [127]. The integrand A𝑛(𝝃) is the Berry connection, which describes how the phase of the eigenfunction |𝑛(𝝃)⟩ re- sponds to infinitesimal variations of the parameter 𝝃. If 𝛾 is a closed loop, the Berry phase assumes a cert∮ain value, which can be zero orfinite. 𝝃(1) = 𝝃(0) ⇒ 𝜒𝑛(1) = 𝜒B𝑛 = A𝑛(𝝃) · d𝝃 (7.56) 𝛾 Hence, a cyclic parametric variation of the Hamilton operator can shift the phases of the eigenstates beside the usual dynamical phase shift. Since not depending on the time lapse, the Berry phase ar ises e〉ven if the vaB riation〉 is instantaneous (Δ𝑡 = 0).𝑛final = exp(i𝜒𝑛) 𝑛initial If the parameter space is three-dimensional, it is instructive to apply Stokes theorem so as to obtain a surface integral: ∬ 𝜒B𝑛 = B𝑛(𝝃) · d𝑺 (7.57) 𝑆 where 𝑆 is the area encircled by 𝛾 and B𝑛(𝝃) = ∇𝜉 × A∑︁ 〈 〈 〉(𝝃) = i ∇𝜉𝑛(𝝃) |× |∇𝜉𝑛〉(𝝃) 〈 𝑛(𝝃) ∇ 𝐻 (𝝃) 𝑚(𝝃) × 𝑚(𝝃) ∇ 〉 − 𝜉 𝜉 𝐻 (𝝃) 𝑛(𝝃) (7.58) = Im ( ( ) − ( ))2 [127]≠ 𝐸𝑚 𝝃 𝐸𝑛 𝝃𝑚 𝑛 The way how the Berry connection (Eq. (7.55)) and the Berry potential (Eq. (7.57)) influence the phase of the parametrised system are remarkably analogue to the ac- tion of a magnetic field and its vector potential on the phases of a charged quantum particle in the ABE (Eq. (7.52)). Therefore, analogously to the ABE phase, the Berry phase 𝜒𝑛(𝝃) is not observable for open curves as the absolute phase information is gauge-dependent. Only if the system is compared with a reference system at the same set of parameters, which effectively corresponds to a closed loop, or if the system is directly subjected to a circular parametric variation, the Berry phase is, in principle, obs erva〉ble. This holds becau〉se 𝜒 B 𝑛 is gauge invariant. 𝑛(𝝃) → exp(i𝜒(𝝃)) 𝑛(𝝃) ⇒ A𝑛(𝝃) → A𝑛(𝝃) − ∇𝜉𝜒(𝝃) ⇒ 𝜒B𝑛 → 𝜒B𝑛 + 0 (7.59) In particular, the above choice of≡1 for the arbitrary phase has not affected the phys- ical results. The proof that the ABE can indeed be formulated in terms of Berry phases can be found in the appendix (Sec. C.1). 112 7.3 The electric polarisation in a crystal The line integral of the Berry connection A𝑛(𝝃) is more handy for proving the con- nection between physical quantities and for analytical problems. For the numeric treatment, however, A𝑛(𝝃) is not easy to calculate because the derivatives in Eq. (7.55) have to be replaced by finite differences with connected phase relations. In order to avoid this problem, the surface integral of the Berry potential B𝑛(𝝃) can be used as it depends on well-defined derivatives of 𝐻 (𝝃) [127]. 7.3 The electric polarisation in a crystal Geometric phases are applicable whenever the response of a quantum system to a parametric variation is the quantity of interest. An example which actually neces- sitates them for the correct description are null-field polarisation effects in a crystal, e.g. ferroelectricity or piezoelectricity. A naive, but wrong formula for the macro- scopic polarisation 𝑷 of a crystal would be the summation of all electrostatic dipole moments created by the charge density 𝜌(𝒙∫) in the direct unit cell CR. 1 𝑷 = 𝑷ion + |C | d𝑥 𝒙𝜌(𝒙) (7.60)R CR where 𝑷ion is the polarisation from the ions, |CR | the direct unit cell volume and 𝜌(𝒙) the electronic charge density g∑︁iven by∫the mo〈dulus of t−𝑒 1 〉 h〈e B loch fu〉nctions: 𝜌(𝒙) = |C | |C | d𝑘 Ψ𝑛(𝒌) 𝒙 𝒙 Ψ𝑛(𝒌)R 𝑛 G CG where |CG | is the reciprocal unit cell volume. This definition works fine if all elec- trons are localised within CR, e.g. in molecular crystals or ionic systems. In cova- lent systems like semiconductors, however, the electrons are not confined to a spe- cific cell, so Eq. (7.60) fails to correctly describe the macroscopic polarisation [140]. King-Smith and Vanderbilt (1993) and Resta (1994) wrote reviews on this, analysing the quantum effects of electric polarisation and linking it to geometric phases of the electronic wavefunctions [128, 141]. The key points of these reviews are retraced in the following paragraphs as they are the basis of topological transport. From now on, electrons in a crystal are considered. They are described by the Bloch Hamilto- nian (vide supra, Eqs. (2.27)) and carry the negative elementary charge 𝑞 = −𝑒. Being fermions, they occupy the states from the lowest band 𝑛 = 1 to the highest band 𝑛 = ?̄? at zero temperature, making ?̄? electrons per unit cell. In the spin-degenerate case, the bands are counted with respect to their algebraic duplicity. As will be proven later, it is crucial that the system is insulating, i.e. the occupied and the unoccupied bands are well separated by a band gap ∀𝒌 ∈ CG. Eq. (7.60) fails because the phase information of the wavefunctions is omitted by the modulus, so current terms between adjacent unit cells are missing. Beside the mathematical reason, there is another, general point, why Eq. (7.60) cannot be well defined: In real experiments, the bulk polarisation cannot be determined in a single measurement. Instead, a series of measurements has to be conducted whereupon 113 7 Geometric phases the polarisation changes can be deduced from the transported charge (current). The same holds in solid-state theory since Berry phases are observable only as phase differences. Resta begins his considerations with defining a charge redistribution between two states (𝜆 = 0) and (𝜆 = 1) of a crystal, where 𝜆 represents the reac- tion coordinate of the polarisation process. In the exemplary case of a ferroelectric, 𝜆 describes the transition from the unpolarised into the polarised state. In order to resolve the delocalisation problem of the Bloch functions, he changes the basis to the Wannier functions (vide supra, Eq. (2.29)). Since localised, the Wannier density can safely be inserted into Eq. (7.60). As a result, the change in polarisation equals a change in the centres of charge of the Wannier functions (𝜆 )𝑎𝑛 (𝒙) = ⟨𝒙 | (𝜆 ) 𝑎𝑛 (0)⟩, which depend on the∑︁rea(ction coordinat)e 𝜆 via the Bloc∑︁h f∫unctions | (𝜆 ) [ 𝑢𝑛 (𝒌)⟩.− ?̄? ?̄?𝑒 〈 〉 〈 〉 Eq. (2.31) −𝑒 ] Δ𝑷 𝒙 (1) − 𝒙 (0) = d 𝒙 (1) 2 (0) B |C | 𝑛 ∫ 𝑛 |C |2 𝑥 𝑎 (𝒙) 𝑛 − 𝑎𝑛 (𝒙) 2 R 𝑛=1 ∑︁ [〈 R 𝑛=1?̄?Eq. (2.32) −𝑒 ]= d (1)(𝒌) i∇ (1) 〉 〈 (0) 𝑘 𝑢 𝑢 (𝒌) − 𝑢 (𝒌) i∇ (0) 〉(2π)3 𝑛 𝑘 𝑛 𝑛 𝑘 𝑢𝑛 (𝒌) 𝑛=1 CG In order to extract the geometric phases, the expression for Δ𝑷 is evaluated by the components with respect to the reciprocal basis {𝑮 𝑗}. For this purpose, let 𝑖 ∈ [1, 𝑁] be a fixed index. The polarisation component Δ𝑃𝑖 along 𝑮𝑖 then holds: Δ𝑃𝑖 B 𝑮𝑖 ·∑︁Δ𝑷?̄? ∫ [ ]−𝑒 〈 (1) 𝜕 (1) 〉 〈 (0) 𝜕 (0) 〉 (7.61)= |C | d𝑠 𝑢𝑛 (𝒔) i 𝑢𝑛 (𝒔) − 𝑢𝑛 (𝒔) i 𝑢 (𝒔)R [0,1]𝑁1 𝜕𝑠𝑖 𝜕𝑠 𝑛𝑛= 𝑖 where 𝒔 is the relative coordinate vector of the k-vector in the reciprocal unit cell2. The integrand in Eq. (7.61) reminds of the Berry connection A (Eq. (7.55)). In order to clarify the analogy, a two-dimensional parameter 𝝃 ∈ [0, 1]2 is introduced whose first component is the reaction coordinate 𝜆 C 𝜉0. The second component is the relative coordinate 𝑠𝑖 C 𝜉1 of 𝒌 along 𝑮𝑖 . The corresponding two-dimensional Berry connection A 3𝑖∑︁then∏ho[l∫ds :?̄? 1 ] [〈 ] A (𝝃) = d ( 𝜆 ) 〉 𝑖 𝑠 𝑗 𝑢𝑛 (𝒔) i∇ (𝜆 )𝜉 𝑢𝑛 (𝒔) with 𝝃 = (𝜆, 𝑠𝑖) (7.62) 𝑛=1 𝑗≠𝑖 0 2The reciprocal vector 𝒌 and its differential operator ∇𝑘 can be expressed in terms of 𝒔 ∈ [0, 1]𝑁 , the relative coordinates of 𝒌∑︁in the recipr∑︁ocal b(asis {𝑮 𝑗}.𝑁 1 𝑁 ) 1 ∑︁𝑁 𝜕 𝒌 = 𝑮 𝑗𝑠 𝑗 = 𝑮 𝑗 𝑹2π 𝑗 · 𝒌 ⇒ ∇𝑘 = 𝑹2π 𝑗 𝜕𝑠 𝑗=1 𝑗=1 𝑗=1 𝑗 3The (𝑁−1)-dimensional integral in Eq. (7.62) traces all dimensions except 𝑖, so it is an ordinary state summation like the 𝑛-sum over the bands. The 𝑖-th component of 𝒔 is excluded from the integration and composes the variation parameter 𝝃 together with the reaction coordinate 𝜆. 114 7.3 The electric polarisation in a crystal The line integral of A(𝝃) along the edge of its quadratic domain 𝛾 B 𝜕[0, 1]2 yields the Berr∮y phase: 𝜒B𝑖 = ∫ A𝑖 (𝝃) · d𝝃𝛾 1 ∫ 1 ∫ 0 ∫ 0 = ︸ d 𝜆 ︷A︷0 ( 𝜆 , 0 ︸) +︸ d 𝑠 ︷A︷1 ( 1 , 𝑠 ︸) +︸ d 𝜆 ︷A︷0𝑖 𝑖 𝑖 𝑖 𝑖 ( 𝜆 , 1 ︸) +︸ d 𝑠 𝑖 ︷A︷1𝑖 ( 0 , 𝑠 𝑖︸) (7.63)0 0 1 1 I II III IV where A 𝑗 (𝝃) denotes the 𝑗-th component of A𝑖 . The terms I and III of Eq. (7.63)𝑖 cancel each other because A𝑖 (𝜆, 0) = A𝑖 (𝜆, 1) ∀𝜆, reflecting the boundary conditions of the Bloch factors (Eq. (2.28b)). The remaining terms II and IV reproduce the electric polarisation in Eq. (7.61) up to a prefactor. −𝑒 B − ∮ 𝑒 Δ𝑃𝑖 = |C | 𝜒𝑖 = |C | A𝑖 (𝝃) · d𝝃 (7.64)R R 𝛾 Eq. (7.64) is the central result of [128]. It proves that the projection of the static electric polarisation on a G-vector can indeed be considered a geometric phase of the para- meter space spanned by the reaction coordinate and the k-path along that G-vector. From Eq. (7.59), it is directly clear that Δ𝑃𝑖 is gauge-invariant under G-periodic𝑈 (1) transformations, which preserve the band order. However, this restriction is a prob- lem in solid-state crystals since the bands of the occupied subspace in general swap indices when traversing the Brillouin zone or when progressing on the 𝜆-path. For this reason, Resta generalised the proof for gauge invariance to more general 𝑈 (?̄?) transformations, which mix states of the occupied subspace with different band in- dices [128]. The proof can be found in the appendix (Sec. C.2). As a consequence of the boundary conditions of the Bloch factors, the parameter space is special in that its (𝜆, 𝑠𝑖 = 0) λ = 1 si and (𝜆, 𝑠𝑖 = 1) edges are identical. In other words, the do- main of the Berry connection A(𝝃) is a cylinder without bases, like that sketched in Fig. 7.2. Its length corresponds λ to the progressive nature of 𝜆 and its circumference re- flects the periodicityA on the reciprocal lattice. One could λ = 0 have the idea that calculating only one part of Eq. (7.61), si e.g. the 𝜆 = 1 part, might yield a physical quantity like an “absolute” polarisation. It corresponds to integrating Figure 7.2: Path for calcu-lating the electric polarisa- the geometric phase solely along one of the two circles in tion as a geometric phase. Fig. 7.2. However, the single terms of Eq. (7.61) are ill- defined because they are gauge-dependent. More precisely, according to Eq. (C.108) and the boundary conditions of the Bloch factors (Eq. (2.28b)), the loop integrals along the individual circles are only defined up to integer multiples of 2π. Con- sequently, the single summands of Δ𝑃𝑖 are defined only modulo 2π −𝑒| C | , which cor-R responds to an advance of the extensive polarisation |CR |𝑷 by −𝑒 · 𝑹𝑖 . This reflects that the Wannier centres are defined only modulo a direct lattice vector [128]. 115 7 Geometric phases From a topological point of view, the subtraction of the two circles together with the cancelling 𝜆-edges establishes a loop integral over the edge of a simply connected rectangle which is wound up to a cylinder. In other words, the loop has an inner area, so the integral assumes a distinct value according to Stokes. This is analogous to the ABE, where only the complete knowledge about the gauge fields on a simply connected region guarantees a well-defined ABE phase. Calculating only one of the circles of the cylinder in Fig. 7.2 is ill-defined because this circle has no inner region and is thus not simply connected. Although clear from the mathematics, it is quite astonishing that the existence of the 𝜆-paths is crucial for the definition of the polar- isation, despite their effective cancelling. 7.4 Topological transport 7.4.1 The integer Hall effect The integer Hall effect (IHE) is based on topological transport, which is a straight- forward application of the modern theory of polarisation. Only the points related to topology are addressed in the following paragraphs, while the details about the IHE as well as explicit calculations on the two-dimensional electron gas are passed. Let {|𝑢𝑛(𝒌)⟩} be the occupied set of Bloch factors of an insulating, two-dimensional Bloch system with direct basis vectors 𝑹1 and 𝑹2 and reciprocal basis vectors𝑮1 and 𝑮2. In order to calculate the response of the Bloch factors to an in-plane, constant, homogeneous electric field 𝑬, consider Faraday’s law: −𝜕𝑨(𝑡)𝑬 = ⇒ 𝑨(𝑡) = −𝑡 𝑬 𝜕𝑡 Without loss of generality, 𝑬 may be antiparallel to𝑮1 (Sketch in Fig. 7.3 (a)). Accord- ing to the principle of minimal coupling4, the vector potential 𝑨(𝑡) couples into 𝒌 of the Bloch Hamiltonian (Eq. 2.27): ℎ(𝒑, 𝒙 | 𝒌) → ℎ(𝒑 + 𝑒𝑨(𝑡), 𝒙 | 𝒌) = ℎ(𝒑, 𝒙 | 𝒌 + 𝑒ℏ𝑨(𝑡)) (7.65) Obviously, 𝑬 shifts the k-vector by a time-dependent term 𝑒ℏ𝑨(𝑡) = − 𝑒 ℏ 𝑡 𝑬. Because of the parametric nature of 𝒌 in the Bloch formalism, this equals a time-dependent parametric variation of the Bloch manifold. In particular, the k-vector advances once across the Brillouin zone when the elapsed time is such that 𝑒ℏ𝑨(𝑡) = 𝑮1: ℏ|𝑮1 | ℎ|𝑹2 | Δ𝑡 = = 𝑒|𝑬 | 𝑒|𝑬 | |CR | The field-related parametric variation permits the calculation of the concomitant electric polarisation according to Sec. 7.3. For this purpose, a two-dimensional para- 4Mind that the charge 𝑞 is replaced by −𝑒 116 7.4 Topological transport meter 𝝃 ∈ [0, 1]2 is introduced. The first component 𝜉1 represents the advance of the k-vector due to the electric field. It equals 𝑠1, the coordinate of 𝒌 with respect to the reciprocal basis vector 𝑮1, and corresponds to the reaction coordinate 𝜆 in Sec. 7.3. The second component 𝜉2 arises from the Bloch-phase integration for obtaining the Wannier centres. It equals the relative coordinate 𝑠𝑖 if the polarisation component along 𝑮𝑖 is considered, the same 𝑠𝑖 as that in Sec. 7.3. Integrating the Berry connection A𝑖 (𝝃) along the edge of its domain 𝛾 = 𝜕[0, 1]2 yields the Berry phase 𝜒B, which is p∑︁roportional to the polarisation 𝑮𝑖 · Δ𝑷.𝑖 ?̄? 〈 〉A𝑖 (𝝃) = 𝑢𝑛(𝝃) i∇∮ 𝜉 𝑢𝑛(𝝃) (7.66)𝑛=1 · −𝑒 B −𝑒𝑮𝑖 Δ𝑷 = |C | 𝜒𝑖 = |C | A𝑖 (𝝃) · d𝝃 (7.67)R R 𝛾 For the same reasons as those discussed in Sec. 7.3, Eq. (7.67) is invariant under𝑈 (?̄?) transformations and, thus, the polarisation is observable in principle. The polarisa- tion component along 𝑮1, parallel to the electric field, vanishes since the parameter 𝝃 has identical components (𝑠1, 𝑠1), so the line integral in Eq. (7.67) does not enclose any finite area. Therefore, Δ𝑷 can be written in terms of the component perpendic- ular to the electric field: 𝑹 𝑮 21 · Δ𝑷 = 0 ⇔ Δ𝑷 = 𝑃 |𝑹2 | Along the other reciprocal basis vector 𝑮2, the polarisation is proportional to the Berry phase 𝜒B C 𝜒B2 : 𝑮2 · 2π −𝑒 Δ𝑷 = | |𝑃 = B 𝑹2 |CR | 𝜒 In summary, each passage of Δ𝑡 pumps a certain amount of charge along 𝑹2, which leads to a current on average. The corresponding current density 𝒋 is exactly the quotient between the polarisation and the elapsed time: Δ𝑷 −𝑒|𝑹 | 1 𝑒|𝑬 | |C | 𝑹 −𝑒2 1 𝑹 𝒋 = = 2 B R 2 B 2 Δ𝑡 |C | 𝜒 = 𝑬 𝜒 (7.68)R 2π ℎ|𝑹2 | |𝑹2 | ℎ 2π |𝑹2 | If 𝜒B is positive, 𝒋 is turned by +90◦ relatively to 𝑬, as clarified by the sketch in Fig. 7.3 (a). Since the considered lattice is arbitrary, the right-angled relation between 𝒋 and 𝑬 can be summ( arised a)s Ohm’s law: 𝜎 𝜎 2 𝒋 = 𝜎𝑬 = 𝑥𝑥 𝑥𝑦 𝑒 1 𝑬 with 𝜎 B 𝜎 𝜎 𝑥𝑥 = 𝜎𝑦𝑦 = 0, 𝜎𝑥𝑦 = −𝜎𝑦𝑥 = − 𝜒 (7.69) 𝑦𝑥 𝑦𝑦 ℏ 2π The astonishing detail about the above considerations becomes manifest if the loop integral of the Berry connection A(𝝃) in Eq. (7.67) is written as a surface integral of 117 7 Geometric phases (a) (b) R2 R1 G2 G1 s1 E j s2 Figure 7.3: (a) Sketch of the directions of the integer Hall effect and (b) path of 𝝃 on the Bril- louin torus for calculating the topological transport. the Berry pot∑︁ential B∬(𝝃) according to Stokes∬:1 ?̄? ?̄?B 1 d · B( ) [127] ∑︁ i 〈 〉𝜒 = 𝑺 𝝃 = d𝑺 · ∇ 𝑢 (𝝃) × ∇ 𝑢 (𝝃) (7.70) 2 𝜉 𝑛 𝜉 𝑛π 𝑛=1 2π 𝑆 1 2π𝑛= 𝑆 where 𝑆 denotes the surface enclosed by 𝛾. As Berry noted, the integrand in Eq. (7.70) resembles a curvature whose surface in- tegral yields the first Chern class of the Hermitian line bundle represented by the eigenstates [127]. Moreover, and this is special about the Berry phase considered here, the surface is a closed manifold since the reciprocal unit cell is homeomorphic to the torus 𝕋2 (Fig. 7.3 (b)). As a consequence, the integral in Eq. (7.70) inclusive the prefactor assumes integral values, leading to a quantisation of the conductivity in Eq. (7.69): 𝑒2 𝑒2 𝜎𝑥𝑦 = − 𝑖 = 𝑖 𝜎0𝑥𝑦 with 𝑖 ∈ ℤ and 𝜎0𝑥𝑦 = − (7.71)ℎ ℎ The above considerations show that the appliance of a lateral electric field 𝑬 on a two-dimensional, insulating Bloch system raises a current density 𝒋 which is strictly perpendicular to 𝑬. The corresponding off-diagonal conductivity 𝜎𝑥𝑦 is quantised by the quantum conductivity 𝜎0𝑥𝑦 which depends only on natural constants, but not on any details of the system. The quantum number 𝑖 depends on the system and characterises it in the manner of a topological invariant. In other words, 𝑖, or equi- valently 𝜎𝑥𝑦, is a fundamental property of Bloch systems and categorises them into topological classes. The topological classification holds in general for all Bloch sys- tems which fulfil the prerequisites of being two-dimensional and insulating. On the contrary, topological classes are ill-defined for metallic systems. The Hall conduct- ivity of a system is protected as long as the topological class does not change, which means under all transformations which retain insulating state. This type of protec- tion is much stronger than protection by mere symmetry since the latter permits a continuous, gap-conserving destruction of the concomitant properties. An important consequence of non-trivial Berry phases is that the Bloch functions cannot be defined within a global gauge. Instead, there are overlapping patches on the torus in Fig. 7.3 118 7.4 Topological transport (b) where the phase factors are continuously defined. A transition function permits the change from one patch to the next by adding the necessary gauge. The integer in Eq. 7.71 “is then related to the winding number of the phase of the transition function around a non-contractable path” [142]. So far, the topologically invariant nature of 𝜎𝑥𝑦 is proven in a general manner. One way to actually realise topologically non-trivial Bloch systems is the integer Hall ef- fect (IHE), which occurs in a two-dimensional homogeneous electron gas subjected to a strong, vertical magnetic field at a low temperature [125, 126]. In the model pic- ture, the magnetic field introduces Landau levels, which, when integrally occupied, render the electronic system insulating. In such a state, the Hall conductivity 𝜎𝑥𝑦 as- sumes integer multiples of 𝜎0𝑥𝑦, hence the name integer Hall effect. The topological state of this system can transition from 𝜎𝑥𝑦 to 𝜎 0𝑥𝑦 ± 𝜎𝑥𝑦 if the magnetic field varies. However, this entails a gap closing. Such effects from topological phase transitions are clearly visible in the IHE experiments as the parallel conductivity 𝜎𝑥𝑥 is zero if the system is in an IHE state (→ insulating), and finite if the system changes the IHE state (→metallic) [125, 126]. The IHE can be modelled by Laughlin’s gedankenexperiment, which will be discussed when considering the ℤ2 insulators (vide infra). Therein, the topological transport is explicitly derived from the wave functions of the Landau levels and Faraday’s law of induction combined with an ABE phase shift (“flux threading”) [126]. However, Laughlin’s gedankenexperiment bears problems if a periodic system is considered. Thouless, Kohmoto, Nightingale and Nijs resolved them by theoretically investigat- ing an explicit Bloch system involving a sinusoidal potential and a vertical magnetic field. They calculated the Hall conductivity with the Kubo formula and obtained an expression equivalent to Eq. (7.67). Although they did not extract the topological nature, they argued that the emerging loop integral has to assume integer multiples of 2π, which leads to a quantised conductivity [143]. As they were the first who es- tablished a way of calculating the topological invariant for a quantum Hall state, the integer 𝑖 in Eq. (7.71) is also called TKNN integer. 119 7 Geometric phases 7.4.2 The two-dimensional ℤ2 insulator Theℤ2 insulators are closely related to the IHE insulators. They are classified by aℤ2 topological invariant, which, like the TKNN integer, describes topological transport, not of charge, but of spin or, more precisely, of time-reversal polarisation (defined later). Kane and Mele [144] were the first who demonstrated the existence of aℤ2 in- variant in the quantum spin Hall phase constructed from two copies of the Haldane model [145]. In the simplest case, the z-component of the spin 𝑠𝑧 is conserved, so the copies represent each an independent IHE system with a TKNN integer classifying the topological transport of charge. Since time-reversal symmetry (T-symmetry) is presupposed, the TKNN integers of the two subsystems have equal magnitudes and opposite signs, so the total TKNN integer of the system is trivial5. However, “the differ- ence [between the two integers may be] non-zero and defines a quantized spin Hall conductivity” [144]. The key point of [144] consists in the demonstration that the non- trivial property of the system is retained if the 𝑠𝑧-conservation is explicitly broken. The former topological transport of spin quanta 𝑠𝑧 then degrades to a shift in the expectation value of the spin operator ⟨𝑆𝑧⟩, which was later denoted time-reversal polarisation [142]. If the perturbation is strong enough, the system trivialises by a change of its ℤ2 invariant to zero. However, the transition involves a closing of the band gap at some point, as expected from the discontinuous nature of the change between topological classes [144]. There are many different ways to derive the to- pological ℤ2 invariant for two and three dimensions, which employ more or less ab- stract topological field theories, summarised in [146]. The method presented here is based on the IHE and demonstrates an alternative reason for the ℤ2 nature of the invariant without going too deeply into the matter of topology. Two-dimensional T-symmetric Bloch systems In view of the three-dimensional ℤ2 topological insulators relevant for this work, it is instructive to demonstrate the definition of the ℤ2 topological invariant for a two- dimensional T-symmetric Bloch system at first and then to generalise the findings to a three-dimensional crystal. This exactly follows the spirit of [142, 144, 147]. Hence, let {|𝑢𝑛(𝒌)⟩} be the occupied set of Bloch factors of an insulating, T-symmetric 2D Bloch system. It may host 2?̄? electrons per unit cell. The real-space domain is spanned by the direct basis vectors 𝑹1 and 𝑹2 and the reciprocal unit cell by the reciprocal basis vectors 𝑮1 and 𝑮2. Because what follows is slightly complicated, the lattice geometry is simplified by the constraint that both R-vectors 𝑹1 and 𝑹2 have a length of 1 and are orthogonal to each other so that they equal the euclidean coordinate system. Consequently, the G-vectors𝑮1 and𝑮2 are also orthogonal to each other and have a length of 2π. The Brillouin zone is thus a square defined by the points (±π,±π). The analysis of the IHE (Sec. Sec. 7.4.1) shows that this simplification does not alter the general findings. 5From an interpretational point of view, all Bloch states of a T-symmetric system occur in Kramer’s pairs (defined later), so the Wannier centres of each pair move into opposite directions under any T-symmetry-conserving variation. As a consequence, the transport of net charge is inhibited. 120 7.4 Topological transport Time reversal is described by an antiunitary operator Θ [148], which for fermions and in the representation of [142] holds: ( ) Θ = exp(i π 0 12𝜎𝑦)𝐾 = i𝜎𝑦𝐾 = −1 0 𝐾 (7.72) where 𝜎𝑦 is the second Pauli matrix and 𝐾 is the complex-conjugation operator. A T-symmetric Hamilton operator implies that each eigenstate |𝑛⟩ has the same ei- genenergy 𝐸𝑛 as its time-reversed counterpart Θ|𝑛⟩. Hence, the eigenstates occur in degenerate pairs, which are called Kramers pairs. The Kramers pairs of a Bloch system comprise each two states with opposite k-vectors: Θℎ(𝒑, 𝒙 | 𝒌)Θ−1 = Θ ?̃? 𝐻 (𝒑, 𝒙)?̃?−1 Θ−1𝒌 𝒌 = ?̃?−𝒌𝐻 (𝒑, 𝒙)?̃? −1 −𝒌 = ℎ(𝒑, 𝒙 | −𝒌) (7.73) where 𝐻 (𝒑, 𝒙) is the T-symmetric Hamilton operator and ℎ(𝒑, 𝒙 | 𝒌) the correspond- ing Bloch Hamiltonian. Since electronic Bloch systems host independent spin-12 fermions, the rotating action of i𝜎𝑦 in Θ specifies the Kramers pairs to be linked components of the same spinor. In detail, the occupied Bloch manifold of 2?̄? fermionic eigenstates groups into ?̄? spinors, which permits identifying ?̄? states of each component I and II, spin-up and spin-down in the 𝑠𝑧-conserving case . The ac〉tion of Θ then holds [142]: 〉𝑢I𝛼(−𝒌) 〉= − exp(i𝜒𝛼(𝒌)) Θ 𝑢II𝛼 (𝒌)𝑢II 〉 (7.74)𝛼 (−𝒌) = exp(i𝜒 I𝛼(−𝒌)) Θ 𝑢𝛼(𝒌) The minus signs ensure that Θ2 = −1. The gauge functions exp(i𝜒𝛼(𝒌)) seem to be arbitrary and are often replaced by 1 if topologically trivial systems are considered. Though, they turn out to be essential forℤ2-insulating systems since the non-triviality negates the existence of a global gauge for both components [142]. In the electronic band structure, two Kramers-paired bands obviously meet at the Γ-point (𝒌 = 0) and form a twofold, same-k-vector degeneracy, i.e. the eigenspace at 𝒌 = 0 is mapped onto itself under Θ. For this reason, the Γ-point is called a time- reversal-invariant momentum (TRIM), denoted 𝚪0. In addition to 𝚪0, the periodicity of the Bloch functions |Ψ𝑛(𝒌)⟩ on G produces three other TRIMs in CG where the Kramers pairs meet agai{n t}o fo{rm a same-k degeneracy. This makes a total of fourirreducible TRIMs: 𝚪 1 1 } 𝑖 = 0, 2𝑮1, 2𝑮2, ( 1 2𝑮1 + 1 2𝑮2) (7.75) Partial, sum and difference polarisations While T-symmetry expectedly inhibits the transport of net charge, the individual components might still show a shifting of a non-zero quantity. This leads to the defini- tion of partial polarisations 𝑷I and 𝑷II which arise in the components I and II, respect- ively, and which are determined by a procedure analogous to the IHE case. Since all types of polarisations 𝑷• emerge perpendicular to the applied vector potential shift, they have only one non-vanishing component in the appropriate basis. The bold face 121 7 Geometric phases of 𝑷• is retained so as to indicate its vectorial dimension. Whenever scalars are ad- ded, it means that they are added to the non-vanishing component. At first, the partial Berry connections A𝜎 (𝒌) are calculated from the Bloch factors of the respective component. ∑︁ 〈 〉 A𝜎 ?̄? (𝒌) = 𝑢𝜎𝛼(𝒌) i∇ 𝑘 𝑢𝜎𝛼(𝒌) (7.76)𝛼=1 where 𝜎 denotes the component and 𝛼 enumerates the Kramers pairs. Then, the partial Berry phases 𝜒B,𝜎 are calculated by integrating A𝜎 (𝒌) along the edge of the reciprocal unit cell. In order to exploit symmetries from the TRIM-nature of the Γ- point, the integration loop is shifted to∮the edge of the quadratic Brillouin zone (bluesquare in Fig. 7.4 (a)). 𝜒B,𝜎 = A𝜎 (𝒌) · d𝒌 (7.77) 𝜕BZ Finally, each traverse of the k-vector produces∮a partial polarisation 𝑷𝜎 proportionalto 𝜒B,𝜎6: 1 𝑷𝜎 1 = 𝜒B,𝜎 1 𝜎 − = A (𝒌) · d𝒌 (7.78)𝑒 2π 2π 𝜕BZ The Kramers symmetry establishes the follo∑︁wing link between AI(𝒌) and AII(𝒌). AI(−𝒌) = AII(𝒌) +∑︁ ?̄? (∇𝑘𝜒𝛼) (𝒌) (7.79a)𝛼=1 II I ?̄?A (−𝒌) = A (𝒌) − (∇ 𝜒 ) (−𝒌) (7.79b) 𝛼=1 𝑘 𝛼 The proof for these equations can be found in the appendix (Sec. C.2). The sum of AI(𝒌) and AII(𝒌) results in the total Berry connection of the system, denoted A+(𝒌). It describes the total charge polarisation 𝑷+ and is the same Berry connection as that constructed for the IHE system (Eq. (7.56)). Inserting the Kramers transformation (Eqs. (7.79)) and calculating the curl of A+(𝒌) shows that the total Berry potential B+(𝒌) is antisymmetric with respect to∑︁the Γ-point: A+ ?̄? (𝒌) B AI(𝒌) + AII(𝒌) = AI(𝒌) + AI(−𝒌) − (∇𝑘𝜒𝛼) (𝒌) (7.80a) 𝛼=1 B+(𝒌) = ∇𝑘 × A+(𝒌) = BI(𝒌) − BI(−𝒌) ⇒ B+(−𝒌) = −B+(𝒌) (7.80b) For this reason, the surface integral of B+(𝒌) over the entire Brillouin zone vanishes, as do the total Berry phase 𝜒B,+ and the tota∬l charge polarisation 𝑷+. 1 𝑷+ 1 = 𝜒B,+ +− = B · d𝑺 = 0 (7.80c)𝑒 2π BZ Eqs. (7.80) permits the following interpretation: The partial polarisations 𝑷I and 𝑷II 6Cf. Eq. (7.67) inserting the unit cell volume |CR | = 1 and the reciprocal basis vector |𝑮 | = 2π. 122 7.4 Topological transport exactly cancel each other, implying that the Wannier centres of the Kramers pairs move equally into opposite directions. This accords with the expectations from T- symmetry and explains why the topological transport of net charge is inhibited. Of course, the vanishing total polarisation is gauge-invariant, for the same reasons as those discussed in Sec. 7.4.1. Taking the difference between AI(𝒌) and AII(𝒌) results in the time-reversal Berry connection A− (𝒌). It describes the time-reversal polarisation 𝑷− . Inserting the Kra- mers transformation (Eqs. (7.79)) and calculating the curl of A− (𝒌) shows that the time-reversal Berry potential B− (𝒌) is symmetric with∑︁respect to the Γ-point: A− ?̄? (𝒌) B AI(𝒌) − AII(𝒌) = AI(𝒌) − AI(−𝒌) + (∇𝑘𝜒𝛼) (𝒌) (7.81a) 𝛼=1 B− (𝒌) = ∇𝑘 × A− (𝒌) = BI(𝒌) + BI(−𝒌) ⇒ B− (−𝒌) = B− (𝒌) (7.81b) Because of the centro-symmetry, the surface integral of B− (𝒌) over the entire Bril- louin zone equals twice that over the positive half plane (shaded area in Fig. 7.4 (a)). Therefore, the time-reversal Berry phase 𝜒B,− and, thus, the time-reversal polarisa- tion 𝑷− are not necessarily zero∬. ∬ 𝜒B,− = B− · d𝑺 = 2 B− · d𝑺 (7.81c) BZ BZ+ Another representation for 𝜒B,− follows from the equality of the integrals of 𝐵I(−𝒌) and 𝐵I(+𝒌) due to the centro-sym∬metry of the Brillouin zone: 𝜒B,− = 2 BI · d𝑺 = 2𝜒B,I = −2𝜒B,II (7.81d) BZ Eq. (7.81d) immediately implies that the time-reversal Berry phase 𝜒B,− is quantised: Since each component 𝜎 represents a Bloch manifold, the respective partial Berry phase 𝜒B,𝜎 assumes integer multiples of 2π, for the same reasons as those discussed in Sec. 7.4.1. Consequently, the time-reversal Berry phase 𝜒B,− assumes integer mul- tiples of 4π. In terms of polarisations, the partial polarisations 𝑷𝜎 are up to the sign equal integer multiples of the unit polarisation, −𝑒 in this case. The time-reversal polarisation 𝑷− thus assumes even multiples of −𝑒. 𝑷I = −𝑷II = −𝑖 𝑒, 𝑖 ∈ ℤ ⇒ 𝑷− = −2𝑖 𝑒 (7.82) The link between these quantities is summarised in the upper part of Fig. 7.4 (b). Although the time-reversal polarisation 𝑷− is quantised by a curvature integral, it is not yet an observable as the gauge invariance remains to be proven. From Eq. (C.108), it is clear that the individual partial Berry phases are invariant under 𝑈 (?̄?) transformations which mix the states within a fixed component. If the components are well-defined by good quantum numbers, 𝑷− is gauge invariant in the form of Eq. (7.82) without further restrictions. However, if the components are not conserved, i.e. unitary transformations are considered which mix them, 𝑷− obviously depends 123 7 Geometric phases (a) (b) P+ = 0 ⊕ π Γ2 Γ3 PI = −i e PII = +i e I II BZ+ ∂CG ⊖ -π Γ π0 Γ1 P− = −2i e y −e -π ∂BZ x U ⇄ +2e −2e +e Γ2' Γ3' P− += 4e Figure 7.4: (a) Sketch of the reciprocal space of the T-symmetric Bloch system. The red lines indicate the primitive reciprocal unit cell and the blue lines the Brillouin zone. The edges of both can be chosen as the integration path for obtaining the Berry phases. 𝚪𝑖 are the TRIMs. The positive half plane of the Brillouin zone is shaded grey. (b) Flow chart of the connection between the partial polarisations 𝑷I and 𝑷II, the total polarisation 𝑷+ and the time-reversal polarisation 𝑷− . The lower part shows how a unitary transformation𝑈⇄ flipping a Kramers pair transfers one unit polarisation −𝑒 between 𝑷I and 𝑷II, which changes them by 2 unit polarisations and increases 𝑷− by 4 unit polarisations. on the division of the Kramers pairs into the two components and, thus, on the gauge. The question is then which part of 𝑷− remains observable. For the answer, consider the action of a unitary transformation𝑈⇄ which simply swaps the components of a certain Kramers pair. According to Eq. (7.82), the transformation can transfer only even multiples of the unit polarisation from 𝑷I to 𝑷II. At the same time, the oppositely equal polarisation shifts from 𝑷II to 𝑷I, so the time-reversal polarisation 𝑷− changes by multiples of 4 unit polarisations. The effect of𝑈⇄ is sketched in Fig. 7.4 (b). The ℤ2 topological invariant In summary, the time-reversal polarisation 𝑷− has two properties: 1) It is quantised into even multiples of the unit polarisation. 2) It is defined only modulo 4 unit polarisations for non-conserved components. Analogously to the IHE, 𝑷− produces a current on average, provided that the com- ponents are characterised by good quantum numbers, e.g. in spin-conserving Bloch systems. This is exactly the quantum spin Hall effect, where an electric field gives rise to a spin current at a spin Hall conductivity 𝜎−𝑥𝑦. Property 1) implies that 𝜎−𝑥𝑦 is quantised into even multiples of the quantum conductivity. 𝜎−𝑥𝑦 = 2𝑖 𝜎 0 𝑥𝑦 = −2𝑖 𝑒2/ℎ, 𝑖 ∈ ℤ (7.83) 124 7.4 Topological transport The integer 2𝑖 is analogous to the TKNN integer and defines a topological invariant for the same reason. For non-conserved components, the 4 modulus from property 2) restricts 𝜎−𝑥𝑦 to two values: zero or twice the quantum conductivity. 𝜎−𝑥𝑦 = 2𝑖 𝜎 0 𝑥𝑦 = −2𝑖 𝑒2/ℎ, 𝑖 ∈ ℤ2 (7.84) The cyclic structure renders 𝜎−𝑥𝑦 a ℤ2 invariant. All insulating T-symmetric 2D Bloch systems belong either to the trivial class (𝑖 = 0) or to the non-trivial class (𝑖 = 1). The latter is referred to as the class of ℤ2 topological insulators or simply ℤ2 insu- lators. The arbitrariness of the sign of 𝜎−𝑥𝑦 negates the quantum spin Hall effect and questions if there is anything observable at all. The key point is that there exists a topologically non-trivial shifting in topologically non-trivial phases and that these phases are distinct from the trivial systems in that they cannot be transformed into each other without closing the band gap. The analysis presented in this work shows in a systematic and comprehensible way how the ℤ2 topological invariant emerges from the Kramers symmetry by applying methods which are familiar from the IHE. Furthermore, it gives concise reasons for why integrating the Berry connection over the edge of half the Brillouin zone is suf- ficient for calculating the ℤ2 invariant. The original literature concerning this issue [142, 144, 147] calculates the response of the edges by threading a half quantum flux through a Laughlin cylinder. It employs multiply valued functions and branches of logarithms and square roots in order to extract theℤ2 invariant. The following equa- tions prove that the ℤ2 formulation presented here is equivalent to that of Fu and Kane [142]. If dropping the factor 2, the remainder of Eq. (7.81c) is the time-reversal polarisation due to the advance of the k-vector across half the Brillouin zone. This can then be re∬formulated in ter∮ms of an edge inte∮gral of A− (𝒌). ∮ 1 2π 𝑷− = B− · d Stokes𝑺 = A−− (𝒌) · d𝒌 = A − (𝒌) · d𝒌 + A− (𝒌) · d𝒌 2 𝑒 BZ+ ∫ 𝜕BZ+ 𝚪′2∫𝚪′3𝚪1𝚪0 𝚪0𝚪1𝚪3𝚪2 (∗) 𝚪2 𝚪3 = − [(A∫− (𝒌) + A− (−𝒌)] · d𝒌 + [A− (𝒌) + A− (−𝒌)] · d𝒌𝚪0 𝚪1 ) Eq. (C.110) 𝚪2 [ ∑︁ = −∫ A+ ] (𝒌) − A+(−𝒌) · d𝒌 + 2∑︁ [𝜒𝛼(𝚪[ ] 2 ) − 𝜒𝛼(𝚪0)] 𝚪0 𝛼 𝚪3 + A+(𝒌) − A+(−𝒌) · d𝒌 + 2 [𝜒𝛼(𝚪3) − 𝜒𝛼(𝚪1)] (7.85) 𝚪1 𝛼 Step (∗) uses that the horizontal paths of 𝜕BZ+ in Fig. 7.4 (a) cancel each other because of the boundary conditions of the Bloch factors. For the same reasons, the vertical paths can be mapped onto the upper half plane where −𝚪′2𝚪 ′0 equals 𝚪2𝚪0 and −𝚪3𝚪1 equals 𝚪3𝚪1. The individual terms of Eq. (7.85) equal Eq. (3.21) in [142], from which Fu and Kane derive the ℤ2 invariant as the product of the phases of the Pfaffians of the time-reversal matrix. 125 7 Geometric phases Edge states As noted in the course of defining the ℤ2 invariant, a non-zero time-reversal con- ductivity (Eq. (7.84)) raises the question whether there is an observable current at all if the components are not conserved. Part of this question is answered in [144]: A weak perturbation which breaks spin conservation alters the former current of quantised spin to a reduced current of magnetisation density along the quantisation axis. However, although the spin quantisation is lifted, the ℤ2 invariant remains the same as long as the perturbation does not close the gap. Another manifestation of the ℤ2 invariant is the behaviour of the edge states, as il- lustrated by the Laughlin gedankenexperiment in [142, 147]: Consider a sample of finite width and length which is wound up to a cylinder and has two edges at the top and the bottom. The x-coordinate of the Brillouin zone 𝑘1 parallel to the circumfer- ence may be the direction of the applied vector-potential shift. The y-coordinate 𝑘2 parallel to the axis is then the direction of the time-reversal polarisation. The vector potential shift is equivalent to threading a magnetic flux Φ through the cylinder. If the flux (divided by the circumference) equals the quantum flux ℎ/𝑒, the advance in 𝑘1 equals 𝑮1, the distance between the TRIM 𝚪0 and its copy 𝚪′0. A half quantum flux ℎ/2𝑒 thus corresponds to an advance of𝑮1/2, the distance between the TRIMs 𝚪0 and 𝚪1. The setting is sketched in Fig. 7.5 (a). Once the setting is clear, the gedankenexperiment is as follows. At zero flux (𝒌 = 𝚪0), all Kramers pairs are same-k degeneracies and either doubly occupied or empty. This holds also for any states arising at the edges of the cylinder. If 𝒌 now advances by 𝑮1 towards 𝚪′0, one of two possibilities occurs: 1) The band structure is trivial and the Kramers pairs associated with the edges end up in the same configuration as the initial one, i.e. states which are Kramers partners at 𝚪0 are also partners at 𝚪′0, as sketched in Fig. 7.5 (b). For example, the Kramers pair denoted 1 splits up at 𝚪0, but finds together again at 𝚪′0. In particular, if this Kramers pair is doubly occupied in the beginning, it will be doubly occupied in the end. This accords with the prerequisite that the time- reversal polarisation due to the traverse of the k-vector has to be zero. 2) The band structure is non-trivial and the Kramers pairs associated with the edges end up in a configuration different from the initial one, i.e. the states change partners, as sketched in Fig. 7.5 (c). For example, Kramers pair 1 splits up at 𝚪0 and its upper branch (red) goes to Kramers pair 2 at 𝚪′0 where it finds a new partner from the bulk conduction bands. The lower branch (blue) vanishes in the bulk valence bands. In turn, the Kramers pair 1 at 𝚪′0 is restored by a blue branch from Kramers pair 2 and a red branch from the bulk valence bands. As a consequence of the partner change, if Kramers pair 1 is doubly occupied and Kramers pair 2 is empty in the beginning, they both will be singly occupied in the end, which corresponds to the transport of one electron. Since the oppos- ite happens at the other edge, the net charge transport is zero. However, the time-reversal polarisation amounts to 2 unit polarisations, which reflects the non-trivial ℤ2 invariant. 126 7.4 Topological transport (a) Φ (b) E trivial bulk I II A 2 2 Edge Edges 1 1 states bulk Γ0 Γ1 Γ0' (c) E non-trivial Γ2 Γ3 I II k ~ A 2 2 Γ0 Γ1 Γ0' k 1 12 k1 G1 Γ0 Γ1 Γ0' Figure 7.5: (a) Sketch of the Laughlin-type gedankenexperiment. The sample is wound up to a finite cylinder and provides two edges. A magnetic flux Φ threading the cylinder corresponds to a shifting of the vector potential 𝑨, which in turn makes the k-vector 𝒌 advance in that dir- ection. (b) and (c): Sketches of possible topologies of T-symmetric edge band structures. Only the bands of one edge are shown, where red and blue lines indicate the two components. The bulk bands are shaded grey. At the TRIMs, red an blue bands have to form a same-k Kramers degeneracies. In (b), the Kramers pairs at different TRIMs are trivially connected (𝜎−𝑥𝑦 = 0), which allows the edge states to form a band gap. In (c), they are non-trivially con- nected (𝜎−𝑥𝑦 = 2𝜎0𝑥𝑦), which strictly prohibits the formation of band gaps. Adapted from [147]. As it is evident, the non-trivial band structure in Fig. 7.5 (c) guarantees that the Fermi level cuts at least one band associated with the edges of the cylinder. Furthermore, a Laughlin gedankenexperiment with swapped roles for 𝑘1 and 𝑘2 obviously yields the same results. This leads to the conclusion that the edges of a non-trivial ℤ2 insulator in general host metallic edge states which are topologically protected, i.e. there is no way to introduce a band gap within an adiabatic, T-symmetric transformation. On the contrary, the trivial band structure in Fig. 7.5 (b) either provides a gap in the edge states or the bands can be adiabatically shifted so that a gap opens. The schematic band structures in Fig. 7.5 (b) and (c) moreover show that the topolo- gical character of the edge states does not only emerge if the Laughlin-type gedanken- experiment is performed for a full Brillouin-zone traverse of 𝒌 from 𝚪 ′0 to 𝚪0. Since the T-symmetry strictly produces TRIMs halfway along the reciprocal basis vectors, also the half traverse from 𝚪0 to 𝚪1 causes topologically distinctive partner changes. In the trivial case, the states of each Kramers pair at 𝚪0 find together again at 𝚪1. In the non-trivial case, the states of each Kramers pair at 𝚪0 find new partners at 𝚪1. The Laughlin-type gedankenexperiment works also for a half quantum flux because the time-reversal Berry potential is symmetric, so integrals over the half of the Brillouin zone are sufficient for the calculation of the ℤ2 invariant. 127 7 Geometric phases 7.4.3 The three-dimensional ℤ2 insulator The ℤ2 classification of a T-symmetric 3D Bloch system is based on the behaviour of its surface states in the presence of a perturbing vector-potential shift. Analogously to the edge band structure in the 2D case, a surface band structure is topologically non- trivial if the surface states change partners when going from one TRIM to another. However, differently from the 2D case, there are more than one possibilities to make such considerations in three dimensions: Firstly, instead of two edges in the 2D case, a 3D crystal has three main surfaces which are defined by the reciprocal Γ2 Γ3 basis vectors 𝑮 𝑗 acting as surface normals. Secondly, in- Γ0 Γ1 stead of four TRIMs in the 2D case, the 3D Brillouin zone GΓ G16 Γ 37 provides eight TRIMs {𝚪𝑖}, which are determined analog- Γ4 Γ5 ously to Eq. (7.75) by taking also the third reciprocal basis vector 𝑮3 into account. For the sake of simplicity, the dir- ect and reciprocal unit cells of the T-symmetric 3D crystal Γ2 Γ3 G2 to be characterised are supposed to be both cubic. Then, Γ0 Γ1 the TRIMs define the vertices of a cube which is exactly an Figure 7.6: TRIMs 𝚪𝑖 in a eighth of the cubic Brillouin zone (Fig. 7.6). cubic Brillouin zone. Instead of one TRIM pair per edge in the 2D case, each surface of a T-symmetric 3D crystal provides six different TRIM pairs between which the surface band structure can be topologically trivial or non-trivial. The topological character is determined by quantifying the topological transport on a square which is defined by the respective TRIM pair and the two opposite TRIMs of the TRIM cube. In detail, the identification of non-trivial phases is analogous to the following example: • Consider the surface defined by the surface normal𝑮3 (shaded blue in Fig. 7.6). • Consider the first surface k-path defined by 𝚪0𝚪1, which is parallel to 𝑮1 (red labels in Fig. 7.6). • Calculate the time-reversal polarisation along 𝑮3 due to a vector-potential shift along 𝑮1 by integrating B− (𝒌) in the square spanned by 𝑮1 and 𝑮3. Accord- ing to Eq. (7.85), this corresponds to an integration within the TRIM square (𝚪0, 𝚪1, 𝚪5, 𝚪4) (red face in Fig. 7.6). If the time-reversal polarisation in that square is zero, the surface band structure along 𝚪0𝚪1 is trivial and looks like that Fig. 7.5 (b). Otherwise, it is non-trivial and characterised by partner changes (Fig. 7.5 (c)). The above procedure can be repeated for all surface paths defined by pairs out of {𝚪0, 𝚪1, 𝚪2, 𝚪3}. If the 𝑮3-surface is com- pletely characterised, the above procedure can be performed also for the two other surfaces defined by 𝑮1 and 𝑮2. It is obvious that the above procedure has many redundancies. For example, if the 𝑮3-surface paths 𝚪0𝚪1 and 𝚪0𝚪2 are both non-trivial, the𝑮3-path 𝚪1𝚪2 must be trivial because the ℤ2 invariant is only defined modulo 2. Moreover, the 𝑮3-surface path 𝚪0𝚪1 and the 𝑮1-surface path 𝚪0𝚪4 share the topological character because the TRIM 128 7.4 Topological transport square for calculating the ℤ2 invariant is the same7. In order to remove those re- dundancies, Fu and Kane simplified the characterisation by transforming Eq. (7.85) into a form which relates the topological transport to a product of units 𝛿𝑖 (𝒌) which are evaluated at each TRIM of the respective TRIM square. These units are the ratio between the square root of the determinant of the time-reversal matrix 𝑤(𝒌) and the Pfaffian of 𝑤(𝒌). In doing so, the branch of the square root has to be continuous between TRIM pairs. For a T-symmetric 2D Bloch system, the ℤ2 invariant is non- trivial if the produc∏t of the four 𝛿√︁𝑖 is −1. If the product is +1, the system is trivial[142]. 3 { } (−1)𝜈 det(𝑤(𝚪𝑖)) = 𝛿𝑖 , 𝛿𝑖 = ( ( )) ∈ −1, +1 ⇒ 𝜈 ∈ ℤPf 2 (7.86)𝑤 𝚪 𝑖=0 𝑖 A T-symmetric 3D Bloch system has eight TRIMs and thus eight 𝛿𝑖 . Consequently, there are still many possibilities of how the signs can be distributed over the cube. However, five important cases can be distinguished: 1) All eight 𝛿𝑖 are positive. 2) Two 𝛿𝑖 on an edge of the cube are negative, while the others are positive. 3) Two 𝛿𝑖 on a face diagonal are negative, while the others are positive. 4) Two 𝛿𝑖 on a space diagonal are negative, while the others are positive. 5) One 𝛿𝑖 is negative, while the others are positive. All other configurations can be transformed into one of the above cases by gauge transformations, i.e. by changing the signs of four 𝛿𝑖 lying in the same plane. In particular, case 5) is equivalent to all cases where an odd number of 𝛿𝑖 is negative. Case 1) is the trivial case as each TRIM square yields a 𝛿𝑖 product of +1. Consequently, each surface path supports a topologically trivial surface band structure. Cases 2), 3) and 4) produce topologically non-trivial surface band structures for a differing num- ber of surfaces and paths. These systems can be imagined as a stack of independent 2D ℤ2 insulators whose one-dimensional edges host topologically protected metallic states. However, because the non-triviality is unstable against redefining the Bril- louin zone, as it may occur in the presence of disorder, the edge states are not protec- ted against all adiabatic, T-symmetric transformations. For this reason, these systems are referred to as weak topological insulators [147]. The last case 5) is relevant for this work. Since only one 𝛿𝑖 is negative, any surface contains one TRIM from which all surface paths produce non-trivial band structures. Since case 5) is equivalent to all cases with an odd number of negative 𝛿𝑖 , a 3D ℤ2 invariant 𝜈0 ca be de∏fined via the√︁sign of the product of all eight 𝛿𝑖 .7 𝜈 det(𝑤(𝚪(−1) 0 𝑖 )) { } = 𝛿𝑖 , 𝛿𝑖 = ∈ −1, +1 ⇒ 𝜈Pf ( ( )) 0 ∈ ℤ2 (7.87)𝑤 𝚪 𝑖=0 𝑖 7This is the reason why the T-symmetric 2D Bloch system with two edges and four TRIMs is character- ised by only one ℤ2 invariant. 129 7 Geometric phases As the non-triviality is stable against redefining the Brillouin zone, the metallic sur- face states enjoy a topological protection which is similar to that of the metallic edge states of the 2Dℤ2 insulator. For this reason, systems with non-trivial 𝜈0 are referred to as strong topological insulators [147] and the subsequent surface states as topolo- gically protected surface states (TSSs). As strongℤ2 TIs are the only TIs considered in this work, the term topological insulator is used for them in an equalising manner. Topological insulators with inversion symmetry Although the definition and general properties ofℤ2 insulators are clear, the calcula- tion of the ℤ2 topological invariant for a concrete system remains complicated. The definition via the time-reversal Berry potential (Eq. (7.81c)) in principle resolves the continuous-gauge problem since the Berry potential can be calculated locally by the matrix elements of well-defined derivatives of the Hamilton operator (Eq. (7.58)). On the other hand, the evaluation will be tedious in the 3D case because of the above- mentioned combinatorial inflation due to the different surfaces and paths. The ap- proach of [147] provides a simplification of the combinatorics by defining a single 3D ℤ2 invariant via the product of the 𝛿𝑖 of all TRIMs (Eq. (7.87)). Though, concrete calculations still involve some exertion since the branches of the numerator and the denominator of Eq. (7.87) have to be continuos over the entire TRIM cube. Fortunately, there is one case in which the evaluation of Eq. (7.87) becomes partic- ularly simple: If inversion symmetry (P-symmetry) is present, then the 𝛿𝑖 equal the product of the parity eigenvalues of the respective Kramers pairs. Thus, if the solving Bloch eigenspace is given, the 𝛿𝑖 can directly be determined from the Bloch factors without need for caring about the conti∏nuity of the branches of Eq. (7.87).?̄? 𝛿𝑖 = 𝑝𝛼(𝚪𝑖) (7.88) 𝛼=1 where 𝑝𝛼(𝚪𝑖) denotes the parity eigenvalue of the Kramers pair with index 𝛼 at the TRIM 𝒌 = 𝚪𝑖 [147]. Of course, Eq. (7.88) works both in the 2D and the 3D case. As a last remark, the parity method does not only simplify the topological classific- ation of insulators with P-symmetry. Also systems without P-symmetry are suitable for it if they are proven to be adiabatically connected to a TI with P-symmetry. This holds because any adiabatic transformation which retains the band gap conserves the topological invariants. This considerably broadens the applicability of the parity method and makes it the method of choice for identifying ℤ2 insulators [147]. 130 8 The bulk phases of bismuth selenide and antimony Bismuth selenide and antimony, the materials composing the Sb@Bi2Se3 heterostruc- tures, have two properties in common: 1) They involve heavy p-block elements: the medium-heavy fourth-row element selenium (Se, 𝑍 = 34), the heavier fifth-row element antimony (Sb, 𝑍 = 51), and the sixth-row element bismuth (Bi, 𝑍 = 83), the heaviest stable element of the periodic table1. As a consequence of the high atomic numbers, strong relativistic effects are expected to determine the physical properties of both the Bi2Se3 substrate and the 𝛽-Sb adsorbate. 2) They are van der Waals materials, i.e. they comprise chemically saturated lay- ers whose mutual bonding underlies vdW interactions. These peculiarities make approaches beyond standard DFT necessary in order to cor- rectly describe the Sb@Bi2Se3 heterostructures. Spin-orbit coupling (SOC) allows for the most important relativistic effects in the valence shells (Sec. 2.1.2). The vdW inter- actions are introduced by adding the London dispersion to the total energy (Sec. 2.1.3). The first purpose of this chapter is to test the SOC and vdW approaches on the bulk phases of Bi2Se3 and 𝛽-Sb, for which experimental lattice parameters exist, contrarily to the heterostructures. Particularly the vdW corrections have to be tested because there are several implementations with different degrees of empiricism. The inter- layer distances are the central quantities to check since it is expectable that the vdW corrections mostly influence those. Furthermore, the structural effects from SOC are paid a special attention. In many previous investigations, SOC is only included for calculating the final band structures, while it is omitted during the structure optim- isations. It will turn out that SOC alters the interlayer distances in a non-negligible manner. The second purpose of this chapter concerns the electronic and topological properties of Bi2Se3 and 𝛽-Sb, which are absolutely crucial for understanding the topological phase transitions in the heterostructures. In detail, the band structures of bulk Bi2Se3 and 𝛽-Sb and, in addition, the two-dimensional band structures of free-standing thin 𝛽-Sb sheets are calculated and analysed. The ℤ2 invariant is explicitly calculated for these systems with the parity method (Sec. 7.4.3). 1In fact, bismuth is radioactive. However, the decay is extremely slow with an half-life of 1.9 · 1019 yr. It was not before 2003 until the instability of the most abundant isotope was proven [149]. 131 8 The bulk phases of bismuth selenide and antimony 8.1 Bismuth selenide Bismuth selenide is a compound of the pnictogen2 bismuth (Bi) and the chalcogen3 selenium (Se). Its structure belongs to the trigonal, P-symmetric 𝑅3̄𝑚 space group (No. 166), in which the atoms occupy the Wyckoff positions Se→1𝑎, Se→2𝑐 and Bi→ 2𝑐. Bi2Se3 belongs to the class of van der Waals materials or sparse matter. It is clearly layered, consisting of covalently bound sheets which in turn stick together by vdW in- teractions. In detail, each Bi2Se3 unit forms a quintuple layer (QL) of five atoms, which arrange themselves as hexagonal atomic layers according to an (ABCAB) sequence. The QLs in turn stack in such a way that the layer sequence continues, implying an axial periodicity length of 15 atomic layers or 3 QL: (ABCAB)(CABCA)(BCABC). The structure is sketched in Fig. 8.1 (a) and (b). The microscopic reason for the layered structure is a closed-shell configuration of the individual QLs [150]. However, the octet rule is obviously not fulfilled as the Bi and Se atoms coordinate each other in (half-complete) octahedral geometry. Since Se is more electronegative than Bi by Δ𝜒 = 2.48 − 1.67 = 0.81 according to the Allred- Rochow scale [17, Tafel III], the bonding is expected to be covalent with a partial ionic character of ca. 10 % [17, p. 158]. The Se atoms are in a negative oxidation state of – II, which completes the octets. The Bi atoms, on the other hand, are in a positive oxidation state of III, two less than the maximum of V for members of group 15. The reasons for the stability of the III-valent state of Bi are numerous, but all based on the low ability of the 6s electrons to participate in chemical bonding. A comparison between the pnictogens (Pn) shows that P assumes the oxidation state V in many compounds, while all other Pn prefer the oxidation state III. This manifests itself, for instance, in the redox potentials of the Pn(V)/Pn(III) acidic systems: they are negative for P, positive for N, As and Sb, and by far the highest for Bi [17, p. 962]. Four different, countercurrent effects are responsible for this: 1) The higher main quantum number of heavier homologues loosens the valence electrons. Therefore, P is more willing to donate its s electrons than N. 2) The d-block contraction binds the valence electrons closer the nucleus. There- fore, As and Sb are less willing to donate their s electrons than P. 3) The f-block contraction or lanthanoid contraction binds the valence electrons closer the nucleus. Therefore, Bi is even less willing to donate its s electrons than As and Sb. 4) The relativistic effects contract mainly the s orbitals and bind them closer to the nucleus. They are considerable for the heaviest elements, to which Bi belongs. As a consequence of 2), 3) and 4), the 6s electrons of Bi are so tightly bound that they behave almost like core states and become chemically inert. Hence, this effect is called the inert-pair effect (or lone-pair effect) [17, pp. 335ff, 373ff, 961ff]. As only the 6p orbitals of the Bi atoms are left for binding, the bonds arrange themselves in 2Also denoted pentels, group 15, the nitrogen group: N, P, As, Sb and Bi. 3Group 16, the oxygen group: O, S, Se, Te and Po. 132 8.1 Bismuth selenide (a) (b) H3 T4 31.0 (c) 30.5T c (Å)1 30.0 30.4 C d B a 29.0 28.9 28.8 28.8 w 28.4 28.7 A 28.6 28.7 C [‾11.0] 28.0 28.4SOC 28.2 28.3 28.5 B c A no SOC C 4.25 B QL (d) 4.21 a (Å) 3.2 (e) 3.144.20 d (Å) A 4.20 4.19 3.0 3.09 C 4.19 4.20 4.18 B 4.15 4.17 4.15 4.16 2.8 2.65 A 4.14 4.13 2.63 2.63 4.14 2.6 2.58 C 4.10 4.12 2.58 2.55 2.49 B 2.4 2.47 2.47 2.38 2.44A 4.05 2.2 [‾11.0] no vdW D2 D3 D3+BJ TS TSH no vdW D2 D3 D3+BJ TS TSH Figure 8.1: Bulk structure of Bi2Se3. Purple circles are Bi atoms; Light green circles are Se atoms. (a) Side view; (b) Top view. The conventional unit cell is marked red. (a) shows the rhombohedrally centred primitive unit cell as well. Orange stars in (b) mark some T1 posi- tions of the (0001) surface. (c) – (e) Lattice parameters 𝑎, 𝑐 and 𝑑 of the structures optimised by code-level relaxations within different approaches for the vdW corrections (x-axis, see text) and SOC (empty and filled circles). The experimental reference [153] is inserted as dashed lines. an octahedral geometry with angles close to 90◦. This obviously holds in Bi2Se3 quite strictly as Bi and Se coordinate each other in (half-)complete octahedral geometries. That also the Se atoms obviously underlie p bonding without hybridising with the 4s electron is presumably due to a steric effect from the p orbitals of the Bi atom. In the lighter homologous compounds Sb2Se3 and As2Se3, the pnictogens coordinate the Se atoms in a twofold and threefold manner [151, 152], not in a sixfold manner as in the case of the Se atom in the middle of the QLs. 8.1.1 Structure optimisation Computational parameters The DFT calculations are carried out with VASP [2, 3] using PAW potentials and the PBE xc-functional [10, 21, 22]. The Bi and Se atoms have PAW valences of 5d106s26p3 and 4s24p4, respectively. The kinetic energy cutoff is 𝐸cut = 400 eV. The Brillouin zone is sampled by a Γ-centred Monkhorst-Pack mesh [117] at a density of 12×12×3 k-points for the hexagonal unit cell. The Brillouin-zone integration is carried out by the tetrahedron method with Blöchl corrections [118] for all calculations as the compound is a semiconductor. A convergence test proves that increasing the energy cutoff or the sampling density alters the total energy by ≈1 meV per atom. The ionic forces are calculated according to the Hellmann-Feynman theorem [119]. The ions move along them towards the equilibrium positions and stop to relax if the forces acting on each ion are smaller than 0.005 eV/Å. 133 [00.1] [11.0] 8 The bulk phases of bismuth selenide and antimony For different combinations of vdW approaches and SOC/no SOC, the volume of the hexagonal unit cell is optimised by the code-level routines. For the combination re- producing the experimental reference best, the volume optimisation is repeated by the more exact Murnaghan equation of state (vide supra, p. 41). The six vdW ap- proaches tested are: no vdW correction, DFT-D2 [26], DFT-D3 + zero-damping [27], DFT-D3 + Becke-Johnson damping [28], the Tkatchenko-Scheffler method [29], and the Tkatchenko-Scheffler method with iterative Hirshfeld partitioning [30, 31]. Volume optimisation The relevant lattice parameters of the hexagonal unit cell of Bi2Se3 are: the basal lattice constant 𝑎, the axial lattice constant 𝑐, the interlayer distance 𝑑, defined as the difference between the z-coordinates of the outer Se atoms of adjacent QLs, the QL width 𝑤 = 𝑐/3 − 𝑑, and the Wyckoff parameters of the rhombohedral centring 𝑢 (Bi atoms) and 𝑣 (outer Se atoms). The results for 𝑎, 𝑐 and 𝑑 may be found in Fig. 8.1 (c), (d) and (e), respectively. The experimental lattice parameters (XRD, [153]) are inserted as dashed lines. The vdW correction strongly affects the interlayer distance 𝑑, which is expectable for a vdW material. Compared to the calculations without dispersion correction (no vdW), the QLs move closer together by more than 0.5 Å when any vdW implementa- tion is enabled. This corresponds to a shrinkage of 𝑑 by more than −15 %. Once vdW is enabled, the different implementations alter 𝑑 to a lesser extent. All vdW-corrected values vary by ca. 0.15 Å and approach the experimental reference up to a few per- cent. The QL width 𝑤 and the basal lattice constant 𝑎 are almost indifferent to the vdW implementation. Their values range around the experimental reference by ca. 1 %, even if vdW is disabled. As 𝑤 is rigid, the axial lattice constant 𝑐 congruently follows the interlayer distances 𝑑. Concerning the effects from SOC, the geometry of the individual QLs is almost indifferent. Both 𝑎 and 𝑤 slightly increase by ca. +0.3 % upon enabling SOC, which is far less than the already small variations due to the vdW implementation. In contrast, 𝑑 is surprisingly quite sensitive to whether SOC is en- abled or not. It contracts by −1.5 % when vdW is disabled, and even stronger by ca. −5 % when any vdW correction is enabled. The axial lattice constant 𝑐 inherits the behaviour of 𝑑. As the combination of DFT-D2 and SOC yields lattice parameters which are very close to the experimental reference, the volume for this set is redetermined with a Murnaghan relaxation. The resulting lattice parameters are compiled and com- pared with the experimental reference [153] in Tab. 8.1. The Murnaghan-relaxed DFT-D2+SOC structure is very close to the experimental one. The lattice constants deviate by less than 0.5 %. The unit-cell volumes are even closer together, deviating by only −0.1 %. The distribution of 𝑐 into 𝑑 and 𝑤 shows a bit more deviation (−0.9 % and +0.8 %, respectively). The experimental and theoretical Wyckoff parameters 𝑢 and 𝑣 agree. The bond distances in the DFT-D2+SOC structure are 2.87 Å for the outer Bi–Se bond and 3.06 Å for the inner Bi–Se bond. This accords with the coordination polyhedrons: Because the inner Se atom is higher coordinated (six Bi atoms), the in- dividual Bi–Se bonds are expected to be weaker and thus to be longer than those of the lower coordinated outer Se atom (three Bi atoms). Another point of view on this 134 8.1 Bismuth selenide 𝑎 (Å) 𝑐 (Å) 𝑐/𝑎 𝑑 (Å) 𝑤 (Å) 𝑢 𝑣 Exp. [153] 4.143 28.636 6.91 2.579 6.966 0.4008 0.2117 DFT-D2 + SOC 4.134 28.732 6.95 2.556 7.021 0.4000 0.2111 Deviation −0.2 % +0.3 % −0.9 % +0.8 % Table 8.1: Lattice parameters of bulk Bi2Se3. The unit cell was optimised by Murnaghan re- laxations, including DFT-D2 and SOC. Experimental reference: [153]. is based on the ionic character of the Se–Bi bonds: The inner one is expected to be more ionic wherefore it is closer to the sum of the respective ionic radii. On the con- trary, the outer one is expected to be closer to the sum of the respective covalent radii [153]. The bond angles are all close to 90◦: 92.1◦, 91.3◦, 84.9◦, 84.9◦ and 95.1◦ from the outer to the inner angles. Hence, the Bi–Se octahedrons are almost regular, which indicates a bonding majorly based on the p orbitals. 8.1.2 Electronic properties The band structure of the bulk phase of Bi2Se3 is calculated in a manner which is analogous to Sec. 4.3. All system-specific parameters are the same as those of the relaxations above. In particular, the bands are calculated with SOC. The trigonal unit cell of the DFT-D2+SOC structure is used. As the (0001) surface of the hexagonal unit cell is of special interest later in this work, the corresponding symmetry points K, M and Z are identified in the trigonal unit cell and chosen as vertices for the k-path4. Fig. 8.2 shows the bands. The line colour corresponds to the colour bars above the panels: (a) Bi/Se portions in all PAW projections, (b) s-orbital/p-orbital portions in all PAW projections, (c) (p𝑥 + p𝑦)-orbital/p𝑧-orbital portions in all p-orbital projections. The line widths in (a) and (b) are constant and that in (c) corresponds to the sum of the p-orbital projections of all atoms. The first conspicuous feature of the band structures consists in a strict separation of the p bands lying near the Fermi level from the s bands lying at higher binding en- ergies. The three lower, electron-like s bands accept the six Se-4s electrons. The two higher, hole-like s bands accept the four Bi-6s electrons. The two sets of s bands are separated by a gap of≈1.5 eV. The s bands in total are separated from the p bands by a gap of ≈2.5 eV. That the deep and almost localised s electrons do not contribute much to the bonding between the Bi and Se atoms perfectly agrees with the expectations from the inert-pair effect. The nine p bands below the Fermi level host the 18 p elec- trons from the constituents and are separated from the conduction bands by a global band gap of ≈0.3 eV. The mixed Se-Bi character indicates a nearly covalent binding between the p orbitals, which agrees well with the intermediate electronegativity difference and the regular coordination octahedrons in the Bi2Se3 structure. 4The MΓ path and the ΓKM path in Fig. 8.2 are straight, perpendicular line segments. Since the two M-points are not the same in the rhombohedral centring, they provide different eigenvalues. They become identical in the hexagonal unit cell up to 𝐶3 and PT symmetry operations. 135 8 The bulk phases of bismuth selenide and antimony (a) Bi Se (b) s p (c) px+py pz 0 1 -5 band gap~0.3 eV Inert-pair effect 0 -10 -1 1Å−1 -15M Γ K M|ΓZ M Γ K M|ΓZ M Γ K M|ΓZ Figure 8.2: Band structure of Bi2Se3 for the rhombohedrally centred DFT-D2+SOC structure. The symmetry points correspond to the hexagonal unit cell. The line colour corresponds to the colour bars above the panels. (a) Bi portion (purple) vs. Se portion (light green) in all PAW projections; (b) s-orbital portion (red) vs. p-orbital portion (blue) in all PAW projections; (c) (p𝑥 + p𝑦)-orbital portion (blue) vs. p𝑧-orbital portion (yellow) in the p-orbital projections of all atoms (line width) round the Fermi level. The respective PDOSs are plotted against the y-axis to right of the band structures in arbitrary units. The ℤ2 invariant The relativistic effects are so strong in Bi2Se3 that its bands distort into a ℤ2 insu- lating system. From an interpretational point of view, SOC gives rise to a repulsion between states having the same parity and total angular momentum, e.g. |p+𝑧 , ↑⟩ and |p++i , ↓⟩. In Bi2Se3, the repulsion inverts the band order at the Γ-point in that a P-𝑥 𝑦 symmetric p𝑧 band and a P-antisymmetric p𝑧 band swap occupations. At the other TRIMs, the band order remains the same [154]. The remainder of the band inversion is visible in Fig. 8.2 (c). Near the Γ-point, the CBM and the VBM are of p𝑧 character. The valence band shows a clear furrow and the conduction band is apparently flat- ter than a parabola. This is the result of overlaying two parabolae and introducing gaps at the intersections. A comparison between the Bi2Se3 band structures with and without SOC which clarifies this issue can be found in [154]. The new point about the band structures presented here consists in the colouring which proves that the band inversion at the Γ-point indeed happens in the p𝑧 states, as schematised in [154]. In order to verify that Bi2Se3 is indeed a TI, the ℤ2 topological invariant is explicitly calculated by means of the parity method. For this purpose, the Quantum Espresso (QE) package [4, 5] is employed because the parities are difficult to extract from the VASP output. The pseudopotentials are taken from the pslibrary (straightforward valency + d electrons, PAW, PBE, full-relativistic). The ground-state density of the above rhombohedrally centred DFT-D2+SOC structure is obtained at a cutoff energy of 60 Ry and a k-point mesh of 12×12×12. For this ground-state density, the eigenval- ues and eigenfunctions are calculated at the eight TRIMs {𝚪𝑖}. Then 𝛿𝑖 is calculated for each TRIM by extracting the parities of the Kramers pairs and inserting them into Eq. (7.88). The product of all eight 𝛿𝑖 returns the ℤ2 invariant by Eq. (7.87). The rhombohedral unit cell of Bi2Se3 has 78 valence electrons (30 Se-3d + 20 Bi-5d + 6 Se-4s + 4 Bi-6s + 12 Se-4p + 6 Bi-6p). Including all the bands returns 𝛿0 = −1 for 𝚪0, 136 E − EF (eV) 8.2 Antimony the Γ-point, and 𝛿𝑖 = +1 for all other TRIMs. Thus, Bi2Se3 is a topologically non-trivial ℤ2 insulator. According to the classification by Fu and Kane [147], the topological class is 1;(000), where the first integer indicates the 3D ℤ2 invariant. Repeating the procedure for the s and p electrons only, leaving the d electrons out, yields the same result, as expected from the non-binding nature of the complete d shells. A more interesting case consists in calculating the ℤ2 invariant for the p electrons only and leaving the s electrons out. This is rectified as the two submanifolds are separated by a finite band gap. The s bands yield a trivial ℤ2 invariant, while the p bands yield a non-trivialℤ2 invariant. This underlines the inert-pair effect and that the topological non-triviality is solely due to the ordering of the p bands. Repeating the procedure for the hexagonal unit cell yields the same results, as expected from the stability of the 3D ℤ2 invariant against band folding. 8.2 Antimony Antimony denotes the elemental phase of the pnictogen Sb5, the lighter, fifth-row homologue of Bi. Before analysing its structure, also in view of the heterostruc- tures, it is instructive to compare the different bulk phases of the pentels: In order to complete the octets, the pentels tend to cross-link in mutual trigonal-pyramidal coordination. There are several possibilities for saturating all bonds, the allotropes, whose stability depends on the thermodynamic boundary conditions. Phosphorus, for instance, adopts two metastable phases: white phosphorus (P4 tetrahedrons) and violet phosphorus (cross-linked tubes). Mediated by pressure, they transition into the orthorhombic black phase, which is stable under normal conditions. Under high pressure (>80 000 bar), the black phase reversibly transitions into a further, rhombo- hedral modification [17, pp. 849ff]. Both the orthorhombic, low-pressure phase (the 𝛼-phase6) and the rhombohedral, high-pressure phase (the 𝛽-phase) are vdW mater- ials consisting of few-atoms layers wherefore they are of special interest in this work. Bulk As adopts both the 𝛼-phase and the 𝛽-phase under normal conditions, with the difference to P that the 𝛼-phase (semiconducting, black) is only metastable and the 𝛽-phase (semimetallic, grey-lustrous) is stable. For Sb and Bi, the 𝛼-phase becomes 5There are two Latin words for the mineral stibnite Sb2S3: the medieval word antimonium and the ancient word stibium. While the origin of the former is unclear, the latter stems from an ancient cosmetic product. Today, antimony refers only to the elemental metal, while for derivates, the root stib- should be used. Of course, the latter name is the origin of the chemical symbol Sb. Heilen mit Antimon: Von der Chemiatrie zur Chemotherapie, https://www.pharmazeutische-zeitung. de/inhalt-10-2000/titel-10-2000/ (visited on 21/05/2022). The ancient Romans used “stimi” or “stibi” for pharmaceutical and cosmetic purposes. Plinius Maior wrote in his famous encyclopedia “Naturalis Historia” that the drying and tightening properties of stibium were used by women, who covered the area round their eyes with stibnite additives so as to widen them. The literal translation of “platyophthalmon” exactly means that. Gaius Plinius Secundus (Plinius Maior), Naturalis Historia, Liber XXXIII.101f 6The terminology is not unique as, in fact, 𝛼-P and 𝛽-P denote modifications of white phosphorus P4 [17, p. 850]. The nomenclature used in this work concentrates on the distinction between the isolated layers from the two layered phases (black phosphorus and high-pressure phosphorus). It is common in most works dealing with thin Pn sheets [P5, P10, 155–157]. 137 8 The bulk phases of bismuth selenide and antimony (a) C (b) T H w T 1 3 (c) 2.4 2.31 B 4 2.3 d (Å) A d C c 2.2 2.24 B 2.14 2.22 A BL a 2.13 2.1 d d [‾11.0] SOC no vdW D2 D3 D3+BJ [‾11.0] 1 2 Figure 8.3: Bulk structure of 𝛽-Sb. Blue circles are Sb atoms. (a) Side view; (b) Top view. The conventional unit cell is marked red. (a) shows the rhombohedrally centred primitive unit cell as well. Yellow stars in (b) mark some T1 positions of the (0001) surface. (c) Interlayer distance 𝑑 of the structures optimised by Murnaghan relaxations with SOC and different vdW schemes (see text). The experimental reference [158] is inserted as a dashed line. entirely unstable and only the 𝛽-phase remains stable under normal conditions. 𝛽-Sb is a semimetal like 𝛽-As, while 𝛽-Bi is a metal [17, pp. 943ff]. Like Bi2Se3, 𝛽-Sb belongs to the trigonal, P-symmetric𝑅3̄𝑚 space group (No. 166). The Sb atoms occupy the Wyckoff position 2𝑐. 𝛽-Sb consists of covalently bound sheets of buckled-honeycomb-Sb which stack in a manner that the lower Sb atom of the next sheet is over the H3 position of the previous one. The individual sheets are considered as bilayers (BLs), which stack according to a layer sequence with an axial periodicity of 6 atomic layers or 3 BL: (AB)(CA)(BC). The structure of 𝛽-Sb is sketched in Fig. 8.3 (a) and (b). Obviously, the BLs come so close together that the distance between them is not much larger than their width. In terms of bond distances, 𝑑1, the Sb–Sb distance within the BLs, is not much larger than 𝑑2, the distance across. This is a trend in the pnictogens as the 𝑑2/𝑑1 ratio decreases from P to Bi so that the 𝛽-phase approaches the cubic primitive structure [17, p. 943]. 8.2.1 Structure optimisation Computational parameters The DFT calculations are carried out with VASP [2, 3] using PAW potentials and the PBE xc-functional [10, 21, 22]. The Sb atoms have a PAW valence of 5s25p3. The kin- etic energy cutoff is 𝐸cut = 300 eV. The Brillouin zone is sampled by a Γ-centred Monkhorst-Pack mesh [117] at a density of 16 × 16 × 16 k-points for the rhombo- hedrally centred unit cell. The Brillouin-zone integration is carried out by a Gaus- sian occupation with a smearing of 5 meV for the ionic relaxations and by the tet- rahedron method with Blöchl corrections [118] for all static calculations. A conver- gence test proves that increasing the energy cutoff or the sampling density alters the total energy by ≈1 meV per atom. The ionic forces are calculated according to the Hellmann-Feynman theorem [119]. The ions move along them towards the equilib- rium positions and stop to relax if the forces acting on each ion are smaller than 0.005 eV/Å. SOC is enabled in all calculations. For different vdW approaches, the volume of the rhombohedrally centred unit cell is optimised by the Murnaghan equa- tion of state. The four approaches tested are: no vdW correction, DFT-D2 [26], DFT-D3 + zero-damping [27], and DFT-D3 + Becke-Johnson damping [28] 138 [00.1] [11.0] 8.2 Antimony Volume optimisation The relevant lattice parameters of the hexagonal unit cell are: the basal lattice con- stant 𝑎, the axial lattice constant 𝑐, the interlayer distance 𝑑, the BL width 𝑤, and the Wyckoff parameter of the rhombohedral centring 𝑧. The results for 𝑑 may be found in Fig. 8.3 (c). The experimental reference (XRD, [158]) is inserted as a dashed line. As in the case of Bi2Se3, the vdW correction makes the BLs come closer together. Though, 𝑑 does not shrink as strongly as in Bi2Se3, only by −0.1 Å to −0.2 Å (−4 % to −8 %) upon activating any vdW implementation. In fact, disabling the vdW cor- rections already produces an interlayer distance which lies only +3 % above the ex- perimental reference. DFT-D2 and DFT-D3+BJ even underestimate 𝑑 by −4 %, while DFT-D3 yields a good value for 𝑑 lying only−0.7 % below the reference. However, 𝑑 is not the only vdW-sensitive lattice parameter. Also 𝑎 varies up to 2 %, where DFT-D2 and DFT-D3+BJ yield good values and DFT-D3 a value which is too large by +1.4 %. In- terestingly, all vdW approaches as well as disabling vdW overestimate 𝑤 by ca. +2 %. The unit-cell volumes𝑉 agree with the experiment in the order: DFT-D3+BJ (−0.6 %), DFT-D2 (−1.2 %), DFT-D3 (+3.1 %) and no vdW (+6.5 %). The different qualities of the approaches can be summarised by the Sb–Sb bond distances 𝑑1 and 𝑑2 (Tab. 8.2 (a)): DFT-D2 describes 𝑑1 better, DFT-D3 describes 𝑑2 better and omitting the vdW correc- tion overestimates both distances. Since DFT-D2+SOC yielded good results for Bi2Se3 and will be chosen as the central approach for the heterostructures, the complete set of lattice parameters of the respective 𝛽-Sb unit cell are compiled in Tab. 8.2 (b). (a) Exp. [158] no vdW DFT-D2 DFT-D3 DFT-D3+BJ 𝑑1 (Å) 2.90 2.95 (+1.8 %) 2.92 (+0.7 %) 2.95 (+1.5 %) 2.94 (+1.2 %) 𝑑2 (Å) 3.34 3.43 (+2.5 %) 3.29 (−1.7 %) 3.36 (+0.5 %) 3.28 (−1.8 %) (b) 𝑎 (Å) 𝑐 (Å) 𝑉 (Å3) 𝑑 (Å) 𝑤 (Å) 𝑧 Exp. [158] 4.301 11.222 179.75 2.238 1.503 0.2336 DFT-D2 + SOC 4.311 11.033 177.59 2.144 1.534 0.2362 Deviation +0.2 % −1.7 % −1.2 % −4.2 % +2.1 % Table 8.2: Lattice parameters of bulk 𝛽-Sb. (a) Bond distances 𝑑1 and 𝑑2 under different vdW treatments. (b) Lattice parameters optimised with DFT-D2 and SOC. The unit cell was optim- ised by Murnaghan relaxations. Experimental reference: [158]. 8.2.2 Electronic properties The band structure of the DFT-D2+SOC structure of 𝛽-Sb is plotted in Fig. 8.4 (a) and (b), where the plotting procedure is analogous to that in Fig. 8.2 (b) and (c). In ad- dition, the bands of the isolated Sb sheets are inserted in (c)→ 1 BL and (d)→ 2 BL. Their structures are the peeled-off structures from the respective heterostructures in Sec. 9.1, i.e. the heterostructure optimised with DFT-D2+SOC after removal of the substrate. In this manner, the bands of the isolated sheets can directly be compared 139 8 The bulk phases of bismuth selenide and antimony (a) s p (b) px+py pz (c) 1BL (d) 2BL 2 0 0 -5 -2 unhyb- -10 ridiseds orbital -4 -15 -6M Γ K M|Γ Z M Γ K M|Γ Z M Γ K M Γ K Figure 8.4: Band structure of (a) and (b) bulk 𝛽-Sb and (c) and (d) isolated 𝛽-Sb sheets. For bulk 𝛽-Sb, the rhombohedrally centred DFT-D2+SOC structure was used. The 𝛽-Sb sheets are the peeled-off structures of the respective Sb@Bi2Se3 heterostructures. The symmetry points correspond to the hexagonal unit cell. The line colour corresponds to the colour bars above the panels. (a) s-orbital portion (red) vs. p-orbital portion (blue) in all PAW projections for bulk 𝛽-Sb; (b) – (d) (p𝑥 + p𝑦)-orbital portion (blue) vs. p𝑧-orbital portion (yellow) in the p- orbital projections of all atoms (line width) for (b) bulk 𝛽-Sb, (c) an isolated 1 BL sheet and (d) an isolated 2 BL sheet of 𝛽-Sb. The respective PDOSs are plotted against the y-axis to right of the band structures in arbitrary units. with the Sb bands in the layer-desorption-and-readsorption procedure for tracking the topological phase transitions in Sec. 9.3. Since the bands of the isolated sheets thematically fit better to this chapter, they are presented here. 𝛽-Sb is a semimetal in which the valence bands and the conduction bands are sep- arated locally in 𝒌, but not by a global band gap. In other words, the band gap is negative, which will be important for the ℤ2 classification. There are two s bands, which host the four 5s electrons from the two Sb atoms. The other three valence bands are of p character and host the six 5p electrons. The s bands and the p bands are clearly separated from each other at every k-point, which reminds of the Bi2Se3 case. Though, the band width of the s bands is larger than that of Bi2Se3. That the s orbitals do not hybridise with the p orbitals indicates a certain unwillingness of the s electrons to take part in chemistry and agrees with the nearly right-angled geometry of the BLs (95.0◦ bond angle within the BLs). While the local band gap is large in-plane, the valence bands and the conduction bands approach each other near the out-of-plane Z-point. The p𝑧-resolved bands show a peculiarity there: The uppermost valence band is of p𝑧 character and has a positive slope on most of the Γ𝑍 path. Near the Z-point, however, it bends down and changes to a hybrid between p𝑧 and (p𝑥 + p𝑦). That the opposite happens in the lowermost conduction band hints at a band inversion at that point. This might produce a topologically non-trivial state if the negativeness of band gap is ignored. The ℤ2 invariant The supposed topologically non-trivial state due to the apparent band inversion can easily be checked by calculating the ℤ2 invariant. In doing so, the negative band gap is ignored, i.e. the bands are assumed to be separated into a well-defined set of 140 E − EF (eV) 8.2 Antimony valence and conduction bands by making the local band gaps global. The procedure is completely analogous to the calculation of theℤ2 invariant of Bi2Se3. Again the QE package [4, 5] is employed with a 5s25p4-valent, full-relativistic PAW-PBE potential for Sb. The rhombohedrally centred DFT-D2+SOC structure used as input. All other parameters are the same as those in the Bi2Se3 case (vide supra). Applying the parity method on all ten bands returns a set of five positive and three negative 𝛿𝑖 . Their negative product proves that the valence manifold is topologically non-trivial wherefore 𝛽-Sb can be considered a 3D topologically non-trivialℤ2 semi- metal. A series of gauge transformations on the TRIM cube reduces the number of negative 𝛿𝑖 to one, which then sits at the vertex 𝚪7, the space-diagonally opposite of 𝚪0. According to the classification of Fu and Kane [147], the full topological class is 1;(111), in agreement with earlier results [159]. Repeating the ℤ2 classification for the s bands only yields a trivial character, while the p-band subspace produces the same results as those above. In particular, the position of the non-trivial 𝛿𝑖 at 𝚪7 im- plies that the band inversion happens the Z-point of the hexagonal unit cell, which confirms the presumed band inversion in Fig. 8.4. The classification of bulk 𝛽-Sb as a topologically non-trivial semimetal is per se not very useful due to the fractional occupation. The practical consequences emerge in thin films: Since the band widths of a bulk material decrease for thin slices, the band gaps increase and, in particular, semimetals (negative band gap) can adiabatically turn into semiconductors/insulators (positive band gap). For (0001)-slices of 𝛽-Sb, this happens at a thickness of ∼20 BL [160]. In doing so, the valence manifold of the 𝛽-Sb slice inherits the topological character from the bulk semimetal as the bands do not reorder. Therefore, such films of 𝛽-Sb are 3D ℤ2 TIs with two surfaces bearing the corresponding TSSs. A further reduction of the slice thickness below 8 BL opens a small band gap in the TSSs as the bulk character vanishes. However, being a 2D insulator, the thin slice permits the calculation of the 2Dℤ2 invariant, with the result that it is a 2D ℤ2 TI. Finally, at a thickness of 3 BL, the gap closes and reopens at a thickness of 2 BL. This is a topological phase transition which renders 𝛽-Sb sheets with thicknesses of 2 BL and 1 BL topologically trivial 2D insulators [160]. This is confirmed in this work, as the 𝛿𝑖 are negative for all four (2D) TRIMs. As a last remark, the band structures of the isolated sheets in Fig. 8.4 (c) and (d) already hint at the band inversion in bulk 𝛽-Sb. In the 1 BL structure, the VBM is at the Γ-point and the three p bands resemble a rehybridised superposition of a blue, hole-like parabola, a yellow p𝑧 band shaped like a lying B and a deeper, greyish, W-shaped band. When a second BL is added to the structure, one of the B bands is shifted to the Fermi level and approaches the conduction bands. Adding further BLs is likely to invert the band order at the Γ-point. If the band gap reopens after such a reordering, the system will be in a topologically non-trivial state. This exactly agrees with the observations in [160] and furthermore confirms the band inversion of the bulk band structure at the Z-point, which coincides with the Γ-point of the (0001) surface. 141 8 The bulk phases of bismuth selenide and antimony 8.3 Discussion and summary The investigations of the bulk phases proved that it is absolutely crucial to take vdW corrections into account when treating a vdW material like Bi2Se3 within DFT. Omit- ting them results in too large interlayer distances. The DFT-D2 approach in combin- ation with SOC produced lattice parameters for Bi2Se3 which agree with the experi- mental reference [153] up to 1 %. The unit cell volumes deviate by only 0.1 %. While the strong attractive effect between the QLs from the vdW corrections is expectable, the intermediate, also attractive effect from SOC is uncommon. There are not many detailed works on the first-principles structure of Bi2Se3, probably because not giving attention to all peculiarities of the material results in lattice constants which consid- erably deviate from the experiment. An extensive DFT work testing more vdW ap- proaches than this work was published by Luo et al. [161]. Their DFT-D2+SOC lattice constants (QE) are very close to those obtained in this work (VASP). Luo et al. invest- igated the SOC-related effects on the structure as well. Although they found that SOC reduces the interlayer distances, in agreement with this work, they doubt the phys- ical relevance (“it is still unclear if SOC is truly important for structural optimization” [161]). There is, however, a good reason why SOC is likely to exert physical effects on the structure: Since Bi2Se3 is a TI, the orbital occupation changes at the Γ-point when SOC is enabled. This point is certainly worth investigating in more detail, e.g. by analysing partial charges with and without SOC. While the results for Bi2Se3 agree with the experiment very well, those obtained for 𝛽-Sb show slightly larger deviations. Although DFT-D3 produces better interlayer distances, DFT-D2 is considered more suitable for describing 𝛽-Sb for two reasons. Firstly, the bond distance within the BLs is better predicted by DFT-D2. Secondly, the unit-cell volume within DFT-D2 is close to the experimental one, contrarily to that of the DFT-D3 structure. This will be important for the heterostructures since the Bi2Se3 substrate strains the thin Sb adlayer, so part of the substrate-adlayer distance is due to anisotropic deformation. In this case, a correct description of the volumes is obligatory. Aktürk et al. had a work published which investigates the structure of 𝛽-Sb with VASP [155]. Although, they used a parameter set very similar to that of this work (PBE, cutoff energy, k-mesh), they obtained DFT-D2+SOC lattice constants which are closer to the experiment [153] than those of this work. Thus, it is not excluded that the approach of this work was not free of errors. It is certainly worth testing further vdW functionals on 𝛽-Sb in order to explain the deviances. 𝛽-Sb and Bi2Se3 have in common that the bond angles are close to the right angle (regular coordination octahedrons in the QLs and right-angled trigonal-pyramidal coordination in the BLs). This accords very well with the electronic properties as the major part of the bonding is due to the p orbitals, while the s orbitals are proven not to hybridise with them. This is much pronounced in Bi2Se3, where the inert- pair effect prevents the s electrons from chemical bonding. The topological classes were calculated and agree with previous knowledge. The band inversion in the bulk phases is clearly visible from the p𝑧-resolved bands. In the topologically trivial, isol- ated 1 BL and 2 BL sheets of 𝛽-Sb, the beginning band inversion at the surface Γ-point was demonstrated. 142 9 The antimony on bismuth selenide heterostructure The (0001) surface of Bi2Se3, denoted BS(0001), results from cleaving the material ex- actly between two QLs, in accordance with the layered structure of the vdW material [157, 162]. Since the atomic shells of the QLs are closed, the surface morphology is de- termined by a unreconstructed buckled honeycomb. The upper atoms are the outer Se atoms of the terminating QL and the lower atoms are Bi atoms (vide supra, Fig. 8.1 (b)). The metallic TSSs of BS(0001) give rise to a single Dirac cone at the Γ-point, as confirmed by DFT calculations and ARPES measurements [P7, P9, 154, 163–166]. The adsorption of Sb on BS(0001) produces the Sb@BS system, a class of vdW het- erostructures consisting of a Sb adlayer which binds to the Bi2Se3 substrate by vdW interactions. The thinnest films occur in two different phases depending on the ther- modynamic boundary conditions: the 𝛼-phase and the 𝛽-phase, named after the two layered bulk phases of the pnictogens (vide supra, Sec. 8.2). The films can indeed be considered as single sheets from the respective bulk phases, which are attached to the substrate. The 𝛼-Sb adlayers are incommensurate with BS(0001) due to differ- ent 2D symmetry groups (orthorhombic vs. trigonal). In contrast, the 𝛽-Sb adlayers have the same symmetry as BS(0001) and, in addition, similar basal lattice constants (𝑎𝛽-Sb = 4.30 Å and 𝑎Bi2Se3 = 4.14 Å, vide supra). Therefore, they are commensurate with the substrate and can reach heights of several BLs by vdW epitaxy [157, 166, 167]. The 𝛼-phase can be transformed into the 𝛽-phase by a temperature-induced phase transition [P5, P10]. Beside the structural peculiarities of the system, the electronic properties attract sci- entific attention as well. The interplay between the TSSs of the substrate and the 2D states of the 𝛽-Sb adlayer (CIs for thicknesses of 1 BL and 2 BL, vide supra) is complex. Although the adsorption is based on vdW interactions, the electronic band structure of the heterostructure looks rather different from that of the constituents. Theoret- ical work on the heterostructures has been published [166, 168], though the origin of the features has not yet been satisfyingly understood. Furthermore, high-quality band structures and ARPES images are missing and the connection between the band structures and the structural details has not been investigated yet. The aim of this chapter is thus to understand the electronic properties of commen- surate 𝛽-Sb films with thicknesses of 1 BL and 2 BL by means of DFT. For this pur- pose, structure models are set up and subjected to ionic relaxations. The focus of the structure analysis is on the way how the Sb BLs stack on the substrate, which has not yet been paid sufficient attention, but proves to be thermodynamically rel- evant. Also the effects from vdW corrections and SOC are illuminated. In particu- 143 9 The antimony on bismuth selenide heterostructure lar, the effects from SOC are paid a special attention since one tends to deactivate SOC during the relaxations to save computation time. For the thermodynamically stable structure models, the electronic band structures are calculated and compared with experimental ARPES measurements. Furthermore, ab initio STM simulations are performed and the results are compared with experimental STM measurements. The last section contains a simulated desorption-and-readsorption process for the 𝛽-Sb adlayer which permits tracking the evolution of the TSSs upon Sb layer adsorp- tion. By this means, a complex series of topological phase transitions is illuminated. The deep analysis finally explains all details of the electronic band structures of the heterostructures. The results have already been published [P5, P7, P9]. 9.1 Structure optimisation The Bi2Se3 slab The topological non-triviality of Bi2Se3 discourages the asymmetric-slab approach since the passivation of the metallic surface states is impossible. Therefore, the Bi2Se3 substrate is modelled by a symmetric, right-prismatic slab, which provides identical, hexagonal surfaces on the top and the bottom side. The lattice parameters of the slab are taken from the Bi2Se3 unit cell optimised with DFT-D2 and SOC (vide supra, Tab. 8.1). The thickness of the slab amounts to 6 QL, which is thick enough so that the TSSs of the two surfaces do not interfere1. The periodic replica in the z-direction are separated from each other by 60 Å of vacuum, which reduces to 40 Å when the slab bears the Sb adlayers. The slab is sketched in Fig. 9.1 (a) and (b). Computational details The computational parameters for the heterostructure are similar to those used for the constituents (Chap. 8). The DFT calculations are carried out with VASP [2, 3] us- ing PAW potentials and the PBE xc-functional [10, 21, 22]. The Bi, Se and Sb atoms have PAW valences of 5d106s26p3, 4s24p4 and 5s25p3, respectively. The kinetic en- ergy cutoff is 𝐸cut = 400 eV. The Brillouin zone is sampled by a Γ-centred surface Monkhorst-Pack mesh [117] at a density of 12 × 12 × 1 k-points. The Brillouin-zone integration is carried out by a Gaussian occupation with a smearing of 5 meV for the ionic relaxations and by the tetrahedron method with Blöchl corrections [118] for all static calculations. The ionic forces are calculated according to the Hellmann- Feynman theorem [119]. The unit cell is fixed. The atoms of the inner four QLs are held fixed at the bulk positions during all relaxations. All other ions, the top QL right below the surface and the Sb adlayer, move within the unit cell along the forces to- wards the equilibrium positions and stop to relax if the forces acting on each ion are smaller than 0.005 eV/Å. Unless otherwise noted, SOC is explicitly included also in the relaxations and the DFT-D2 method [26] is employed for the vdW contributions. 1Test calculations show that the real-space distributions of the wavefunctions near the Dirac points of the two surfaces overlap up to 5 QL, although the band gap already closes at a thicknesses of 3 QL. 144 9.1 Structure optimisation (a) (b) 60 Å (c) P01 P10 P0120 P0102 vacuum H3 T1 BL2 T1 H3 T4 w tQL T1 H3 T H d4 3 T4 T 24 BL1 H3 T4 H3 H3 QL d1 tQL [‾11.0] + symmetriclower half [‾11.0] Figure 9.1: Sketch of the BS(0001) slab. Purple circles are Bi atoms; Light green circles are Se atoms; Blue circles are Sb atoms. (a) Top view of the slab. (b) Upper half slab in side view. The lower half results from inversion symmetry. Dark red lines mark the unit cells. The light red rectangle in (b) encircles the subset of atoms fixed in position during the relaxations. (c) Examples for Sb adlayers with heights of 1 BL and 2 BL. The symmetry positions of the substrate T1, T4 and H3 are indicated in (a) and (c). Reproduced from [P9]. 9.1.1 The stacking sequence of the adlayer The structure models for a vdW heterostructure in general have many degrees of freedom, in particular if the adlayer is not commensurate with the substrate. In the case of Sb@BS, the experimentally confirmed commensurateness [P5, 157] constrains the configuration space to those structures where the Sb atoms of the first BL sit at the symmetry positions of the terminating buckled honeycomb of the BS(0001) surface: T1, T4 and H3 (Fig. 9.1 (a)). The adsorption of multiple BLs can be understood in a successive manner as the BLs themselves are buckled honeycombs, which provide each a new triple of symmetry positions for the next BL. The bulk phases of both Bi2Se3 and 𝛽-Sb underlie a stacking which consists in the strict repetition of the atomic-layer sequence ABC with differing vdW-layer interrup- tions (Chap. 8). In order to transfer this to the Sb@BS heterostructure, let tQL, BL1 and BL2 be the top QL of the substrate, the first BL of 𝛽-Sb and the second BL of 𝛽-Sb, respectively. If BL1 continues the sequence of tQL (ABCAB) by (CA), the lower Sb atom is above H3 and the upper Sb atom above T4. BL2 can proceed with the indices (BC). Though, this is not the only possibility to stack the Sb adlayer in a reasonable manner: An inverted BL1 having the atomic-layer indices (AC), i.e. lower Sb atom above T4 and upper Sb atom above H3, would fit into the same two surface symmetry positions as the non-inverted (natural) BL1. Of course, the combinatorics inflate the configuration space with growing thicknesses of the Sb adlayer. Therefore, the configurations are systematised by the following nomenclature: The three original symmetry positions of the BS(0001) surface are mapped onto integers: H3→ 0, T4→ 1 and T1→ 2. A Sb@BS structure model is denoted “P” plus the sequence of integers which the adsorbed Sb atoms occupy. Thus, the single, natural BL is called “P01”, the single, inverted BL is called “P10” and two natural BLs are called “P0120”. The nomenclature simplifies the identification of inverted structures as it reduces to permutations between integers. 145 [11.0] fixed in position [00.1] 9 The antimony on bismuth selenide heterostructure Fig. 9.1 (c) shows examples for the nomenclature. Furthermore it defines the inter- layer distance between tQL and BL1 𝑑1 and that between BL1 and BL2 𝑑2, which will be used for the analysis of the structures. In order to compare the different structure models regarding their thermodynamical stability, consider the following boundary conditions: The Sb adlayer forms in the annealing stage, which translates into a constant temperature𝑇 , a constant pressure (𝑝 = 0, UHV) and a constant amount of Sb 𝑁Sb. If Sb diffusion into the Bi2Se3 substrate is negligible, the amounts of Bi and Se can be considered constant, as well as their chemical potentials. Consequently, the substrate contributes a constant term to the DFT total energy of the Sb@SB slabs 𝐸Sb@BSDFT . If this term is set to the DFT total energy of the “clean” slab without adsorbates 𝐸Clean and subtracted from 𝐸Sb@BSDFT DFT , the remainder approximately equals the internal energy of the Sb adsorbate only. Transforming it into the Gibbs free energy and dividing it by 𝑁Sb results in the Sb chemical potential 𝜇Sb. This is the quantity to minimise and permits the direct comparison between structure models with different amounts of Sb [P5, P9, P10, 157]. The Sb chemical potential of (natural) bulk 𝛽-Sb 𝜇b-natSb is chosen as the reference point. 𝐺adlayer ( ) 𝜇 = − 𝜇b-nat ≈ 1 𝐸Sb@BS − 𝐸Clean b-natSb 𝑁 Sb 𝑁 DFT DFT − 𝜇Sb (9.89) Sb Sb Analogously to the RESi2 case, the phononic contributions to the free energy are neg- lected (vide supra, Sec. 4.2.1). This approximation is based on the similar, layered structures, suggesting that each Sb BL has similar lattice-dynamical contributions. Since the following analysis concentrates on the adsorption of different 𝛽-Sb adlay- ers, it is instructive to consider hypothetical phases of elemental 𝛽-Sb: • An alternative bulk phase in which every second BL is inverted (b-inv). The stacking sequence is thus (AB)(CB). Its structure is optimised at code level. • Free-standing 𝛽-Sb sheets, whose structures are optimised by the 2D-version of the Murnaghan equation of state: the 1 BL sheet (f1BL) and the 2 BL sheet with natural (f2BL-nat) and inverted stacking (f2BL-inv). Tab. 9.1 shows the Sb chemical potentials 𝜇Sb, the basal lattice constants 𝑎 and the interlayer distances 𝑑 for these structures and for the stable bulk phase (b-nat). System Abbr. 𝜇Sb (meV) 𝑎 (Å) 𝑑 (Å) Bulk (natural stacking) b-nat 0 4.31 2.14 Bulk (inverted stacking) b-inv 40 4.12 2.84 Free-standing 1BL sheet f1BL 276 4.05 — Free-standing 2BL sheet (nat. stacking) f2BL-nat 178 4.16 2.43 Free-standing 2BL sheet (inv. stacking) f2BL-inv 173 4.08 2.97 Table 9.1: Sb chemical potentials 𝜇Sb, basal lattice constants 𝑎 and interlayer distances 𝑑 of 2D and 3D 𝛽-Sb systems. Reproduced from [P9]. 146 9.1 Structure optimisation (a) 200 180 184 150 100 89 76 55 65 36 39 45 Figure 9.2: (a) Sb chemical potentials50 33 µb-invSb 𝜇Sb and (b) interlayer distances 𝑑1 0 (squares) and 𝑑2 (circles) of the struc- (b) 3.8 ture models for the Sb@BS system 3.6 d1 optimised with DFT-D2+SOC. The two 3.4 d2 bulk phases of 𝛽-Sb b-nat (stable) and3.2 3.0 b-inv (hypothetical) are inserted asi i 2.8 db-inv dashed lines. Red “i” symbols mark in- 2.6 i i verted Sb BLs. Reproduced from [P9]. 2.4 2.2 db-nat 01 12 20 10 21 02 20 02 1 2P P P P P P 1 1 02 01 P0 P0 P1 P1 At first, 1 BL thick 𝛽-Sb adlayers (1BL-Sb) are investigated by optimising the structure of all six adsorption possibilities. The results for 𝜇Sb and 𝑑1 are plotted in Fig. 9.2 (blue). P01 is the most stable structure with 𝜇Sb = 55 meV. Hence, it is indeed favour- able if BL1 continues the stacking sequence of the Bi2Se3 substrate. Surprisingly, P10, the inverted BL, is only slightly less stable by Δ𝜇Sb = +10 meV. Since the difference is below 𝑘B𝑇 at room temperature, the two structures are energetically so close to each other that a conclusion about their relative stability is not possible. The next stable pair of structures are P02 and P12, where the upper Sb atom is above T1. Their 𝜇Sb are higher than that of P01 by +21 meV and +34 meV, also in the order of 𝑘B𝑇 at room temperature. Only P20 and P21, where the lower Sb atom is above T1, are clearly unstable against P01 by Δ𝜇Sb > +100 meV. The interlayer distances 𝑑1 congruently follow 𝜇Sb. This permits the conclusion that the binding energy between the substrate and BL1 depends on how close the former lets the latter approach. Continuing the stacking sequence of the substrate, P01 has the smallest interlayer distance of all 1BL-Sb structures (𝑑1 = 2.40 Å, +0.25 Å com- pared to 𝑑b-nat), which minimises 𝜇Sb. In contrast, in P20 and P21, the top Se atom of the substrate pushes the BL quite far away (𝑑1 > 3.5 Å). This reduces the binding energy to such an extent that the BL is energetically closer to the free-standing 1 BL sheet (f1BL) than to P01. In all other structures, 𝑑1 is close to 𝑑b-inv. The results for 2 BL thick 𝛽-Sb adlayers (2BL-Sb) are plotted in Fig. 9.2 (green). Us- ing the 1BL-Sb results, only those 2BL-Sb structures are considered where both BLs successively occupy the convenient H3 and T4 positions. These are P0120 (BL1 nat, BL2 nat), P0102 (BL1 nat, BL2 inv), P1021 (BL1 inv, BL2 nat) and P1012 (BL1 inv, BL2 inv). Similarly to the 1BL-Sb case, it is favourable if the BLs continue the stacking sequence of the substrate (P0120). However, the 𝜇Sb of the other three structures are surprisingly close to that of P0120 (Δ𝜇Sb ≤ 12 meV ∼ 𝑘B𝑇 ). Hence, it remains unclear which structure model represents the real 2BL-Sb structure. The interlayer distances clearly conform with the layer inversions: If the BL is natural/inverted, the corres- ponding 𝑑𝑖 is closer to 𝑑b-nat/𝑑b-inv (red “i” symbols in Fig. 9.2). 147 d (Å) μSb (meV) 9 The antimony on bismuth selenide heterostructure Tab. 9.1 shows that in both cases of natural and inverted stacking, the basis of bulk 𝛽-Sb is larger than that of a free-standing 2 BL sheet, which in turn is larger than that of a free-standing 1 BL sheet. This suggests that a free-standing 𝛽-Sb sheet laterally expands with growing thickness. In the bulk limit, the basal lattice constant of b- inv is smaller than that of b-nat (−4.4 %). The behaviour of the sheets has important consequences for the lattice matching between the Sb adlayer and the Bi2Se3 sub- strate (surface lattice constant 𝑎BS = 4.13 Å): Considerations based on natural bulk 𝛽-Sb would lead to the wrong conclusion that the Sb adsorbate underlies compress- ive strain (−4.0 %). On the contrary, the strain is even tensile in the 1BL-Sb hetero- structure (+2.1 % w.r.t. f1BL), while in 2BL-Sb, the lattice matching is almost perfect (f2BL-nat: −0.7 %, f2BL-inv: +1.2 %). For thick, bulk-like Sb adlayers, the inverted stacking (b-inv) produces an almost perfect lattice matching (+0.2 %), while a natur- ally stacked layer (b-nat) has to be compressed by−4.0 % (vide supra). Since adapting the bases to BS(0001) makes 𝜇b-nat b-invSb approach 𝜇Sb , it is not clear which of the two stackings occurs even for thick Sb adlayers [P9]. The free-standing 2 BL sheets demonstrate why the inversion of BL1 and BL2 in the 2BL-Sb heterostructure has such a low impact on the stability. Firstly, 𝜇Sb is almost equal in both free-standing 2 BL sheets, differing only by 5 meV. Secondly, both sheets match the substrate equally well, so the adsorption energy per atom depends only on the relation between BL1 and the substrate. Thirdly, BL1 is indifferent to inversions in the 1BL-Sb case. These three points combined explain why all four investigated 2BL-Sb structures are energetically so close together that a conclusion about the real structure is not possible. In spite of this, the subsequent work concentrates on the heterostructures with natural stacking: P01 and P0120. 9.1.2 The effects from SOC and vdW on the heterostructure Although DFT-D2+SOC is the approach of choice in this chapter, it is interesting to investigate how SOC and the vdW corrections affect the heterostructures. For this purpose, the structure models for 1BL-Sb and 2BL-Sb are optimised with disabled cor- rections: pure PBE, DFT-D2 only, SOC only. All other parameters remain unchanged. The results for the interlayer distances 𝑑1 and 𝑑2 are plotted in Fig. 9.3. Similarly to bulk Bi2Se3, both vdW and SOC make the adlayer move closer to the substrate. In 1BL-Sb, 𝑑1 (blue squares) shrinks by a similar percentage if either vdW or SOC is en- abled (−15 % and −10 %, respectively). If both are enabled, the shrinkage amounts to −20 %. The overall contractive effect and the importance of SOC are more pro- nounced in the heterostructure than in bulk Bi2Se3. In 2BL-Sb, 𝑑1 (green squares) is generally smaller, but the relative shrinkage is similar (−17 % if enabling vdW and SOC). 𝑑2 (green circles) behaves differently from 𝑑1 as it contracts upon enabling vdW (−5 % to −6 %), but varies much less due to SOC (+1 % to +2 %). Back to DFT-D2+SOC, while 𝑑1 and 𝑑2 vary considerably, the other lattice parameters are close to the respective bulk values. The thickness of tQL and the interlayer dis- tance beneath tQL are very close to 𝑤 and 𝑑 in bulk Bi2Se3, deviating by <0.5 % and <1 %, respectively. The Sb–Sb bond distances in the adsorbed BLs (2.88 Å to 2.89 Å) are close to the respective bond distance in bulk 𝛽-Sb (2.92 Å, −1.4 % to −1.0 %). 148 9.2 Electronic properties 3.0 2.98 1BL-Sb 2BL-Sb d1 d1 2.8 d2 Figure 9.3: Interlayer distances 𝑑1 (squares)2.78 2.68 and 𝑑2 (circles) of 1BL-Sb (blue) and 2BL-Sb (green) optimised with different corrections. 2.6 2.59 bulk Bi2Se3: 2.58 Å 2.53 Treatments: 1) pure PBE, 2) SOC only, 3) DFT-2.58 2.57 2.46 D2 only, 4) DFT-D2 and SOC. The experimental 2.42 interlayer distance of the bulk phases are inser- 2.4 2.382.41 ted as horizontal dashes lines: purple→ Bi2Se3 [153]; blue→ 𝛽-Sb [158]. bulk β-Sb: 2.24 Å 2.30 2.2 PBE SOC D2 D2+SOC 9.2 Electronic properties The band structures of the three slabs (clean Bi2Se3, 1BL-Sb and 2BL-Sb, all optim- ised with DFT-D2+SOC) are calculated analogously to Sec. 4.3 and may be found in Fig. 9.4. All system-specific parameters are the same as those of the relaxations above. The surface k-path comprises two line segments: KΓ ∥ (112̄0) in the x-direction and ΓM ∥ (1̄100) in the y-direction (sketch in Fig. 9.4 (b)). The bands associated with the surface are separated from the bands of the simulated bulk by means of the PAW projections. In detail, the summed PAW projections of the atoms of the upper half slab divided by the sum over all PAW projections of the slab minus 0.5 defines the line width. Thus, a band which is totally localised in the upper half slab is displayed at the maximum line width, while bands localised to less than 50 % in the upper half are displayed as hair lines. The colour corresponds to whether the bands are associ- ated with the substrate (Bi and Se atoms, blue) or with the adlayer (Sb atoms, red). The projected bands of bulk Bi2Se3 are shaded grey. 9.2.1 Band structures The bands of the clean substrate (Fig. 9.4 (a)) show that the topologically protected surface states arising from the bulk (TSSBS) manifest themselves as a single Dirac cone at the Γ-point. The Dirac point D is right above the VBM and defines the Fermi energy of the slab. The presence of an odd number of Dirac cones located at the surface Γ-point confirms the expectations from the parities of bulk Bi2Se3, according to the extended topological classification of TIs with inversion symmetry [159]. The theoretical band structure compares qualitatively well with ARPES measurements [P7, 165, 166, 169]. However, there are two minor, quantitative discrepancies: Firstly, the experimental binding energy of D is positive in ARPES, e.g. 0.2 eV in [166], while the theoretical one is zero. The reason for this deviation is that Bi2Se3 samples are often 𝑛-type due to Se vacancies [166, 169]. Furthermore, ARPES measurements on TIs suffer from the problem that the inherent UV illumination varies the binding energy of D by 0.2 eV over the time of measurement [165]. The second quantitative 149 d (Å) 9 The antimony on bismuth selenide heterostructure K Γ M D* PSb 0.3 (a) Clean Bi2Se (b)3 0.8 0.2 0.1 0.4 0.0 D TSSBS -0.1 0.0 -0.2 -0.4 PBE SOC D2 D2+SOC Structural optimisation scheme -0.8 M y BS Sb K xΓ 0.8 0.4 0.0 0.4 0.8 K Γ M K Γ M (c) 1BL-Sb (d) 2BL-Sb 0.8 0.8 0.4 0.4 TSSSb D* TSSSb D* 0.0 0.0 PSb RBS B -0.4 -0.4 B PSb RBS -0.8 -0.8 0.8 0.4 0.0 0.4 0.8 0.8 0.4 0.0 0.4 0.8 k (Å−1) k (Å−1ǁ ǁ ) Figure 9.4: Band structures of (a) the clean Bi2Se3 surface and the heterostructures (b) 1BL- Sb and (d) 2BL-Sb optimised with DFT-D2+SOC. All energies are reset by the Fermi levels of the respective slabs. The line colour corresponds to the colour bar: blue → substrate; red→ Sb adlayer. The line width corresponds to the localisation in the upper half slab (see text). The projected bands of bulk Bi2Se3 are shaded grey. Arrows highlight special features. (b) Positions of the Γ-point features (red uptriangles→ PSb; red crosses→ D∗; grey boxes→ bulk conduction bands) of the 1BL-Sb structure optimised with different schemes (pure PBE, PBE+SOC, PBE+D2, PBE+D2+SOC). Reproduced from [P7]. discrepancy consists in differing shapes of the Dirac cone. The lower part is rather flat in the theoretical band structure, while it is more X-like in ARPES [P7, 169]. The origin of this lies in the many-body effects which are insufficiently accounted for by DFT. The correction of the self-energy of the quasiparticles within GW partially reverts the band inversion so that the furrow at the Γ-point bulges out [170]. As a further consequence, the band gap decreases, which is exactly the opposite of what is expectable from GW corrections in other, topologically trivial semiconductors [170]. Concerning the surface, GW renders the shape of the Dirac cone more X-like, which then compares very well to the ARPES bands [163, 164]. When the first Sb BL adsorbs to the substrate, the band structure undergoes funda- mental changes (Fig. 9.4 (c)). The Fermi level rises into the CBs of bulk Bi2Se3. The 150 E − EF (eV) E − EF (eV) E − EF (eV) 9.2 Electronic properties original Dirac cone is not recognisable any more and new surface features emerge instead. Above the first set of bulk CBs, two X-shaped bands form a Dirac point D∗ at −0.2 eV binding energy. They are interrupted by the bulk CBs and have a visible gap in the ΓM direction, while the gap along ΓK vanishes. If they are nevertheless continued across the bulk CBs, they represent a set of surface bands which are more associated with the Sb adlayer than with the substrate. Since it seems that they preserve the topological non-triviality of the surface band structure, they are denoted TSSSb. A detailed analysis of the topological properties can be found below (Sec. 9.3). Beside TSSSb, there are three further important surface features: A band resembling a lying B, a hole-like double parabola PSb with apex at 0.1 eV binding energy and Rashba-like bands RBS at the bulk CBs. The B band is more associated with the substrate than with the adlayer. PSb is completely localised in the adlayer and represents the Sb–Sb bonds as indicated by the strong (p𝑥 + p𝑦) character (not shown). The RBS states are completely localised in the surface region of the substrate, more precisely mostly in tQL. Since they are right and left shifted versions of the bulk CBs, they seem indeed to represent a Rashba splitting at tQL. The addition of a second BL alters the band structure further (Fig. 9.4 (d)), though, the qualitative features of the 1BL-Sb band structure are all still present. Compared to 1BL-Sb, the Fermi level rises by 0.1 eV. TSSSb becomes completely localised in the adlayer and overall steeper. It has a clear gap where the bulk CBs pass, in both along ΓM and ΓK. The lower part of the former X-shaped band around D∗ bends upwards. The B band is steeper and its apices are near the Fermi level. In contrast, PSb shifts to higher binding energies so that it is right above the bulk VBM. The Rashba splitting RBS is much stronger. An additional feature, which is less pronounced in 1BL-Sb, consists in the wavy bands in the ΓM direction at −0.2 eV binding energy. The interlayer distance 𝑑1 of 1BL-Sb influences the alignment of the bands, which becomes visible in the bands of the structure models which were optimised with different corrections disabled (Sec. 9.1.2). Fig. 9.4 (b) shows the schematic alignment between the bulk CBs, PSb and D∗ for these structures. The more the adlayer moves away from the surface (from right to left in the diagram), the closer PSb and D∗ are together, until D∗ finally dips into the CBs. It confirms the importance of accounting for both vdW corrections and SOC during the relaxations. A detailed explanation for the dependence between D∗, PSb and 𝑑1 can be found below (Sec. 9.3). Comparison with ARPES The theoretical band structures are in well agreement with ARPES measurements, which were performed by the experimental collaborators of the joint-experimental- theoretical work [P7] (Fig. 9.5). The surface states TSSSb and B are clearly visible in ARPES and conform with the theory regarding the relative binding energies, slopes and curvatures. Also the quantitative differences between 1BL-Sb and 2BL-Sb and between the directions of the k-path are well reproduced. The ARPES image of 2BL- Sb clearly confirms the Rashba splitting RBS. In the ARPES image of 1BL-Sb, RBS is difficult to recognise as the splitting is weaker and superposed by blurry bulk CBs. The peak PSb is missing because in-plane p𝑥 and p𝑦 bands are generally suppressed by matrix-element effects [P7]. 151 9 The antimony on bismuth selenide heterostructure (a) 1BL-Sb (b) 2BL-Sb -M Γ M -M Γ M -M Γ M -M Γ M 0.0 0.0 0.2 0.2 0.4 0.4 0.6 0.6 0.8 0.8 -0.4 -0.2 0.0 0.2 0.4 -0.4 -0.2 0.0 0.2 0.4 -0.4 -0.2 0.0 0.2 0.4 -0.4 -0.2 0.0 0.2 0.4 -K Γ K -K Γ K -K Γ K -K Γ K 0.0 0.0 0.2 0.2 0.4 0.4 0.6 0.6 0.8 0.8 -0.4 -0.2 0.0 0.2 0.4 -0.4 -0.2 0.0 0.2 0.4 -0.4 -0.2 0.0 0.2 0.4 -0.4 -0.2 0.0 0.2 0.4 k (Å−1) k (Å−1) k (Å−1ǁ ǁ ǁ ) k (Å−1ǁ ) Figure 9.5: Comparison between the theoretical band structures (right, cut out from Fig. 9.4) and ARPES images (left, features highlighted) of the heterostructures (a) 1B-Sb and (b) 2BL- Sb. Top row: mid part of -MΓM; Bottom row: mid part of -KΓK. Reproduced from [P7]. The good agreement between the ARPES image of the 2BL-Sb system and the respect- ive theoretical band structure supports the P0120 structure model. The energetically competitive, inverted structure models P0102, P1012 and P1021 yield band structures which look similar at first glance, but differ in the details. The band structures can be found in the appendix (Fig. C.1). Along ΓM, the inwards bending of TSSSb towards the Γ-point and the crossing between the B band and RBS are unique for P0120. Along ΓK, P0120 produces a slope for TSSSb which agrees with the ARPES image, while the slopes of all other structure models are too steep. The two BL-Sb band structures are too similar to each other to evaluate them by the ARPES image. The main distinctive feature is the distance between P and D∗Sb , which is 0.3 eV for P01 and 0.2 eV for P10. However, both points are not visible in ARPES. The changes in the band structures upon layer inversion can partially be attributed to the larger interlayer distances, as further elaborated in Sec. 9.3. The spin texture The magnetic projections𝑚𝑙𝑚,𝐼 (𝑛, 𝒌〈) are an alog〉ous to〈the band projections 𝑃 𝑙𝑚,𝐼 (𝑛, 𝒌) 𝑖 in Eq. (4.41): 𝑚𝑙𝑚,𝐼 〉 (𝑛, 𝒌) 1= Ψ𝑛(𝒌) 𝑌 𝐼𝑙𝑚 · 𝜎 𝐼 𝑖 2 𝑖 · 𝑌𝑙𝑚 Ψ𝑛(𝒌) (9.90) where 𝜎𝑖 are the Pauli matrices, 𝑖 is the spatial direction, |Ψ𝑛(𝒌)⟩ is the spinor at band index 𝑛 and k-point 𝒌, and |𝑌 𝐼 ⟩ are the spherical harmonics centred at atom 𝑙𝑚 𝐼 . The sum over 𝑙𝑚 of the magnetic projections of a specific atom 𝐼 for a specific 𝒌 and 𝑛 is simply denoted atomic magnetisation 𝑚𝐼 (𝑛, 𝒌). The atomic magnetisations 𝑖 permit a band colouring which is completely analogous to that based on the PAW projections. In order to obtain the local spin texture, the upper half slab is divided into two groups: the substrate and the adsorbate. For each group, the magnetisation 𝑚𝑖 (𝑛, 𝒌) is obtained by summing the 𝑚𝐼 (𝑛, 𝒌) of the respective atoms. The line col-𝑖 our corresponds to a colour bar: blue→ negative magnetisations and red→ positive 152 Binding energy (eV) Binding energy (eV) 9.2 Electronic properties K Γ M K Γ M (a) z y M 0.8 1BL-Sb (b) 2BL-Sb (c) K 0.4 x 0.0 Γ m -0.4 (a.u.) -0.5 +0.5 -0.8 my mx my mx K Γ M 0.8 0.4 0 0.4 0.8 0.8 0.4 0 0.4 0.8 0.8 Clean (d) (e) (f) 0.4 0.0 -0.4 -0.8 my mx my mx my mx 0.8 0.4 0 0.4 0.8 0.8 0.4 0 0.4 0.8 0.8 0.4 0 0.4 0.8 K Γ -K K Γ -K K Γ -K 0.8 (g) (h) (i) 0.4 0.0 -0.4 -0.8 mz mz mz mz mz mz 0.8 0.4 0 -0.4 -0.8 0.8 0.4 0 -0.4 -0.8 0.8 0.4 0 -0.4 -0.8 k (Å−1 −1ǁ ) kǁ(Å ) kǁ (Å−1) Figure 9.6: Spin texture of (d), (g) the clean Bi2Se3 surface and the heterostructures (b), (e), (h) 1BL-Sb and (c), (f), (i) 2BL-Sb. The line width and Fermi levels are analogous to Fig. 9.4. The line colour corresponds to the colour bar in (a): blue→ negative magnetisations; red→ positive magnetisations. (b), (c): 𝑚 𝑦 along ΓK and 𝑚𝑥 along ΓM of the Sb adlayer: (d) – (f): the same for the Bi2Se3 substrate only; (g) – (h): 𝑚𝑧 along ΓK and Γ-K. (a) Brillouin zone with a schematic helical spin texture. Reproduced from [P7]. magnetisations. The line width corresponds to the localisation in the upper half slab (as in Fig. 9.4) and suppresses bands of the simulated bulk and the other surface. The results are shown in Fig. 9.6. The three columns represent the clean substrate, 1BL- Sb and 2BL-Sb. The first row shows 𝑚 𝑦 (𝑛, 𝒌) along ΓK (the x-direction) and 𝑚𝑥 (𝑛, 𝒌) along ΓM (the y-direction) for the Sb adlayer only. The second row shows the same for BS only. The third row shows the z-component 𝑚𝑧 (𝑛, 𝒌) along KΓ-K (the x-direction) for the entire half slab. The Kramers degeneracy (Eq. (7.74)) tells that the magnet- isations along Γ-𝑋 and Γ𝑋 , 𝑋 ∈ [K,M], are equal up to the sign, which was explicitly confirmed (not shown). All other components, e.g. 𝑚𝑥 (𝑛, 𝒌) along ΓK, vanish. 153 E − EF (eV) E − EF (eV) E − EF (eV) BS (+ Sb) BS only Sb only 9 The antimony on bismuth selenide heterostructure The Dirac cone of the clean substrate (panel (d)) is characterised by a magnetisation which is strictly perpendicular to the k-vector. The in-plane magnetisations rotate clockwise in the upper cone (sketched in (a)) and anticlockwise in the lower cone. The perpendicular relation between the magnetisation (“spin”) and the k-vector (“mo- mentum”) is called spin-momentum locking: In the model of nearly free electrons, the spins of surface states underlying strong SOC are perpendicular to the momenta, as the CPD Hamiltonian shows (Eq. (2.17), ∇𝑉 ∥ e𝑧 ⇒ 𝝈 ⊥ 𝒑 ∼ 𝒌, [171]). The spin-momentum locking is confirmed by the band structures of this work as all mag- netisations parallel to 𝒌 and associated with the surface remain white (not shown). The T-symmetry causes the rotation (Kramers degeneracy). Panel (g) shows that the magnetisation is in-plane near the Γ-point, but gains a finite out-of-plane component 𝑚𝑧 along ΓK. This component positive for three 𝐶3-related K-points, negative for the other three K-points and zero along ΓM. The wave-like out-of-plane tilting of the spin when circling D is due to the hexagonal warping effect, as arising from higher-order 𝒌 · 𝒑 theory combined with the spatial symmetries of the surface [172]. In 1BL-Sb, the Dirac cone around D∗ (b) has the same spin texture as the original Dirac cone of the clean substrate (d). However, progressing on the ΓK path, the spin of TSSSb rotates from (𝑚 𝑦 > 0, 𝑚𝑧 = 0) via (𝑚 𝑦 = 0, 𝑚𝑧 < 0) to (𝑚 𝑦 < 0, 𝑚𝑧 < 0) ((b) and (h)). The B band behaves similarly, but with opposite signs and an additional sign change when it connects to double peak PSb. The latter has thus a coloured up- per branch, while the lower branch shows only weak magnetisations. The overall 𝑚𝑧 contributions are considerably stronger than those of the clean substrate, indic- ating a stronger hexagonal warping effect. Along ΓM, the magnetisations of TSSSb, B and PSb remain strictly in plane and are rather constant due to hexagonal warping. The magnetisations furthermore show that the substrate (e) the adlayer (b) equally contribute to TSSSb and the B band, while PSb is only barely visible in the substrate bands. The magnetisations of RBS are completely in-plane and localised in the sub- strate (e). They resemble a double parabola, whose branches are shifted leftwards (red) and rightwards (blue). This agrees with the expectations from the Rashba effect. The spin texture of 2BL-Sb looks very similar to that of 1BL-Sb. The states around D∗ have the same spin texture as the original Dirac cone and are mainly localised in the adlayer (c). The in-plane magnetisation of TSSSb is weaker and the out-of-plane component very strong (i). Also the Rashba splitting is stronger than in 1BL-Sb (f). 9.2.2 Simulated STM images Scanning tunnelling microscopy STM sheds light on the morphology of surfaces. For the above systems, theoretical STM images are simulated according to the theorem of Tersoff and Hamann [173]. For this purpose, the local density of states (LDOS) is integrated over an energy frame from the Fermi level to the STM bias voltage 𝑉bias. The experimental STM images of 1BL-Sb and 2BL-Sb were taken by the collaborators of [P7] at 𝑉bias = +0.2 V in constant-current mode, where the positive sign indicates the probing of empty states (for details, see [P7]). Therefore, the STM simulation consists in summing the partial charges of all states between 𝐸F and 𝐸F + 0.2 eV. The resulting isosurfaces are shown in Fig. 9.7. 154 9.2 Electronic properties The experimental STM image of 1BL-Sb (panel (a)) is characterised by a hexagonal pattern of bright spots. The theoretical image (b) reproduces this very well and the underlying structure (c) shows that these spots indeed stem from charge bulbs above the upper Sb atoms. Since the lower Sb atoms are invisible due to the strong signal from the upper Sb atoms, the symmetry of the STM image is hexagonal, despite the trigonal symmetry of the atomic arrangement. The STM image of 2BL-Sb (d) looks different from 1BL-Sb, despite the same isocurrents and same geometry of the up- permost atomic layer. The bright spots are triangular and connected by bridges. Consequently the symmetry of the image is clearly trigonal. The theoretical image (e) reproduces this very well. An astonishing detail is that the triangular spots point upwards, not downwards as the geometry of the top BL (f) suggests. In other words, the apparent buckled-honeycomb pattern is exactly inverted with respect to the real buckled Sb honeycombs: The bright spots are above the upper Sb atoms of the top BL, the bridges above the holes and the apparent holes above the lower Sb atoms. The reason for this is that the charge bulbs of the upper Sb atoms of BL1 penetrate the holes of BL2 and are thus probed beside the bright charge bulbs of the upper Sb atom of BL2. This demonstrates that STM only measures charge densities, which can deviate from real atomic arrangements. max. (a) (b) (c) (d) (e) (f) min. 1nm 1nm Figure 9.7: STM images of (a) – (c) 1BL-Sb and (d) – (f) 2BL-Sb. (a) and (d) are experimental images taken at𝑉bias = +0.2 V and 𝐼 = 20 nA (empty states). (b) and (e) are theoretical images simulated at the same voltage. (c) and (f) show the underlying structure of the simulated images. Grey circle are Se atoms; Blue/purple circles are the Sb atoms of BL1/BL2. Red rhombi indicate the surface unit cell. Reproduced from [P7]. The last point of this section concerns the real-space distribution of the Dirac points D and D∗. As these points are fourfold degenerated due to T-symmetry and the P- symmetry of the entire slab, the DFT code produces states which are artificially loc- alised at both surfaces, although the partial charges still show a thick region with vanishing density in the simulated bulk. Averaging all four states of the Dirac points solves this symmetrisation problem. Fig. 9.8 shows the isosurfaces of the partial charges of the Dirac points in real space. For the clean substrate, the isosurface (panel (a)) as well as the plane average (d) show that the partial charge of D extends deeply into the substrate, far beyond tQL. This explains why the slab has to be so thick in order to entirely disentangle the simulated surfaces. The highest peak in the plane average coincides with the bond between the top Se atom and the Bi atom below (grey dashed line). The partial charge of D∗ in 1BL-Sb (b) is more localised in the sur- face region as it already decays to near zero in tQL. The two largest peaks in (d) stem from charge bulbs at the top Sb atom (blue dashed lines), which look like p𝑧 orbitals. The PAW projections indeed confirm that TSSSb has strong Sb-p𝑧 contributions (not 155 height (a.u.) 9 The antimony on bismuth selenide heterostructure (a) Clean (b) 1BL-Sb (c) 2BL-Sb (d) xy-average Figure 9.8: Real-space isosurfaces of D* the wavefunctions at the Dirac points D* D and D∗ of (a) the clean substrate, (b) 1BL-Sb and (c) 2BL-Sb. Grey circles D are Bi and Se atoms; Blue/purple circles are the Sb atoms of BL1/BL2. (d) Corresponding plane averages. Reproduced from [P7]. shown). In 2BL-Sb (c) the partial charge even more concentrates in the top Sb layer. As in 1BL-Sb, the p𝑧 orbitals of the top Sb atom contribute most to the plane aver- age (purple dashed lines). The comparison between the three systems shows that the TSSs are transported to the surface upon Sb layer adsorption and that the vertical confinement of the states strengthens from the clean substrate to the 2BL-Sb hetero- structure. This clearly hints that the transition between different topological classes of the bulk and the vacuum occurs in the topmost vdW layer. For this reason, the Sb adlayer can be considered topologised in that it quasi belongs to the non-trivial substrate and produces TSSs at its interface with the trivial vacuum [P7]. 9.3 The topological phase transition The chapter concludes with a deeper analysis of the electronic interplay between the Bi2Se3 substrate and the Sb adlayer and confirms that latter indeed topologises. For this purpose, the distances between the Sb BLs and the substrate are varied between long (free-standing sheets plus clean substrate) and short (heterostructure). By cal- culating the bands for each configuration, the electronic properties can be tracked along a simulated desorption-and-readsorption process. According to pp. 126ff, the fundamental behaviour of surface states follows from the topological class of the underlying bulk. The combined band structure of the clean substrate and a free-standing, thin Sb sheet is expected to be a superposition of two uncoupled entities: the non-trivial surface bands of the former (analogous to Fig. 7.5 (c)) and the trivial 2D bands of the latter (analogous to Fig. 7.5 (b)). In the adsorp- tion process, which finally results in the bands of the heterostructure (Fig. 9.4), the non-trivial Dirac cone of BS(0001) and the trivial 2D bands of the Sb sheet obviously vanish and, instead, a joint, non-trivial band structure appears. Thus, at a certain point, the Kramers pairs of the Sb sheet have to change partners so that they integ- rate themselves into the non-trivial surface states of the substrate. Such a partner change is a topological phase transition. The topological class of surface states can be checked by means of the connections between the surface Kramers pairs (SKPs) at the four surface TRIMs. Fig. 9.9 shows the hexagonal surface Brillouin zone and the TRIM rhombus (bold). One TRIM is Γ0, the Γ-point of the zeroth Brillouin zone (black). Two further TRIMs are the M-points M0 and M010 01. The last TRIM is M011, which equals 156 2BL-Sb 1BL-Sb Clean 9.3 The topological phase transition M11̄1 of the next Brillouin zone (grey) and is thus identical to M01̄1. The original k-path of Fig. 9.4 connects the non-TRIM K0 0 011 and the TRIMs Γ and M1̄1. It is prolonged in the x-direction so that it comprises the line segments Γ0M0 0 01̄1 and Γ M11 (dashed lines in Fig. 9.9). The symmetries of BS(0001) imply that all M-points yield the same 𝛿𝑖 product2. Thus, the SKP connections between Γ0 and any of the M-points character- ises the entire surface band structure. Nevertheless, the complete k-path in Fig. 9.9 is employed for the band structure calculations as it permits tracking the spin texture at the same time. The details of the desorption-and-readsorption procedure are as follows: Starting from the stable 2BL-Sb hetero- M01̄1 M001 structure (𝑑1 = 2.3 Å and 𝑑2 = 2.5 Å), BL2 is rigidly lif- K0 M 0 11 11 0 ted by 6.0 Å in the first stage so that Γ𝑑2 = 8.5 Å. This rep- 0 resents the 1BL-Sb heterostructure plus an isolated 1 BL M10 Γ1 sheet. In the second stage, BL1 is lifted by 6.0 Å as well so y that 𝑑1 = 8.3 Å and 𝑑2 = 2.5 Å. This represents the clean x substrate plus an isolated 2 BL sheet. In the third and last stage, the 2 BL sheet is lowered by −6 0 Å so that the ori- Figure 9.9: Hexagonal Bril-. louin zone with TRIMs. ginal 2BL-Sb heterostructure is restored. Each stage is lin- early sampled by 20 structures, the images, so that the BLs move vertically by 0.3 Å from image to image. For each image, the band structure is calculated and plotted according to four different modes: 1) Line width → localisation in the upper half slab; RGB colour → localisation in BL2, BL1 and tQL, respectively. This extracts the surface bands and indicates whether they belong to BL2 (red), BL1 (green) and/or tQL (blue). 2) Line width → tQL; blue/red colour → magnetisation of tQL. This extracts the bands of tQL and shows their spin texture. The direction of the plotted magnet- isation is perpendicular to the k-path (spin-momentum locking, vide supra). 3) Same as 2) for BL1. 4) Same as 2) for BL2. Fig. 9.10 shows the band structures of 16 out of the total 56 images. The energies are reset to the VBM of bulk Bi2Se3. The Fermi energy of the slab is in- serted as a black horizontal line. Four special cases provide important subsystems (yellow background, blue labels): (a) the 2BL-Sb heterostructure, (e) the 1BL-Sb het- erostructure plus an isolated 1 BL sheet, (i) the quasi clean substrate plus two quasi isolated 1 BL sheets, (m) the clean substrate plus an isolated 2 BL sheet. For these im- ages, all four plotting modes are shown. For all other images, only mode 1) is shown. The respective structure model with the interlayer distances is displayed next to the band structures. Red arrows indicate the order of the images. The full set of images is available as an mp4 file in the supplementary material of [P9]. 2This restriction of the combinatorics is the central reason why it is easier to realise TIs in trigonal systems, than in any other symmetry class. 157 M0 K0 Γ0 M0 M0 K0 Γ0 M0 M0 K0 Γ0 M0 M0 0 0 011 11 1̄1 11 11 1̄1 11 11 1̄1 11 K11 Γ M1̄1 (a) 1 D* tQL BL1 BL2 0.5 R my mx my mx my mx 0 P -0.5 -1.5 -1 -0.5 0 0.5 -1.5 -1 -0.5 0 0.5 -1.5 -1 -0.5 0 0.5 -1.5 -1 -0.5 0 0.5 tQL BL1 BL2 k‖ (Å−1) m (a.u.): − + (b) (c) (d) gap closing (e) D* 1BL-Sb 1BL-Sb isolated BL R P (f) (g) (h) partner change (i) P1 quasi quasi R clean BS isolated BL isolated BL P2 D' (j) (k) (l) gap closing (m) R clean BS isolated 2BL isolated 2BL P1' D' P2' (n) (o) (p) gap gap opening opening Figure 9.10: Topological phase transitions in the Sb@BS heterostructure (details in text). Reproduced from [P9]. E (eV) 9.3 The topological phase transition In the first image (a), the surface band structure of 2BL-Sb is recognisable as well as the SKPs D∗, RBS (abbreviated to R) and PSb (abbreviated to P). The SKP connections are as follows: Both branches of P are connected to the VBs (denoted v-trivial) and both branches of D∗ are connected to the CBs (denoted c-trivial). Only the branches of R are connected to the VBs and CBs (denoted non-trivial) wherefore this SKP reflects the topological non-triviality of the substrate. That the lower branch going from R into the VBs changes its character from tQL to Sb indicates a strong mixing between the substrate and the adlayer. Lifting BL2 deforms the bands. The lower branch of D∗ bends down from (a) to (d) and closes a gap along Γ0M011, while the character changes from BL2 (red) to BL1 (green). The gap closing does not entail any partner changes, though, as R is still non- trivially connected along Γ0M01̄1. At the same time, P rises towards R and two further v-trivial peaks belonging to BL2 (red) emerge from the bulk VBs. From (d) to (e), the band structure changes only slightly when it transforms into a superposition of the surface band structure of 1BL-Sb and the (red) 2D band structure of an isolated 1 BL sheet (Fig. 8.4 (c)). The Kramers pairs of the latter are v-trivial. The SKPs of 1BL-Sb show the same connections as those of 2BL-Sb, even though P and R are very close together. The PAW projections shift so that TSSSb has a mixed BL1+tQL character (cyan). Since no partner change occurs from (a) to (e), the desorption of BL2 does not involve topological phase transitions. In the reverse direction of adsorbing a second BL to 1BL-Sb, the transport of the TSSs across BL2 towards the surface of the 2BL- Sb heterostructure consists in a continuos shifting of the PAW projections, while the valence bands of BL2 disappear in the bulk VBs. Lifting BL1 drops D∗ into the bulk CBs (f). At 𝑑1 = 3.6 Å (g), D∗ touches P and R whereupon a partner changing occurs: The upper branch of D∗ connects with the up- per branch of R, which produces a c-trivial SKP, again denoted R. The lower branch of R connects with the upper branch of P, which produces a v-trivial SKP, denoted P1. The lower branch of P connects with the lower branch of D∗, which produces a non- trivial SKP, denoted P2. The partner change is a topological phase transition which detaches the topological non-triviality from R and stores it into a new point P2. It fur- thermore unravels the zigzag manner in which PSb is connected in the 1BL-Sb bands (the B band is connected to the same TRIM as TSSSb, while the lower branch goes elsewhere). As a consequence, the upper Sb peak it is allowed to form the twofold degenerated peak of the isolated 1 BL sheet. In the course of the topological phase transition, the projections of TSSSb shift from tQL+BL1 (cyan) to BL1 (green). In ad- dition to the three new SKPs from mixing the old ones, an additional, v-trivial SKP denoted D′ emerges from the bulk VBs. The middle image of the second stage (i) rep- resents a system of two quasi isolated 1 BL sheets and the quasi clean substrate. The SKPs are clearly visible, as well as the almost complete formation of the valence band peaks of the isolated BL1. Lifting BL1 further towards BL2 closes a gap (j) which restores the Dirac cone of the substrate. By this means P2 swaps SKP connections with D′ whereupon P2 is v-trivial and D′ is non-trivial. This proves that D′ is indeed the original D. From (i) to (m), the bands of the two isolated 1 BL sheets transform into the bands of an isolated 2 BL sheet (Fig. 8.4 (d)). At 𝑑2 = 4.0 Å (k), the upper peak of BL2 and the two peaks of BL1 159 9 The antimony on bismuth selenide heterostructure (P1 and P2) merge whereupon the topmost band with its characteristic furrow comes off. After the merging, the two topmost 2D Kramers pairs are denoted P′1 and P ′ 2. As no gap closing occurs in the bands of the Sb BLs, the topological class of the 2 BL sheet is trivial, as already confirmed in Sec. 8.2.2. In the last stage, the 2 BL sheet is lowered towards the substrate. At 𝑑1 = 4.8 Å (n), the topmost (twofold degenerate) band of the sheet (the 2 BL band) arranges with the bulk CBs so that P′1 touches R. At the same time, a gap opens between the 2 BL band, the Dirac cone of the substrate and the lowermost bulk CB. As a result, D′, P′1 and R change the SKP connections: The inner upper branch of D′ connects to the outer lower branch of the 2 BL band, which produces the B band and renders D′ is v- trivial. The inner upper branch of P′1 connects to the outer upper branch of the Dirac cone and the inner lower branch of P′1 connects to the bulk CBs, so P ′ 1 c-trivial. The inner lower branch of R connects to the outer upper branch of the 2 BL band, which produces TSSSb and renders R non-trivial. The new connections are better visible along Γ0M01̄1, while along Γ0M011, R is apparently c-trivial and P′1 non-trivial due to the vanishing gap. This gap opens at 𝑑1 = 3.1 Å (p) as well and, after a reordering between D’ and the lower Sb peak, the final band structure of 2BL-Sb is restored (a). The magnetisations show that the twofold degenerate bands of the Sb sheets polar- ise when they approach the substrate. The spin texture of the Dirac cone picks the band with the same magnetisation and hybridises with it, while the other band re- mains unaffected. This explains why D* inherits the spin texture of D. If a Sb sheet approaches the substrate, the first topological phase transition happens in the top- most VB, if the sheet is 2 BL thick, and in the second top VB (lower Sb peak) if the sheet is 1 BL thick. The latter is quite surprising at first glance, but can be explained: The bands of the isolated sheets in Fig. 8.4 (c) and (d) show that the topological phase transition happens in those VBs which provide p𝑧 contributions. On the contrary, the topmost VB of a 1 BL thick sheet provides only (p𝑥 + p𝑦) contributions. This leads to the conclusion that the p𝑧 orbitals play the decisive role in the surface topological phase transitions, similarly to the band inversion in the bulk TIs. 9.4 Discussion and summary The analysis of the 1BL-Sb@BS and 2BL-Sb@BS heterostructures showed that the structure models in which the BLs continue the natural stacking sequence of the sub- strate are indeed favourable. However, structure models with inverted BLs are less stable by only ∼10 meV per Sb atom, less than 𝑘B𝑇 at room temperature. Hence, it is likely that inverted structures are thermodynamically relevant. Similarly to the case of bulk Bi2Se3, vdW corrections and SOC reduce the interlayer distance between the substrate and the Sb adlayer. SOC is even more important in the heterostructures than in bulk Bi2Se3. The favourable natural stacking minimises the interlayer dis- tances for geometrical reasons and thus the Sb chemical potential. The band structures were calculated for the naturally stacked heterostructures as well as for the inverted ones. The bands of the latter have never been published be- 160 9.4 Discussion and summary fore, while the bands of the former are unprecedented in detail and clarity. The theor- etical band structures of the natural structure models compare very well with ARPES measurements. Unfortunately, the band features which distinguish the natural struc- ture models from those with inverted BLs, D* and PSb, are invisible in ARPES due to negative binding energies and matrix-element effects. Thus, ARPES is limited in identifying BL inversions. The STM simulation of the natural structure models agrees very well with experimental STM images. In particular, the differences in the sym- metries of the image and the atomic geometry, hexagonal vs. trigonal for 1BL-Sb and inversion for 2BL-Sb, were proven to be due to deviations between the charge dens- ities at the Fermi surface and the atomic geometries. The topological phase transitions during a simulated desorption-and-readsorption process illuminated how the Dirac cone of the clean BS(0001) surface merges with the bands of a thin Sb sheet to produce the joint bands of the heterostructure. When a 1 BL or a 2 BL thick Sb sheet approaches the substrate, a complicated series of to- pological phase transitions occurs. It begins in the p𝑧 bands of the Sb sheet and trivi- alises the Dirac cone, whose Dirac point D is buried in the bulk valence bands. The Dirac point D* is shown to be a remainder of the Sb peaks and topologically trivi- ally connected to the conduction bands. The only non-trivial surface Kramers pair is RBS, the crossing of the Rashba split conduction bands. Its lower branch follows an electron-like parabola near the Γ-point and bends down away from the Γ-point to form TSSSb, which is visible in ARPES. The topological phase transitions occur only when the first (set of) BLs approach the BS surface. The addition of a second BL to 1BL-Sb is continuous in that there are no partner changes. The literature on DFT band structures of the Sb@BS system is spare. Two works with intersecting author lists exist, containing band structures for 1BL-Sb (Jin et al. [168]) and 2BL-Sb (Kim et al. [166]). The band structures look rather different from those presented here. The 1BL-Sb band structure of Jin et al. shows a strong misalignment of D∗ and PSb, the former being in the bulk CBs and the latter being 0.2 eV below D∗, not 0.3 eV. The desorption-and-readsorption process shows that the misalignment is due to a too large interlayer distance 𝑑1 (Jin et al.: 2.71 Å vs. this work: 2.38 Å). Jin et al. obtained the structure by DFT relaxations, but they do not mention whether they employed vdW corrections and/or SOC. The large 𝑑1 suggests that they omitted at least one of the two. The 2BL-Sb band structure of Kim et al. shows the major qualitative difference that there is no gap in the ΓK direction, contrarily to the bands presented here. Unfortunately, Kim et al. give neither details on whether they em- ployed vdW corrections or SOC during the DFT relaxations, nor structure parameters like the interlayer distances. In both [168] and [166], the band structures of the ΓM path are missing, despite the pronounced gaps in TSSSb which show that D∗ does not reflect the topological non-triviality of the substrate. Jin et al. calculated also the band structures under variation of the interlayer distance between a 1 BL sheet and a Bi2Te2Se substrate. Although this reminds of the desorption-and-readsorption pro- cess of this work, they did not analyse the details and did not extract the topological phase transitions. In particular, they did not notice that RBS is non-trivially connected while D∗ is trivially connected to the CBs. Consequently, the SKP method developed in this thesis is indeed a novel tool for extracting surface topological phase transitions in a systematic manner. 161 10 Conclusions In this PhD thesis, two classes of nano-scaled systems were successfully investigated by means of DFT: rare-earth silicide nanowires on silicon surfaces and thin antimony layers on bismuth selenide. A close inspection of the configuration spaces of the nanostructures resulted in different structure models, which were optimised with respect to the atomic positions and whose stability was evaluated by means of ab initio thermodynamics. The detailed analysis of the structural and electronic prop- erties lead to several conclusions about the underlying chemistry and provided a deep insight into the physics of the nanostructures. All results agree well with exper- imental reference on lattice parameters as well as ARPES measurements, Fermi sur- faces and STM images. Finally, the systematic approach of this work and the detailed cross-linking between the structural and electronic properties permitted the identi- fication of fascinating novel phenomena: the dimensional crossover in the nanowire- Tb@Si(557) system and the complex series of multiple topological phase transitions in the Sb@Bi2Se3 heterostructure. The heavy sixth-row elements which occur in both systems were shown to have prop- erties which are unique in the periodic table. The lanthanoids Tb and Er have a stable trivalent state in the bulk RESi2 – x phases, despite the incompleteness of the 4f shell. Also the lanthanoid contraction was demonstrated for the silicides from TbSi2 to ErSi2. The element Bi is even more peculiar as the relativistic effects give rise to the inert-pair effect of the 6s electron, as explicitly demonstrated in bulk Bi2Se3. Furthermore, the relativistic effects are so strong that Bi2Se3 transitions into a topo- logical insulator. Similarly, bulk 𝛽-Sb turned out to be a topological semimetal. The heavy elements necessitated methods beyond DFT: the LDA+U method, vdW correc- tions and spin-orbit coupling, which all proved to be crucial for obtaining correct results for the respective systems. The ℤ2 invariant was elaborated in order to ex- plicitly calculate the topological classes of Bi2Se3 and 𝛽-Sb. Furthermore, it is the basis of the novel SKP method, which was developed in this thesis in order to track and understand the topological phase transitions in the Sb@Bi2Se3 heterostructure. This method can be applied to all surfaces which establish a connection between a substrate and an adsorbate of different topological classes. It proved to be indispensable to analyse the related higher-dimensional systems be- fore treating the actual nanostructures. These were the bulk RESi2 – x phases and the monolayer-Tb@Si(111) system in the case of the TbSi2 nanowires, and bulk Bi2Se3 and 𝛽-Sb in the case of the Sb@Bi2Se3 heterostructures. Though, the connection between the bulk and the respective nanostructure is not as obvious as one may assume. For example, the popular argument of surface-induced strain, which can describe the epitaxy of bulk-like layers on substrates, was proven to be inapplicable 162 for the herein investigated nanostructures. On the contrary, surface strain is neg- ligible in both the monolayer-Tb@Si(111) and the Sb@Bi2Se3 systems, while a large strain would follow from the lattice constants of the respective bulk phases. Hence, the strain in nanostructures has to be evaluated with special caution in order to avoid wrong conclusions. The bulk structures are useful because they permit to understand fundamental structural mechanisms, which apply to the nanostructures as well, e.g. the silicon cages in RESi2 – x or the layer-layer interaction in Bi2Se3 and 𝛽-Sb. Fur- thermore, the electronic properties of the higher-dimensional systems can help to identify the origins of the band structures of the nano-scaled systems. In this work, the bands of the nanowire-Tb@Si(557) system were shown to be derivable from the bands of the monolayer-Tb@Si(111) system, which in turn can be derived from the bands of (hypothetical) CaSi2-TbSi2 by zone projection. In the case of the Sb@Bi2Se3 system, the topological classes of Bi2Se3 and 𝛽-Sb were absolutely crucial for under- standing the topological phase transitions in the heterostructure. Outlook: Rare-earth silicide nanowires on silicon surfaces The investigations of the RESi2 nanowires give rise to at least two further scientific questions. Firstly, the bulk phases of RESi2 – x were shown to be a complex system, despite the small unit cells of the structure prototypes. The results for the supercells with ordered Si vacancies give only an idea about the underlying physics. In order to fully understand the RESi2 – x phases, it is necessary to introduce probability into the distribution of the Si vacancies and to calculate the temperature-concentration phase diagrams. This may finally explain why the stoichiometry of AlB2-RESi2 – b is equal for all REs (𝑏 ≈ 13 ), while the stoichiometry of ThSi2-RESi2 – a depends on the atomic number of the employed RE (𝑎 < 13 ). The second scientific question concerns the RESi2 nanowires on Si(001). Although not included in this thesis, investigations of this system were published in [P3, P6]. Despite a systematic inspection of the configuration space of reasonable structure models, the stability of the nanowires could not be proven by means of straightfor- ward grand-potential-chemical-potential phase diagrams. On the contrary, the width of the nanowires was found to be unlimited in that they would aggregate to a 2D film. The contradictory results were attributed to kinetic effects. Understanding the peri- tectic transition in the bulk phase diagrams might support the further elaboration of this issue. A further consequence of this work for the nanowire-RE@Si(001) system results from the finding that the stable structure models of the monolayer-Tb@Si(111) and nanowire-Tb@Si(557) systems obey the ZKB concept in a strict manner. Thus, it is likely that the nanowires on Si(001) obey the ZKB concept as well. Therefore, the structure models already found should be evaluated regarding this issue and further structure models could be set up based on this. Outlook: Thin antimony layers on bismuth selenide The findings on the Sb@Bi2Se3 heterostructure raise two further scientific questions. The first concerns the layer inversions, which introduce only small penalties into the 163 10 Conclusions Sb chemical potentials. Since these are below 𝑘B𝑇 at room temperature, the omis- sion of the lattice dynamics is certainly questionable. This point should be further investigated, e.g. by the frozen-phonon method. The second question is about GW-corrected band structures. Since Bi2Se3 is a topo- logical insulator, GW exerts a strong effect on the region around the Γ-point, which reduces the band gap and alters the Dirac cones of the clean surfaces of Bi2Se3, Bi2Te3 and Sb2Te3 [163, 164, 170]. For this reason, GW effects on the bands of the Sb@Bi2Se3 heterostructures would be very interesting from a general point of view. Moreover, a corrected band structure would permit a quantitative comparison with ARPES, which could be very helpful for identifying layer inversions, e.g. by the slopes of TSSSb. 164 Appendix A A.1 Mathematical identities The Dirac notation Dirac∑︁nota ti〉o〈n (discrete basis): 〈 𝑛〉 𝑛〉 𝑛 = 𝐼 Completeness 𝑛 𝑛′∑︁= 𝛿 𝑛𝑛〉′〈 〉 ∑︁ 〉 〈 〉 OrthogonalityΨ = 〈 𝑛 𝑛〉Ψ = Ψ𝑛 𝑛 , Ψ𝑛 = 𝑛 Ψ Series expansion𝑛 𝑛𝑂𝑛𝑛′ = 𝑛 𝑂 𝑛′ Matrix representation Dira∫c not at〉i〈on (continuous basis):〈 d𝑥 〉 𝑥 𝑥 = 𝐼 Completeness 𝑥〉𝑥′ ∫= 𝛿(𝑥〈 − 〉〈 𝑥′ ) 〉 〉 ∫ 〉 〈 〉 Orthogonality Ψ = d𝑥 𝑥 𝑥 Ψ = d𝑥 Ψ(𝑥) 𝑥 , Ψ(𝑥) = 𝑥 Ψ Series expansion 𝑂(𝑥, 𝑥′) = 𝑥 𝑂 𝑥′ Matrix representation The translation operator The translation operator𝑈𝑡 is defined by its action on the real space distribution of a state, i.e. 𝑈𝑡 shifts the expectation value of the position operator 𝑥 for every |Ψ⟩ ∈ H by a constant 𝑡. Since translations do not change the scalar product, the translation opera〈to〉r ha〈s to 𝑥 = Ψ b e u〉nitary〈. 〉 〈 〉 〈 〉 〈 〉𝑥 Ψ ⇔ 𝑥 + 𝑡 = Ψ̃ 𝑥 Ψ̃ = Ψ 𝑈−1𝑥𝑈 Ψ = Ψ 𝑥 + 𝑡 𝑡 𝑡 Ψ As this is, in particular, valid for the vectors of any basis, the condition can be sum- marised as a commutator equation. 𝑈−1𝑡 𝑥𝑈𝑡 = 𝑥 + 𝑡 ⇔ [𝑈𝑡, 𝑥] = −𝑡𝑈𝑡 (∗) A1 Appendix A This can be solved by the following ansatz: 𝑈𝑡 = exp(i𝑡 · 𝑠) with [𝑠, 𝑥] = 𝑐 ∈ ℂ ⇒ [𝑈𝑡, 𝑥] = [exp(i𝑡 · 𝑠), 𝑥] (A.93) = i𝑡 exp(i𝑡 · 𝑠) [𝑠, 𝑥] = i𝑡 𝑈𝑡 [𝑠, 𝑥] ⇔(∗) [𝑠, 𝑥] = i As the position ope[rator 𝑥]and the momentum operator 𝑝 hold the canonical com- mutation relation ( − 1ℏ 𝑝, 𝑥 = i), the solution for the translation operator𝑈𝑡 is: 𝑈𝑡 = exp(−i 1ℏ 𝑡 · 𝑝) (A.91) The commutator of operator functions Let 𝐴 and 𝐵 be linear operators acting on a Hilbert space H . The commutator of 𝐴 and 𝐵 shall be a constant 𝑐 [∈ ℂ. It] then holds for 𝑛 ∈ ℕ \ 0: 𝐴𝑛, 𝐵 = 𝐴𝑛𝐵 − 𝐵𝐴𝑛 = 𝑛𝐴𝑛−1𝑐 (A.92) Proof. (A.92) will be prov[en by]mathematical induction. ∀𝑛 ∈ ℕ \ 0 : [𝐴𝑛, 𝐵] = 𝐴𝑛𝐵 − 𝐵𝐴𝑛 = 𝑛𝐴𝑛−1𝑐 (IV) 𝑛 = 1 : [ 𝐴1, 𝐵] = 𝐴𝐵 − 𝐵𝐴 = 𝑐 ⇔ 𝐵𝐴 = 𝐴𝐵 − 𝑐 (IA) 𝑛→ 𝑛 + 1 : 𝐴𝑛+1, 𝐵 = 𝐴𝑛+1𝐵 − (𝐵𝐴)𝐴𝑛 = 𝐴𝑛+1𝐵 − (𝐴𝐵 − 𝑐)𝐴𝑛 (IS) = 𝐴(𝐴𝑛𝐵 − 𝐵𝐴𝑛 + 𝑐𝐴𝑛−1) IV= 𝐴(𝑛𝐴𝑛−1𝑐 + 𝐴𝑛−1𝑐) = (𝑛 + 1)𝐴𝑛𝑐 □ A corollary is the generalisation to functions of operators which per definitionem have a converging p∑︁ower series:∞ 𝑓 (𝐴) = 𝑓 𝑘𝑘𝐴 ⇒ [ 𝑓 (𝐴), 𝐵] = 𝑓 (𝐴)𝐵 − 𝐵 𝑓 (𝐴) = 𝑓 ′(𝐴)𝑐 (A.93) 𝑘=0 The Fourier series in multiple dimensions Let 𝑓 : 𝒙 → ℂ be a periodic function on a latticeR with unit cell C ∑︁ R , reciprocal lattice G, as defined in Sec. 2.2. Then 𝑓 has the following Fourier series: 𝑓 (𝒙) ∼ e∫xp(i𝒙 · 𝑮) 𝑓𝑮 (A.94a)𝑮∈G 1 𝑓𝑮 = |C | d𝑥 exp(−i𝒙 · 𝑮) 𝑓 (𝒙) (A.94b)R CR A2 A.2 The principle of minimal coupling If, in addition, 𝑓 ∈ H = 𝐿2(C∑︁R), the Parseva∫l theore m ho 2 1 lds [174, pp. 145f]: 𝑓𝑮 = |C | d𝑥 2𝑓 (𝒙) (A.95) 𝑮∈G R CR If, in addition, 𝑓 is co∑︁ntinu ous and its d∫erivative is piecewis e continuous, then: 𝑮 2 −i𝑓𝑮 = |C | d𝑥 𝑓 ∗(𝒙)∇ ′ 𝑓 (𝒙′𝑥 ) (A.96)′ 𝑮∈G R CR 𝒙 =𝒙 Eq. (A.96) follows from the Parseval theorem applied on 𝑓 and its gradient ∇ 𝑓 ex- pressed as a Fourier series [174, pp. 148ff]. (∇ 𝑓 )𝑮 = i𝑮 𝑓𝑮 A.2 The principle of minimal coupling Let 𝐻0(𝒑, 𝒙) be the Hamilton operator of a f〉ree p ar〉ticle with charge 𝑞:𝐻0(𝒑, 𝒙) Ψ = 𝐸 Ψ Then, the introduction of a vector potential 𝑨(𝒙, 𝑡) and a scalar potential 𝜑(𝒙, 𝑡) mod- ifies the Hamilton ia〉n as follows : 〉 〉 〉𝐻0(𝒑 − 𝑞𝑨, 𝒙) Ψ = (𝐸 − 𝑞𝜑) Ψ ⇔ [𝐻0(𝒑 − 𝑞𝑨, 𝒙) + 𝑞𝜑] Ψ = 𝐸 Ψ (A.97) Eq. (A.97) can be motivated by the Hamiltonian for the classical problem. For this purpose, let 𝐻c0 be the Hamiltonian for a classical charged particle moving in an elec- tromagnetic field. The coupling of the electromagnetic potentials to the momentum and the energy yields the correct classical Lorentz force: c (𝒑 − 𝑞𝑨)2𝐻0 = + 𝑞𝜑2𝑚 𝒙¤ ∇ c 1= 𝑝𝐻0 = (𝒑 − 𝑞𝑨)𝑚 1 𝒑¤ = −∇ 𝐻{c𝑥 0 = − } ∇𝑥 (𝒑 − 𝑞𝑨)2 − 𝑞∇𝑥𝜑 = 𝑞 [(𝒙¤ · ∇𝑥) 𝑨 + 𝒙¤ × (∇𝑥 × 𝑨) − ∇ 𝜑]2 𝑥𝑚d𝑨 𝜕𝑨 + c 𝜕𝑨= 𝑨, 𝐻0 = + (𝒙¤ · ∇𝑥[) 𝑨d𝑡 𝜕𝑡 𝜕𝑡 ] ⇒ d d𝑨𝑚𝒙¥ = (𝒑 − 𝑞𝑨) = 𝒑¤ − 𝑞 = 𝑞 −∇ − 𝜕𝑨𝑥𝜑 + 𝑞𝒙¤ × (∇d d 𝑥 × 𝑨) = 𝑞𝐸 + 𝑞𝒗 × 𝑩𝑡 𝑡 𝜕𝑡 Eq. (A.97) follows from the correspondence principle by replacing the canonical vari- ables 𝒑 and 𝒒 in 𝐻c0 with the respective operators. A3 Appendix A The proof for the meaningfulne ss〉of Eq. (A.97) consists in a test for gauge invariance.For this purpose, the solution Ψ is subjected to a 𝑈 (1) unitary transformation, i.e. for an arbitrary function 𝜒 (𝒙〉, 𝑡), th e t〉ransformed states ar〉e:Ψ̃ = 𝑈𝜒 Ψ = exp(i 𝑞ℏ 𝜒(𝒙, 𝑡)) Ψ (A.98) Consequently, the operators transform as follows: 𝒙 = 𝑈 −1𝜒 𝒙𝑈𝜒 = 𝒙 (A.99a) ?̃? = 𝒑 −1 = exp(i 𝑞 ) (−i ∇ ) exp(−i 𝑞𝑈𝜒 𝑈𝜒 ℏ𝜒 ℏ 𝑥 ℏ𝜒) == 𝒑 − 𝑞∇𝑥𝜒 (A.99b) 𝑞 𝜕 𝑞 𝜕𝜒 𝐸 = 𝑈𝜒 𝐸𝑈 −1 𝜒 = exp(i ℏ𝜒) (iℏ ) exp(−i𝜕𝑡 ℏ𝜒) = 𝐸 + 𝑞 (A.99c)𝜕𝑡 With these relations, the full transformation of the holds: 〉 differential equation Eq. (A.97) 0 = [𝐻0(𝒑 − 𝑞𝑨, 𝒙) − (𝐸 − 𝑞𝜑)] Ψ = 𝑈[ [𝐻 −1 〉𝜒 0(𝒑 − 𝑞𝑨, 𝒙) − (𝐸 − 𝑞𝜑)]𝑈 𝑈 𝜒 𝜒 Ψ ] [ 〉 (A.100) = 𝐻0(𝒑 − 𝑞∇𝑥𝜒 − 𝑞𝑨, 𝒙) − (𝐸] 𝜕𝜒 = 𝐻0(𝒑 − 𝑞?̃?, 𝒙) − (𝐸 − 𝑞?̃?) + 𝑞〉 − 𝑞𝜑) Ψ̃𝜕𝑡 Ψ̃ with 𝜕𝜒 ?̃? = 𝑨 + ∇𝑥𝜒, and ?̃? = 𝜑 − (A.101) 𝜕𝑡 The transformed differential equation in Eq. (A.100) has the same structure as be- fore, but different electromagnetic potentials. These are linked to the original ones by a gauge transformation according to Eq. (A.101). Therefore, the electromagnetic fields and, thus, the physics remain the same. The consistency between the phase shift applied to |Ψ⟩ (Eq. (A.98)) and the gauge-transformed electromagnetic poten- tials (Eq. (A.101)) proves that the way of introducing electromagnetic potentials into the Hamilton operator is gauge invariant. Eq. (A.97) is called the principle of minimal coupling since the electromagnetic poten- tials couple only via the charge 𝑞 to the physics of the quantum particle. In other words, the effect of electromagnetic fields on the physics of a quantum particle is determined only by one scalar parameter and nothing else (like dipoles etc.). The principle of minimal coupling fulfils the physical prerequisite of gauge invariance. Moreover, it is a proof that the properties of electromagnetic fields arise naturally from the phase degree of freedom of the wavefunction |Ψ⟩. The principle of min- imal coupling can introduce electromagnetic fields into any differential equation of a free particle depending on energy and momentum operators, in particular the non- relativistic Schrödinger equation and the relativistic Dirac equation. A4 Appendix B B.1 The terms and ionisation energies of the lanthanoids Hund’s second rule can be illustrated by the terms of the isolated, neutral Ln atoms. The divalent VEC of the first lanthanoid La (4f15d06s2) is not stabilised by any ex- change energy. Therefore, the electron leaves the 4f shell and occupies the 5d shell instead (actual VEC: 4f05d16s2, trivalent). For the second lanthanoid Ce, the exchange energy of two 4f electrons is too small to stabilise the divalent VEC wherefore, as in the case of La, one 4f electron is lifted to the 5d shell (actual VEC: 4f15d16s2, trival- ent). For all subsequent lanthanoids of the first half-series, the divalent VEC is more stable than the trivalent one, ensured by the increasing exchange-related penalty on removing an electron from the 4f shell. The exchange penalty peaks in Eu, whose 4f shell is half-filled (4f75d06s2). The first element of the second half-series Gd assumes the trivalent VEC (4f75d16s2), because it would otherwise begin to fill the other spin component of the 4f shell without stabilising exchange energy. The next element Tb assumes the divalent VEC, unlike Ce, though, the trivalent VEC is less stable by only 0.04 eV. All subsequent lanthanoids of the second half-series assume the divalent VEC, analogously to the first half-series. The last lanthanoid, Lu, is trivalent, the only possible VEC due to exhaustion of the 4f shell [75]. Hund’s rules manifest themselves also in the third ionisation energies. All Ln2+ ions assume the divalent VEC (4f𝑛5d06s0), except La2+, Gd2+ and Lu2+, whose VECs are trivalent (4f05d16s0, 4f75d16s0 and 4f145d06s1, respectively). However, the energy difference to the divalent state for La2+ and Gd2+ is small compared the ionisation energies. The peculiar dependence between the third ionisation energies and the atomic numbers can completely be explained by three contributions: 1) The exchange penalty, which increases the ionisation energy proportional to the number of 4f electrons with same spin. It is the most effective contribution and gives rise to the zigzag pattern with maxima at Eu and Yb. 2) The paring energy, which destabilises the second half shell and thus reduces the ionisation energies of the second half-series by a constant offset. 3) The orbital energy from the change in total angular momentum, which mod- ulates the zigzag patterns by wavy slopes with inflexion points at quarter and tree-quarters fillings of the 4f shell. If all three contributions are subtracted, the residual of the third ionisation energies would follow a quite smooth curve with a positive slope, which accords with the increasing charge of the nucleus [75]. A5 Appendix B B.2 The RESi2 – x bulk phases 𝑎 𝑏 𝑐 𝑐/𝑎 𝑉/𝑁Er 𝜇Er Δ𝜇Er (Å) (Å) (Å) (Å3) (eV) (meV) PBE ort-AlB2-ErSi2 4.24 3.84 3.91 0.923 56.8 −1.484 266 hex-AlB2-ErSi2 – b 3.76 4.14 1.101 50.8 −1.750 0 tet-ThSi2-ErSi2 3.89 3.89 14.99 3.852 56.7 −1.500 251 ort-ThSi2-ErSi2 – b (1) 3.92 3.98 13.25 3.376 51.8 −1.708 43 ort-ThSi2-ErSi2 – b (2) 3.89 4.02 13.31 3.422 52.0 −1.658 92 PBEsol ort-AlB2-ErSi2 4.18 3.85 3.83 0.917 54.7 −1.739 259 hex-AlB2-ErSi2 – b 3.72 4.10 1.102 49.0 −1.998 0 tet-ThSi2-ErSi2 3.85 3.85 14.79 3.838 54.9 −1.750 247 ort-ThSi2-ErSi2 – b (1) 3.86 3.95 13.09 3.393 49.8 −1.942 56 ort-ThSi2-ErSi2 – b (2) 3.83 3.98 13.18 3.444 50.1 −1.888 110 Exp. reference hex-AlB2-ErSi2 – x 3.78 4.09 1.082 50.6 ort-ThSi2-ErSi2 – x — — — — — Table B.1: Lattice parameters and relative Er chemical potentials of ErSi2 – x in the stoi- chiometric and the vacancy-populated AlB2 and ThSi2 phases. The unit cells were optim- ised by code-level relaxations with different xc-functionals. Experimental reference: [85]. TbSi2 – x: Tab. 4.6. 𝑑a(Å) 𝑑b(Å) 𝑑c(Å) PBE hex-AlB2-TbSi2 2.37 (mm) ort-AlB2-TbSi2 2.36/2.42 (mm) hex-AlB2-TbSi2 – b 2.43 (mv) tet-ThSi2-TbSi2 2.36 (mm) 2.36 (mm) 2.43 (mm) ort-ThSi2-TbSi2 – b (1) 2.49 (vv) 2.44 (mv) 2.36 (mm) 2.39 (mv) ort-ThSi2-TbSi2 – b (2) 2.50 (vv) 2.47 (vv) 2.37 (mm) 2.37 (mv) PBEsol ort-AlB2-TbSi2 2.34/2.39 (mm) hex-AlB2-TbSi2 – b 2.42 (mv) tet-ThSi2-TbSi2 2.33 (mm) 2.33 (mm) 2.38 (mm) ort-ThSi2-TbSi2 – b (1) 2.48 (vv) 2.44 (mv) 2.35 (mm) 2.38 (mv) ort-ThSi2-TbSi2 – b (2) 2.49 (vv) 2.46 (vv) 2.36 (mm) 2.37 (mv) Table B.2: Si–Si nearest neighbour distances in TbSi2 – x optimised with different xc-function- als. Complement to Tab. 4.7. A6 B.2 The RESi2 – x bulk phases 𝑑a(Å) 𝑑b(Å) 𝑑c(Å) PBE hex-AlB2-ErSi2 2.36 (mm) ort-AlB2-ErSi2 2.35/2.42 (mm) hex-AlB2-ErSi2 – b 2.43 (mv) tet-ThSi2-ErSi2 2.35 (mm) 2.35 (mm) 2.42 (mm) ort-ThSi2-ErSi2 – b (1) 2.51 (vv) 2.45 (mv) 2.36 (mm) 2.39 (mv) ort-ThSi2-ErSi2 – b (2) 2.52 (vv) 2.48 (vv) 2.36 (mm) 2.37 (mv) PBEsol ort-AlB2-ErSi2 2.39/2.34 (mm) hex-AlB2-ErSi2 – b 2.42 (mv) tet-ThSi2-ErSi2 2.33 (mm) 2.33 (mm) 2.39 (mm) ort-ThSi2-ErSi2 – b (1) 2.51 (vv) 2.44 (mv) 2.35 (mm) 2.38 (mv) ort-ThSi2-ErSi2 – b (2) 2.51 (vv) 2.48 (vv) 2.35 (mm) 2.37 (mv) Table B.3: Si–Si nearest neighbour distances in ErSi2 – x optimised with different xc- functionals. Complement to Tab. 4.7. 𝑎 𝑐 𝑉/𝑁RE 𝛾 𝑑a Δ𝜇RE (Å) (Å) (Å3) (Å) (meV) TbSi2, PBE ort-AlB 4.23 (3.88)𝑎 4.00 58.3 124.7◦ (110.6◦)𝑎 2.36 (2.42)𝑎2 0 tr3-CaSi2 3.78 5.56 63.4 103.6◦ 2.41 −30 tr6-CaSi2 3.94 9.34 60.9 104.9◦, 118.9◦ 2.49, 2.29 −67 TbSi2, PBEsol ort-AlB2 4.17 (3.88)𝑎 3.92 56.2 124.0◦ (112.0◦)𝑎 2.34 (2.39)𝑎 0 tr3-CaSi2 3.75 5.50 61.6 103.2◦ 2.39 +54 tr6-CaSi2 3.91 9.22 59.1 104.6◦, 118.9◦ 2.47, 2.27 −34 ErSi2, PBE ort-AlB2 4.24 (3.84)𝑎 3.91 56.8 125.4◦ (109.3◦)𝑎 2.35 (2.42)𝑎 0 tr3-CaSi2 3.75 5.48 61.4 103.0◦ 2.40 −140 tr6-CaSi ◦2 3.92 9.17 59.1 104.6 , 118.8◦ 2.48, 2.28 −100 ErSi2, PBEsol ort-AlB 4.18 (3.85)𝑎 3.83 54.7 124.6◦ (110.8◦)𝑎 2.34 (2.39)𝑎2 0 tr3-CaSi2 3.72 5.42 59.7 102.6◦ 2.38 −63 tr6-CaSi2 3.89 9.05 57.4 104.3◦, 118.8◦ 2.46, 2.26 −70 Table B.4: Lattice parameters and relative RE chemical potentials of TbSi2 and ErSi2 in the stoichiometric AlB2 and CaSi2 phases. The unit cells were optimised by code-level relaxations with different xc-functionals. tr6-CaSi2-RESi2 has two independent M-star angles 𝛾 and Si–Si bond lengths 𝑑 . 𝑎a orthorhombic distortion. Complement to Tab. 4.8. A7 Appendix B (i) s-orbitals (ii) py-orbitals (iii) pz-orbitals (iv) px-orbitals (a) 0 -2 -4 -6 -8 -10 -12 hex-AlB2-TbSi2 (b) 0 -2 -4 -6 -8 -10 -12 hex-AlB2-Si2 (c) 0 -2 -4 -6 -8 -10 -12 ort-AlB2-TbSi2 (d) 0 -2 -4 -6 -8 -10 -12 ort-AlB2-Si2 M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A M Γ K|Γ A Figure B.1: Band structures of (a) hex-AlB2-TbSi2, (b) hex-AlB2-Si2, (c) ort-AlB2-TbSi2 and (d) ort-AlB2-Si2. The Si2 structures are the respective TbSi2 structures without the Tb atoms. The line width corresponds to the PAW projections of (i) the s orbitals, (ii) the p𝑦 orbitals, (iii) the p𝑧 orbitals and (iv) the p𝑥 orbitals of the Si atoms. (i) tet-ThSi2-TbSi2 (trivalent) (ii) AMO (iii) FMO (up) (iv) FMO (down) 0 -2 -4 -6 -8 -10 -12 Γ XY Σ ΓZ Y1Σ1 Z Γ XY Σ ΓZ Y1Σ1 Z Γ XY Σ ΓZ Y1Σ1 Z Γ XY Σ ΓZ Y1Σ1 Z Figure B.2: Band structures of (i) trivalent and (ii) – (iv) f-valent tet-ThSi2-TbSi2. The plotting mode is analogous to Fig. 4.9 (b,i). Colour: yellow→ Tb, blue→ Si. AlB2 structures: Fig. 4.11. A8 E − EF (eV) E − EF (eV) E − EF (eV) E − EF (eV) E − EF (eV) Appendix C C.1 The geometric interpretation of the Ahanorov-Bohm effect As already mentioned, geometric phases become manifest in the Aharonov-Bohm effect, although not always recognised as such. Berry transferred his considerations to the ABE by a gedankenexperiment consisting in a thin solenoid like that in Sec. 7.1 and a particle with charge 𝑞 confined to a box outside the magnetic field [127]. The origin of the box defines the three-dimensional parameter 𝝃, so a variation of 𝝃 corresponds to a movement of the box. Unlike that in [127], the approach presented here works with the operator algebra, independently of the representation. Let Ω be the space outside the solenoid and 𝑊 (𝝃) ⊂ Ω the box domain, a simply connected neighbourhood of 𝝃. Let 𝐻off(𝒑, 𝒙) be the Hamilton operator on𝑊 for the solenoid-off setup and 𝝃 = 0. Let |Ψoff(0)⟩ be the eigenstate of the corresponding stationary Schrödinger equation. 〉 〉 𝐻off(𝒑, 𝒙) Ψoff(0) = 𝐸 Ψoff(0) (C.102) Without loss of generality, the expectation values of 𝒙 and 𝒑may be 0, i.e. the particle is located at the b〈ox origin and do〉es not move〈on average. 〉 Ψoff(0) |𝒙 |Ψoff(0) = 0 ∧ Ψoff(0) |𝒑|Ψoff(0) = 0 (C.103) Now, two gauge transformations are being applied to the system: switching on the solenoid and moving the box. Since the box is simply connected and field-free, the solenoid field alters the wavefunction by a unitary transformation 𝑈D, also called Dirac factor [127]. The movement of the box transforms implies the unitary trans- formation linked to the translation group𝑈T (vide appendicem, A.1). 𝑈T = exp(−i 1 ℏ 𝝃 · 𝒑) (Translation) (C.104) 𝑈D = exp(i 𝑞 ℏ 𝜒(𝒙 | 𝝃)) (Dirac factor) with ∫ 𝒙 𝜒(𝒙 | 𝝃) B d𝒙′ · 𝑨(𝒙′) = 𝜒(𝒙) − 𝜒(𝝃) ⇔ 𝑨(𝒙) = ∇𝑥𝜒(𝒙 | 𝝃) 𝝃 A9 Appendix C Nota bene, the Dirac factor depends on 𝝃, which is chosen to be the origin of the box. The entire integration path has to stay within the box so as to ensure the integrability of 𝑨. The 𝝃-dependent term of 𝜒 must not be omitted, as it accommodates the pre- condition that 𝜒 is defined only locally. Omitting it would correspond to trying ti find a global scalar potential for 𝑨, which, of course, does not exist. By means of the explicit representations of both unitary transformations, the station- ary Schrödinger equation for the solenoid-on, boxed-moved setup, described by the Hamilton operator 𝐻on(𝒑, 𝒙 | 𝝃), can be solved. 𝑈 −1D 𝒙𝑈D = 𝒙, 𝑈T 𝒙𝑈 −1 T = 𝒙 + 𝝃 − (C.105a)𝑈D 𝒑𝑈 1 D = 𝒑 − 𝑞𝑨(𝒙), 𝑈 −1T 𝒑𝑈T = 𝒑 ⇒ 𝐻on(𝒑〉, 𝒙 | 𝝃) = 𝑈T𝑈D 𝐻〉off(𝒑, 𝒙)𝑈−1D 𝑈−1T = 𝐻off(𝒑 − 𝑞𝑨(𝒙 + 𝝃), 𝒙 +〉 𝝃) (C.105b)Ψon(𝝃) = 1 𝑞𝑈T𝑈D Ψoff(0) = exp(−i ℏ 𝝃 · 𝒑) exp(i ℏ 𝜒(𝒙 | 𝝃)) Ψoff(0) (C.105c) Inserting the transformed wavefunctions from Eq. (C.105c) into Eq. (7.55) yields the Berry connection A. In the last step, Eq. (C.103) is used. 𝑈〈 −1D i∇𝜉𝑈 D = i∇𝜉 +〉 𝑞 ℏ𝑨〈(𝝃), 𝑈−1T i∇𝜉𝑈T = i∇𝜉 + 1ℏ𝒑⇒ A(𝝃) = i〈 〉Ψon(𝝃) ∇𝜉 Ψon(𝝃) = Ψ (0) 𝑈−1𝑈−1off D T i∇ 𝜉𝑈T〉𝑈D Ψoff(0)= Ψoff(0) (i∇ 𝑞𝜉 + 𝑨(𝝃) + 1 (𝒑 + 𝑞𝑨(𝒙))) Ψoff(0) (C.106)ℏ ℏ = 𝑞ℏ𝑨(𝝃) + 𝑞 ℏ𝑨(0) The Berry connection A indeed equals the magnetic vector potential 𝑨 up to prefact- ors. Integrating it over a cl∮osed loop 𝛾 : 𝑡 ∮∈ [0, 1] → 𝝃(𝑡) ∈ Ω returns the observableBerry phase 𝜒B: 𝜒B = A(𝝃) · d𝝃 = 𝑞ℏ 𝑨(𝝃) · d𝝃 = 𝑞 𝑁 ℏΦ0 (C.107) 𝛾 𝛾 where 𝑁 is the winding number around the solenoid. In conclusion, the treatment of the ABE within the Berry approach results in the same findings as those obtained in Sec. 7.1. The Berry phase in Eq. (C.107) equals the ABE phase in Eq. (7.52), both being proportional to the flux through the solenoid. The Berry approach confirms that the central origin of the ABE consists in the non- existence of a global gauge for 𝑨 onΩ. Like Berry phases from closed loops in general, the ABE is observable, although the box containing the particle has never penetrated the magnetic field inside the solenoid. Moreover, Eq. (C.107) reveals an important property of geometric phases. Because the phase function exp(i𝜒B) is periodic, it remains unchanged after the addition of integer multiples of 2π to 𝜒B. Consequently, a magnetic flux Φ0 being integer mul- tiples of the quantum flux ℎ𝑞 results in the same phase factor as the zero-flux con- figuration and, thus, in the same interference patterns. Therefore, the presence of a magnetic field permits the prediction of interference patterns, but the reverse, i.e. A10 C.2 Proofs concerning the ℤ2 insulator the unambiguous deduction of the magnetic field from the interference patters, is impossible. Only continuous flux changes allow the observer to count the number of circles passed and thus to determine the absolute magnetic flux difference from the interference patterns. C.2 Proofs concerning the ℤ2 insulator Gauge invariance of the electric polarisation under𝑈 (?̄?) transformations The electric polarisation in terms of the Berry phase in Eq. (7.64) is invariant under 𝑈 (?̄?) transformations, which mix the states of the occupied Bloch space. Proof. Let 𝑆(𝝃) be a 𝝃-dependent, unitaryℂ?̄?×?̄?-matrix rotating the occupied subspace spanned by the Bloch factors {|𝑢𝜆𝛼(𝒌)⟩}. Then the Berry connection A(𝝃) transforms as follows: ( )ᵀ ( ) 𝑢(𝝃) | (𝜆 )(𝒔)⟩ (𝜆 ) (𝜆 ) (𝜆 )B 𝑢1 , . . . , |𝑢 (𝒔)⟩ , 𝑢 ∗(𝝃) B ⟨𝑢1 (𝒔) |, . . . , ⟨𝑢 (𝒔) |?̄? ?̄? ?̃?(𝝃) B∏𝑆(𝝃[)∫𝑢(𝝃)1 ] ∏ [∫ 1 ] Ã𝑖 (𝝃) = d𝑠 ?̃?∗(𝝃) i∇ ?̃?(𝝃) = d𝑠 𝑢∗(𝝃) 𝑆∗(𝝃) i∇ 𝑆(𝝃) 𝑢(𝝃)∏ 𝑗𝑗≠𝑖 [∫0 1 ] ∑︁ 𝜉 𝑗 𝜉𝑗≠𝑖 0 = ( [∫ d𝑠 ∗ 𝑗 ] 𝑢𝑘 (𝝃) 𝑆 ∗ 𝑘𝑙 (𝝃)) i∇𝜉 𝑆∏ ∑︁𝑙𝑚 (𝝃) 𝑢𝑚(𝝃) 𝑗≠𝑖 0 𝑘,𝑙,𝑚 1 = d𝑠 𝑢∗(𝑗 (𝝃) i∇ ∗ 𝜉 𝑢)(𝝃) + 𝑆𝑘𝑙 (𝝃) i∇𝜉 𝑆𝑙𝑘 (𝝃)𝑗≠𝑖 0 𝑘,𝑙 = A𝑖 (𝝃) + i tr 𝑆∗(𝝃) ∇ ( ) [128] 𝜉 𝑆 𝝃 = A𝑖 (𝝃) + i∇𝜉 ln(det(𝑆(𝝃))) = A𝑖 (𝝃) − ∇𝜉𝜃(𝝃) (C.108) where 𝜃(𝝃) is the phase of the determinant of 𝑆(𝝃). Thus, also a more general 𝑈 (?̄?) transformation basically reproduces Eq. (7.59), so band reordering within the occu- pied subspace during the parametric variation does not affect the results. □ Nota bene, while band reordering within the occupied manifold is allowed, the con- servation of the band gap separating the occupied subspace from its unoccupied com- plement is still obligatory for all values of 𝝃. In other words, the system strictly has to remain a band insulator for all 𝜆. A11 Appendix C Kramers link between the Berry connections of the components of a T-symmetric system Eqs. (7.79) state that the Berry connections of the components of a T-symmetric Bloch system are linked by the Kramers symmetry. For the proof, it is crucial to keep track of the chain rule when dealing with the gradient of a function like 𝑓 (−𝒌). This is facilitated by the following notation: ∇𝑘 𝑓 (−𝒌) means that the ∇-operator has not yet been executed and acts on the exterior 𝑓 and its argument −𝒌, the latter giving a minus sign. On the other hand, (∇𝑘 𝑓 ) (−𝒌) means that the gradient of 𝑓 is evaluated at −𝒌. The difference between the two definitions is thus a minus sign. By using the Kramers symmetry (Eq. (7.74)), the above remarks and the relations ⟨𝑎 | ∇𝑎⟩ = −⟨∇𝑎 | 𝑎〈⟩ and ⟨Θ 𝑎 | Θ𝑏⟩ = ⟨𝑏〉| 𝑎⟩, the Berry connection of component Ievaluated at −𝒌 can be rewr itten as follows:AI𝛼(−𝒌) = i 𝑢〈I𝛼(−𝒌) (〈 ∇ I𝑘𝑢 𝛼) (−𝒌) 〉 = −i 𝑢I (−𝒌) 𝛼 ∇ 𝑘 𝑢I𝛼(−𝒌) = −i Θ𝑢II 〉 𝛼 (𝒌) exp〈(−i𝜒𝛼(𝒌 ))∇𝑘 exp(i𝜒 (𝒌)) Θ𝑢II𝛼 𝛼 (𝒌)= +(∇ 𝜒 ) (𝒌) − i〈Θ𝑢II(𝒌) Θ( ∇ 𝑢II 〉𝑘 𝛼 𝛼 𝑘 𝛼)〉(𝒌) (C.109a)= +(∇ 𝜒 ) (𝒌) − i〈 (∇ 𝑢II𝑘 𝛼 𝑘 𝛼 ) (𝒌) 𝑢II𝛼 (𝒌)= +(∇ 𝜒 II II 〉𝑘 𝛼) (𝒌) + i 𝑢𝛼 (𝒌) (∇𝑘𝑢𝛼) (𝒌) = +(∇ 𝜒 ) (𝒌) + AII𝑘 𝛼 𝛼 (𝒌) For the second com〈 ponent , a similar expressiII 〈II II 〉 〈 on holds : 〉 A𝛼 (−𝒌) = i 𝑢𝛼 (−𝒌) (∇𝑘𝑢𝛼) (−𝒌) = −i 𝑢II(−𝒌) ∇ 𝑢II𝛼 𝑘 𝛼 (−𝒌) 〉 = −i Θ𝑢I𝛼(𝒌) exp(〈−i𝜒𝛼(−𝒌))∇𝑘 exp(i𝜒𝛼(−𝒌)) Θ𝑢I (𝒌) = −(∇ 𝜒 ) (−𝒌) − i〈 𝛼Θ𝑢I (𝒌 ) 〉𝑘 𝛼 𝛼 Θ(∇ I𝑘𝑢𝛼) (𝒌) (C.109b)= −(∇ 𝜒 ) (−𝒌) + i 𝑢I (𝒌) (∇ 𝑢I 〉𝑘 𝛼 𝛼 𝑘 𝛼) (𝒌) = −(∇𝑘𝜒𝛼) (−𝒌) + AI𝛼(𝒌) The sum and the difference between the partial connections of component I and II, defined asA+(𝒌) andA− (𝒌) in Eqs. (7.80a) and (7.81a), can be transformed into each other as follows: A− (𝒌) + A− (−𝒌) AI(𝒌) − AII(𝒌) + AI(−𝒌) − AII(−𝒌) Eqs. (C.109) ∑︁ = AI(𝒌) + AII(𝒌) − A∑︁I(𝒌) − AII(𝒌) + (∇𝑘𝜒𝛼) (𝒌) + (∇𝑘𝜒𝛼) (−𝒌) (C.110)𝛼 A+(𝒌) − A+(−𝒌) + (∇𝑘𝜒𝛼) (𝒌) + (∇𝑘𝜒𝛼) (−𝒌) 𝛼 A12 C.3 The antimony on bismuth selenide heterostructure C.3 The antimony on bismuth selenide heterostructure 1.0K Γ M K Γ M K Γ M(a) P01 (b) P0102 (c) P0120 0.5 D* TSS PSb 0.0 TSS D* R B BS B PSb -0.5 RBS -1.0 1.0 (d) P10 (e) P1012 (f) P1021 0.5 0.0 -0.5 -1.0 -1.0 -0.5 0.0 0.5 -1.0 -0.5 0.0 0.5 -1.0 -0.5 0.0 0.5 BS Sb k (Å−1) k (Å−1ǁ ǁ ) kǁ (Å−1) Figure C.1: Band structures of 1BL-Sb and 2BL-Sb structure models with different stacking optimised with DFT-D2+SOC. (a) P01, (b) P0102, (c) P0120, (d) P10, (e) P1012, (f) 1021. The line width, colouring and Fermi levels are analogous to Fig. 9.4. Arrows highlight special features. Stable structure models are highlighted bold. Reproduced from [P7]. A13 E − EF (eV) E − EF (eV) Bibliography [1] M. M. Waldrop, The chips are down for Moore’s law, Nature 530, 144–147 (2016), doi: 10.1038/ 530144a. [2] G. Kresse and J. Furthmüller, Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set, Comput. Mater. Sci. 6, 15–50 (1996), doi: 10.1016/ 0927-0256(96)00008-0. [3] G. Kresse and J. Furthmüller, Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set, Phys. Rev. B 54, 11169–11186 (1996), doi: 10.1103/PhysRevB.54. 11169. [4] P. Giannozzi et al., QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials, J. Phys. Condens. Matter 21, 395502 (2009), doi: 10.1088/ 0953-8984/21/39/395502. [5] P. Giannozzi et al., Advanced capabilities for materials modelling with Quantum ESPRESSO, J. Phys. Condens. Matter 29, 465901 (2017), doi: 10.1088/1361-648X/aa8f79. [6] G. Czycholl, Theoretische Festkörperphysik Band 1: Grundlagen: Phononen und Elektronen in Kristallen, 4th ed. (Springer Spektrum, Berlin, Heidelberg, 2016), doi: 10.1007/978-3-662- 47141-8. [7] W. Kohn, Nobel Lecture: Electronic structure of matter – wave functions and density functionals, Rev. Mod. Phys. 71, 1253–1266 (1999), doi: 10.1103/RevModPhys.71.1253. [8] P. Hohenberg and W. Kohn, Inhomogeneous Electron Gas, Phys. Rev. 136, B864–B871 (1964), doi: 10.1103/PhysRev.136.B864. [9] W. Kohn and L. J. Sham, Self-Consistent Equations Including Exchange and Correlation Effects, Phys. Rev. 140, A1133–A1138 (1965), doi: 10.1103/PhysRev.140.A1133. [10] J. P. Perdew et al., Generalized Gradient Approximation Made Simple, Phys. Rev. Lett. 77, 3865– 3868 (1996), doi: 10.1103/PhysRevLett.77.3865. [11] J. P. Perdew et al., Restoring the Density-Gradient Expansion for Exchange in Solids and Surfaces, Phys. Rev. Lett. 100, 136406 (2008), doi: 10.1103/PhysRevLett.100.136406. [12] D. Hobbs et al., Fully unconstrained noncollinear magnetism within the projector augmented- wave method, Phys. Rev. B - Condens. Matter Mater. Phys. 62, 11556–11570 (2000), doi: 10. 1103/PhysRevB.62.11556. [13] M. Cococcioni, The LDA+U Approach: A Simple Hubbard Correction for Correlated Ground States, in Correl. electrons model. simul. Vol. 2, edited by E. Pavarini et al., 1st ed. (Verlag des Forschungszentrum Jülich, Jülich, 2012) Chap. 4, pp. 1–40. [14] S. L. Dudarev et al., Electron-energy-loss spectra and the structural stability of nickel oxide: An LSDA+U study, Phys. Rev. B 57, 1505–1509 (1998), doi: 10.1103/PhysRevB.57.1505. [15] A. Rohrbach et al., Electronic correlation effects in transition-metal sulfides, J. Phys. Condens. Matter 15, 979–996 (2003), doi: 10.1088/0953-8984/15/6/325. [16] A. I. Liechtenstein et al., Density-functional theory and strong interactions: Orbital ordering in Mott-Hubbard insulators, Phys. Rev. B 52, R5467–R5470 (1995), doi: 10.1103/PhysRevB.52. R5467. [17] A. F. Holleman et al., Anorganische Chemie Band 1: Grundlagen und Hauptgruppenelemente, 103rd ed. (De Gruyter, Berlin, Boston, 2016), doi: 10.1515/9783110495850. A15 Bibliography [18] A. F. Holleman et al., Anorganische Chemie Band 2: Nebengruppenelemente, Lanthanoide, Actin- oide, Transactinoide, 103rd ed. (De Gruyter, Berlin, Boston, 2016), doi: 10.1515/9783110495904. [19] E. van Lenthe et al., Relativistic regular two-component Hamiltonians, J. Chem. Phys. 99, 4597– 4610 (1993), doi: 10.1063/1.466059. [20] C. Chang et al., Regular Two-Component Pauli-Like Effective Hamiltonians in Dirac Theory, Phys. Scr. 34, 394–404 (1986), doi: 10.1088/0031-8949/34/5/007. [21] P. E. Blöchl, Projector augmented-wave method, Phys. Rev. B 50, 17953–17979 (1994), doi: 10. 1103/PhysRevB.50.17953. [22] G. Kresse and D. Joubert, From ultrasoft pseudopotentials to the projector augmented-wave method, Phys. Rev. B 59, 1758–1775 (1999), doi: 10.1103/PhysRevB.59.1758. [23] S. Steiner et al., Calculation of the magnetic anisotropy with projected-augmented-wave method- ology and the case study of disordered Fe1 − 𝑥Co𝑥 alloys, Phys. Rev. B 93, 224425 (2016), doi: 10. 1103/PhysRevB.93.224425. [24] F. London, Zur Theorie und Systematik der Molekularkräfte, Zeitschrift für Phys. 63, 245–279 (1930), doi: 10.1007/BF01421741. [25] J. P. Wagner and P. R. Schreiner, London’sche Dispersionswechselwirkungen in der Molekül- chemie - eine Neubetrachtung sterischer Effekte, Angew. Chemie 127, 12446–12471 (2015), doi: 10.1002/ange.201503476. [26] S. Grimme, Semiempirical GGA-type density functional constructed with a long-range dispersion correction, J. Comput. Chem. 27, 1787–1799 (2006), doi: 10.1002/jcc.20495. [27] S. Grimme et al., A consistent and accurate ab initio parametrization of density functional dis- persion correction (DFT-D) for the 94 elements H-Pu, J. Chem. Phys. 132, 154104 (2010), doi: 10. 1063/1.3382344. [28] S. Grimme et al., Effect of the damping function in dispersion corrected density functional theory, J. Comput. Chem. 32, 1456–1465 (2011), doi: 10.1002/jcc.21759. [29] A. Tkatchenko and M. Scheffler, Accurate Molecular Van Der Waals Interactions from Ground- State Electron Density and Free-Atom Reference Data, Phys. Rev. Lett. 102, 073005 (2009), doi: 10.1103/PhysRevLett.102.073005. [30] T. Bučko et al., Improved Density Dependent Correction for the Description of London Dispersion Forces, J. Chem. Theory Comput. 9, 4293–4299 (2013), doi: 10.1021/ct400694h. [31] T. Bučko et al., Extending the applicability of the Tkatchenko-Scheffler dispersion correction via iterative Hirshfeld partitioning, J. Chem. Phys. 141, 034114 (2014), doi: 10.1063/1.4890003. [32] K. N. Tu et al., Low Schottky barrier of rare-earth silicide on 𝑛-Si, Appl. Phys. Lett. 38, 626–628 (1981), doi: 10.1063/1.92457. [33] J. A. Knapp and S. T. Picraux, Epitaxial growth of rare-earth silicides on (111) Si, Appl. Phys. Lett. 48, 466–468 (1986), doi: 10.1063/1.96532. [34] P. Wetzel et al., Experimental band structure and Fermi surface of a two-dimensional Er silicide on Si(111), Solid State Commun. 82, 235–238 (1992), doi: 10.1016/0038-1098(92)90633-K. [35] S. Vandré et al., Flatband Conditions Observed for Lanthanide-Silicide Monolayers on 𝑛-Type Si(111), Phys. Rev. Lett. 82, 1927–1930 (1999), doi: 10.1103/PhysRevLett.82.1927. [36] F. A. D’Avitaya et al., Fabrication and structure of epitaxial Er silicide films on (111) Si, Appl. Phys. Lett. 54, 2198–2200 (1989), doi: 10.1063/1.101517. √ √ [37] M. Lohmeier et al., Interface structure of Si(111)-( 3× 3)R30°-ErSi2−𝑥, Surf. Sci. 345, 247–260 (1996), doi: 10.1016/0039-6028(95)00875-6. [38] P. Paki et al., Growth of a two-dimensional Er silicide on Si(111), Phys. Rev. B 45, 8490–8493 (1992), doi: 10.1103/PhysRevB.45.8490. [39] L. Stauffer et al., Electronic structure and interfacial geometry of epitaxial two-dimensional Er silicide on Si(111), Phys. Rev. B 47, 10555–10562 (1993), doi: 10.1103/PhysRevB.47.10555. A16 Bibliography [40] M. H. Tuilier et al., Interfacial structure of two-dimensional epitaxial Er silicide on Si(111), Phys. Rev. B 50, 2333–2338 (1994), doi: 10.1103/PhysRevB.50.2333. [41] M. Lohmeier et al., Atomic structure and thermal stability of two-dimensional Er silicide on Si(111), Phys. Rev. B - Condens. Matter Mater. Phys. 54, 2004–2009 (1996), doi: 10 . 1103 / PhysRevB.54.2004. [42] T. Kalka et al., STM study of epitaxial Dy silicides on Si(111) and Si(001) using ultra-sharp tips prepared by ion bombardment, Appl. Phys. A Mater. Sci. Process. 66, S1073–S1075 (1998), doi: 10.1007/s003390051300. [43] D. J. Spence et al., Medium-energy ion scattering studies of two-dimensional rare-earth silicides, Phys. Rev. B 61, 5707–5713 (2000), doi: 10.1103/PhysRevB.61.5707. [44] S. Vandré et al., Conservation of flatband conditions for DySi2 monolayers on 𝑛-type Si(111), Appl. Phys. Lett. 78, 2012–2014 (2001), doi: 10.1063/1.1360782. [45] H. Kitayama et al., Structure analysis of two-dimensional Holmium silicide by low energy elec- tron diffraction, Surf. Sci. 482-485, 1481–1486 (2001), doi: 10.1016/S0039-6028(01)00859-7. [46] D. Spence et al., Structural studies of two- and three-dimensional dysprosium silicides using medium-energy ion scattering, Surf. Sci. 512, 61–66 (2002), doi: 10.1016/S0039-6028(02)01573- X. [47] C. Rogero et al., Electronic structure and Fermi surface of two-dimensional rare-earth silicides epitaxially grown on Si(111), Phys. Rev. B 69, 045312 (2004), doi: 10.1103/PhysRevB.69.045312. [48] M. Tuilier et al., Structure of clean and H-saturated epitaxial two-dimensional Er silicide on Si(111) studied by SEXAFS, Surf. Sci. 555, 94–100 (2004), doi: 10.1016/j.susc.2004.02.023. [49] M. Wanke et al., Energy surfaces of rare-earth silicide films on Si(111), Surf. Sci. 603, 2808–2814 (2009), doi: 10.1016/j.susc.2009.07.026. [50] I. Engelhardt et al., Atomic structure of thin dysprosium-silicide layers on Si(111), Surf. Sci. 600, 755–761 (2006), doi: 10.1016/j.susc.2005.11.029. [51] M. Franz et al., Terbium induced nanostructures on Si(111), Surf. Sci. 637-638, 149–155 (2015), doi: 10.1016/j.susc.2015.03.026. [52] M. Franz et al., Growth and electronic properties of Tb silicide layers on Si(111), J. Vac. Sci. Technol. A Vacuum, Surfaces, Film. 34, 061503 (2016), doi: 10.1116/1.4964132. [53] S. Sanna et al., Rare-earth silicide thin films on the Si(111) surface, Phys. Rev. B 93, 195407 (2016), doi: 10.1103/PhysRevB.93.195407. [54] C. Preinesberger et al., Formation of dysprosium silicide wires on Si(001), J. Phys. D. Appl. Phys. 31, L43–L45 (1998), doi: 10.1088/0022-3727/31/12/001. [55] Y. Chen et al., Self-assembled growth of epitaxial erbium disilicide nanowires on silicon (001), Appl. Phys. Lett. 76, 4004–4006 (2000), doi: 10.1063/1.126848. [56] J. Nogami et al., Self-assembled rare-earth silicide nanowires on Si(001), Phys. Rev. B 63, 233305 (2001), doi: 10.1103/PhysRevB.63.233305. [57] C. Preinesberger et al., Structure of DySi2 nanowires on Si(001), J. Appl. Phys. 91, 1695–1697 (2002), doi: 10.1063/1.1430540. [58] Y. Chen et al., Growth and evolution of epitaxial erbium disilicide nanowires on Si(001), Appl. Phys. A Mater. Sci. Process. 75, 353–361 (2002), doi: 10.1007/s003390201315. [59] C. Preinesberger et al., Structure and electronic properties of dysprosium-silicide nanowires on vicinal Si(001), Appl. Phys. Lett. 87, 083107 (2005), doi: 10.1063/1.2032620. [60] H. W. Yeom et al., Robust One-Dimensional Metallic Band Structure of Silicide Nanowires, Phys. Rev. Lett. 95, 205504 (2005), doi: 10.1103/PhysRevLett.95.205504. [61] T. Kobayashi, Structural analysis of Er silicide nanowires on Si(001) using three-dimensional medium-energy ion scattering, Phys. Rev. B - Condens. Matter Mater. Phys. 75, 2–6 (2007), doi: 10.1103/PhysRevB.75.125401. A17 Bibliography [62] C. Zeng et al., Charge-order fluctuations in one-dimensional silicides, Nat. Mater. 7, 539–542 (2008), doi: 10.1038/nmat2209. [63] M. Wanke et al., Electronic properties of self-assembled rare-earth silicide nanowires on Si(001), Phys. Rev. B 83, 205417 (2011), doi: 10.1103/PhysRevB.83.205417. [64] M. Dähne and M. Wanke, Metallic rare-earth silicide nanowires on silicon surfaces, J. Phys. Condens. Matter 25, 014012 (2012), doi: 10.1088/0953-8984/25/1/014012. [65] S. Appelfeller et al., Tb silicide nanowire growth on planar and vicinal Si(001) surfaces, Surf. Sci. 641, 180–190 (2015), doi: 10.1016/j.susc.2015.07.001. [66] S. Appelfeller et al., The electronic structure of Tb silicide nanowires on Si(001), New J. Phys. 18, 113005 (2016), doi: 10.1088/1367-2630/18/11/113005. [67] S. Appelfeller et al., Refined structure model of rare earth silicide nanowires on Si(001), Appl. Phys. Lett. 114, 093104 (2019), doi: 10.1063/1.5086369. [68] M. Wanke et al., Structural and electronic properties of rare earth silicide nanowires on Si(557), Phys. Rev. B 79, 155428 (2009), doi: 10.1103/PhysRevB.79.155428. [69] M. Wanke et al., Electronic properties of dysprosium silicide nanowires on Si(557), J. Appl. Phys. 108, 064304 (2010), doi: 10.1063/1.3481456. [70] S. Appelfeller et al., Growth and characterization of Tb silicide nanostructures on Si(ℎℎ𝑘) sub- strates, Phys. Rev. Mater. 3, 126002 (2019), doi: 10.1103/PhysRevMaterials.3.126002. [71] N. G. Connelly et al., Nomenclature of Inorganic Chemistry (Royal Society of Chemistry, Cam- bridge, 2005). [72] B. Zhou et al., Global Potential of Rare Earth Resources and Rare Earth Demand from Clean Technologies, Minerals 7, 203 (2017), doi: 10.3390/min7110203. [73] H. Sicius, Seltenerdmetalle: Lanthanoide und dritte Nebengruppe, in Handb. der chem. elem. (Springer Spektrum, Berlin, Heidelberg, 2019), pp. 1–116, doi: 10.1007/978-3-662-55944- 4_18-2. [74] A. J. Kenyon, Erbium in silicon, Semicond. Sci. Technol. 20, R65–R84 (2005), doi: 10.1088/0268- 1242/20/12/R02. [75] P. F. Lang and B. C. Smith, Ionization Energies of Lanthanides, J. Chem. Educ. 87, 875–881 (2010), doi: 10.1021/ed100215q. [76] D. A. Johnson and P. G. Nelson, Valencies of the lanthanides, Found. Chem. 20, 15–27 (2018), doi: 10.1007/s10698-017-9291-6. [77] M. Luyckx et al., Silicon and Plants: Current Knowledge and Technological Perspectives, Front. Plant Sci. 8, 1–8 (2017), doi: 10.3389/fpls.2017.00411. [78] P. Treguer et al., The Silica Balance in the World Ocean: A Reestimate, Science (80-. ). 268, 375– 379 (1995), doi: 10.1126/science.268.5209.375. [79] K. S. Novoselov, Electric Field Effect in Atomically Thin Carbon Films, Science (80-. ). 306, 666– 669 (2004), doi: 10.1126/science.1102896. [80] H. Schäfer et al., Zintl Phases: Transitions between Metallic and Ionic Bonding, Angew. Chemie Int. Ed. English 12, 694–712 (1973), doi: 10.1002/anie.197306941. [81] J. A. Perri et al., Rare Earth Metal “Disilicides”, J. Phys. Chem. 63, 616–619 (1959), doi: 10.1021/ j150574a041. [82] J. A. Perri et al., Polymorphism of Rare Earth Disilicides, J. Phys. Chem. 63, 2073–2074 (1959), doi: 10.1021/j150582a030. [83] I. P. Mayer et al., Rare earth disilicides, J. Phys. Chem. 66, 693–696 (1962), doi: 10 . 1021 / j100810a028. [84] G. V. Samsonov, The chemistry of the silicides of the rare-earth elements, Russ. Chem. Rev. 31, 702–712 (1962), doi: 10.1070/RC1962v031n12ABEH001333. [85] I. Mayer et al., Dimorphism of rare earth disilicides, Inorg. Chem. 6, 842–844 (1967), doi: 10. 1021/ic50050a043. A18 Bibliography [86] I. Shidlovsky and I. Mayer, Mössbauer spectra of rare earth silicides and germanides, J. Phys. Chem. Solids 30, 1207–1213 (1969), doi: 10.1016/0022-3697(69)90378-3. [87] A. Iandelli et al., Valence fluctuations of ytterbium in silicon-rich compounds, J. Less Common Met. 64, 213–220 (1979), doi: 10.1016/0022-5088(79)90172-3. [88] E. Houssay et al., Metallurgical reinvestigation of rare earth silicides, Appl. Surf. Sci. 38, 156– 161 (1989), doi: 10.1016/0169-4332(89)90531-X. [89] P. Schobinger-Papamantellos et al., Crystallographic and magnetic structure of TbSi1.67 and TbSi1.67 − 𝛿, J. Magn. Magn. Mater. 79, 231–241 (1989), doi: 10.1016/0304-8853(89)90102-9. [90] J. Pierre et al., Magnetic structures of rare earth silicides RSi2 − 𝑥 (R = Nd, Ho, Dy), J. Magn. Magn. Mater. 81, 39–46 (1989), doi: 10.1016/0304-8853(89)90226-6. [91] S. Auffret et al., Crystallographic and magnetic structures of Er3Si5, Phys. B Condens. Matter 162, 271–280 (1990), doi: 10.1016/0921-4526(90)90022-M. [92] J. Pierre et al., Magnetic properties of rare earth silicide single crystals RSi2 − 𝑥 (R = Pr, Nd, Gd), J. Magn. Magn. Mater. 89, 86–96 (1990), doi: 10.1016/0304-8853(90)90711-X. [93] S. Auffret et al., Magnetic properties versus crystal structure in heavy rare-earth silicides RSi2 − 𝑥 , Phys. B Condens. Matter 173, 265–276 (1991), doi: 10.1016/0921-4526(91)90088-V. [94] L. Braicovich et al., Valence photoelectron spectroscopy of Gd silicides, Phys. Rev. B 41, 3123– 3127 (1990), doi: 10.1103/PhysRevB.41.3123. [95] L. Magaud et al., The effect of silicon vacancies on the electron structure of yttrium disilicide, J. Phys. Condens. Matter 4, 5399–5404 (1992), doi: 10.1088/0953-8984/4/24/009. [96] L. Magaud et al., Role of silicon vacancies in yttrium-disilicide compounds from ab initio calcu- lations, Phys. Rev. B 55, 13479–13484 (1997), doi: 10.1103/PhysRevB.55.13479. [97] J. Szade and M. Neumann, Electronic structure investigation of Gd intermetallics, J. Phys. Con- dens. Matter 11, 3887–3896 (1999), doi: 10.1088/0953-8984/11/19/308. [98] J. Yang et al., Thermal expansion coefficients of rare earth metal disilicides and their influence on the growth of disilicide nanowires, Appl. Phys. A 82, 39–42 (2006), doi: 10.1007/s00339-005- 3336-8. [99] G. W. Peng et al., Mechanisms of silicon diffusion in erbium silicide, Phys. Rev. B 75, 125319 (2007), doi: 10.1103/PhysRevB.75.125319. [100] M. Zavala Aké et al., First-principles calculations of the atomic and electronic properties of group IIIA disilicides in AlB2 type structures, Solid State Sci. 10, 355–361 (2008), doi: 10.1016/ j.solidstatesciences.2007.10.002. [101] M. Nentwich et al., Structure variations within RSi2 and R2TSi3 silicides. Part I. Structure over- view, Acta Crystallogr. Sect. B Struct. Sci. Cryst. Eng. Mater. 76, 177–200 (2020), doi: 10.1107/ S2052520620001043. [102] V. Eremenko et al., Phase diagram of the holmium-silicon binary system and physical properties of holmium silicides up to 1050 °C, J. Alloys Compd. 219, 181–184 (1995), doi: 10.1016/0925- 8388(94)05035-X. [103] S. Luzan et al., Phase diagram of the binary erbium-silicon system and physical properties of erbium silicides up to 1050 °C, J. Alloys Compd. 239, 77–82 (1996), doi: 10.1016/0925-8388(95) 01977-4. [104] S. P. Luzan et al., Phase equilibria in the erbium-silicon system, Powder Metall. Met. Ceram. 36, 24–29 (1997), doi: 10.1007/BF02684247. [105] M. Huang et al., Experimental investigation and thermodynamic modeling of the Gd-Si system, J. Alloys Compd. 441, 94–100 (2007), doi: 10.1016/j.jallcom.2006.09.068. [106] L. V. Goncharuk et al., Thermodynamic properties of holmium silicides HoSi1 − 𝑥 , Ho5Si4, and Ho5Si3, Powder Metall. Met. Ceram. 49, 690–694 (2011), doi: 10.1007/s11106-011-9289-x. [107] V. R. Sidorko and N. P. Gorbachuk, Thermodynamic and thermochemical characteristics of er- bium silicides, Powder Metall. Met. Ceram. 51, 93–101 (2012), doi: 10.1007/s11106-012-9402-9. A19 Bibliography [108] F. Geenen et al., On the formation and structural properties of hexagonal rare earth (Y, Gd, Dy, Er and Yb) disilicide thin films, J. Alloys Compd. 611, 149–156 (2014), doi: 10.1016/j.jallcom. 2014.04.142. [109] H. Okamoto, Ho-Si (holmium-silicon), J. Phase Equilibria 17, 370–371 (1996), doi: 10 . 1007 / BF02665570. [110] H. Okamoto, Er-Si (erbium-silicon), J. Phase Equilibria 18, 403–403 (1997),doi: 10.1007/s11669- 997-0073-z. [111] H. Okamoto, Si-Tb (Silicon-Terbium), J. Phase Equilibria 21, 500–500 (2000), doi: 10 . 1361 / 105497100770339824. [112] H. Okamoto, Gd-Si (Gadolinium-Silicon), J. Phase Equilibria Diffus. 30, 213–214 (2009), doi: 10. 1007/s11669-009-9477-2. [113] J. Kim and I.-H. Jung, Critical evaluation and thermodynamic optimisation of the Si-RE systems: Part II. Si-RE system (RE = Gd, Tb, Dy, Ho, Er, Tm, Lu and Y), J. Chem. Thermodyn. 81, 273–297 (2015), doi: 10.1016/j.jct.2014.08.014. [114] A. B. Gokhale and G. J. Abbaschian, The Si-Sm (Silicon-Samarium) system, Bull. Alloy Phase Diagrams 9, 582–585 (1988), doi: 10.1007/BF02881960. [115] A. B. Gokhale et al., The Nd-Si (Neodymium-Silicon) system, Bull. Alloy Phase Diagrams 10, 246– 251 (1989), doi: 10.1007/BF02877504. [116] F. D. Murnaghan, The Compressibility of Media under Extreme Pressures, Proc. Natl. Acad. Sci. 30, 244–247 (1944), doi: 10.1073/pnas.30.9.244. [117] H. J. Monkhorst and J. D. Pack, Special points for Brillouin-zone integrations, Phys. Rev. B 13, 5188–5192 (1976), doi: 10.1103/PhysRevB.13.5188. [118] P. E. Blöchl et al., Improved tetrahedron method for Brillouin-zone integrations, Phys. Rev. B 49, 16223–16233 (1994), doi: 10.1103/PhysRevB.49.16223. [119] R. P. Feynman, Forces in Molecules, Phys. Rev. 56, 340–343 (1939),doi: 10.1103/PhysRev.56.340. [120] A. Kokalj, Xcrysden – a new program for displaying crystalline structures and electron densities, Journal of Molecular Graphics and Modelling 17, 176–179 (1999), doi: https://doi.org/10. 1016/S1093-3263(99)00028-5. [121] S. Fahy and D. R. Hamann, Electronic and structural properties of CaSi2, Phys. Rev. B 41, 7587– 7592 (1990), doi: 10.1103/PhysRevB.41.7587. [122] K. H. Janzon et al., Notizen: Zur Struktur der Phase CaSi2, Zeitschrift für Naturforsch. B 23, 1544–1544 (1968), doi: 10.1515/znb-1968-1128. [123] W. Setyawan and S. Curtarolo, High-throughput electronic band structure calculations: Chal- lenges and tools, Comput. Mater. Sci. 49, 299–312 (2010), doi: 10.1016/j.commatsci.2010.05. 010. [124] S. G. Mayo et al., Band unfolding made simple, J. Phys. Condens. Matter 32, 205902 (2020), doi: 10.1088/1361-648X/ab6e8e. [125] K. von Klitzing et al., New Method for High-Accuracy Determination of the Fine-Structure Con- stant Based on Quantized Hall Resistance, Phys. Rev. Lett. 45, 494–497 (1980), doi: 10.1103/ PhysRevLett.45.494. [126] K. von Klitzing, The quantized Hall effect, Rev. Mod. Phys. 58, 519–531 (1986), doi: 10.1103/ RevModPhys.58.519. [127] M. . V. . Berry, Quantal phase factors accompanying adiabatic changes, Proc. R. Soc. London. A. Math. Phys. Sci. 392, 45–57 (1984), doi: 10.1098/rspa.1984.0023. [128] R. Resta, Macroscopic polarization in crystalline dielectrics: the geometric phase approach, Rev. Mod. Phys. 66, 899–915 (1994), doi: 10.1103/RevModPhys.66.899. [129] Y. Aharonov and D. Bohm, Significance of Electromagnetic Potentials in the Quantum Theory, Phys. Rev. 115, 485–491 (1959), doi: 10.1103/PhysRev.115.485. A20 Bibliography [130] Y. Aharonov and D. Bohm, Further Considerations on Electromagnetic Potentials in the Quantum Theory, Phys. Rev. 123, 1511–1524 (1961), doi: 10.1103/PhysRev.123.1511. [131] L. Vaidman, Role of potentials in the Aharonov-Bohm effect, Phys. Rev. A 86, 040101 (2012), doi: 10.1103/PhysRevA.86.040101. [132] Y. Aharonov et al., Comment on “Role of potentials in the Aharonov-Bohm effect”, Phys. Rev. A 92, 026101 (2015), doi: 10.1103/PhysRevA.92.026101. [133] L. Vaidman, Reply to “Comment on ‘Role of potentials in the Aharonov-Bohm effect’”, Phys. Rev. A 92, 026102 (2015), doi: 10.1103/PhysRevA.92.026102. [134] P. Pearle and A. Rizzi, Quantum-mechanical inclusion of the source in the Aharonov-Bohm ef- fects, Phys. Rev. A 95, 052123 (2017), doi: 10.1103/PhysRevA.95.052123. [135] W. Franz, Elektroneninterferenz im Magnetfeld, in Verhandlungen der Deutschen Physikalis- chen Gesellschaft, Vol. 20 (1939). [136] W. Ehrenberg and R. E. Siday, The Refractive Index in Electron Optics and the Principles of Dynamics, Proc. Phys. Soc. Sect. B 62, 8–21 (1949), doi: 10.1088/0370-1301/62/1/303. [137] K.-H. Goldhorn and H.-P. Heinz, Mathematik für Physiker 1: Grundlagen aus Analysis und Lin- earer Algebra, 1st ed., Springer-Lehrbuch (Springer, Berlin, Heidelberg, 2007), doi: 10.1007/ 978-3-540-48768-5. [138] G. Möllenstedt and W. Bayh, Kontinuierliche Phasenschiebung von Elektronenwellen im kraft- feldfreien Raum durch das magnetische Vektorpotential eines Solenoids, Phys. J. 18, 299–305 (1962), doi: 10.1002/phbl.19620180702. [139] A. Tonomura et al., Evidence for Aharonov-Bohm effect with magnetic field completely shielded from electron wave, Phys. Rev. Lett. 56, 792–795 (1986), doi: 10.1103/PhysRevLett.56.792. [140] R. M. Martin, Comment on calculations of electric polarization in crystals, Phys. Rev. B 9, 1998– 1999 (1974), doi: 10.1103/PhysRevB.9.1998. [141] R. D. King-Smith and D. Vanderbilt, Theory of polarization of crystalline solids, Phys. Rev. B 47, 1651–1654 (1993), doi: 10.1103/PhysRevB.47.1651. [142] L. Fu and C. L. Kane, Time reversal polarization and a 𝑍2 adiabatic spin pump, Phys. Rev. B 74, 195312 (2006), doi: 10.1103/PhysRevB.74.195312. [143] D. J. Thouless et al., Quantized Hall Conductance in a Two-Dimensional Periodic Potential, Phys. Rev. Lett. 49, 405–408 (1982), doi: 10.1103/PhysRevLett.49.405. [144] C. L. Kane and E. J. Mele, 𝑍2 Topological Order and the Quantum Spin Hall Effect, Phys. Rev. Lett. 95, 146802 (2005), doi: 10.1103/PhysRevLett.95.146802. [145] F. D. M. Haldane, Model for a Quantum Hall Effect without Landau Levels: Condensed-Matter Realization of the "Parity Anomaly", Phys. Rev. Lett. 61, 2015–2018 (1988), doi: 10 . 1103 / PhysRevLett.61.2015. [146] X.-L. Qi et al., Topological field theory of time-reversal invariant insulators, Phys. Rev. B 78, 195424 (2008), doi: 10.1103/PhysRevB.78.195424. [147] L. Fu and C. L. Kane, Topological insulators with inversion symmetry, Phys. Rev. B 76, 045302 (2007), doi: 10.1103/PhysRevB.76.045302. [148] B. W. Roberts, Three Myths about Time Reversal in Quantum Theory, Philos. Sci. 84, 315–334 (2017), doi: 10.1086/690721. [149] P. de Marcillac et al., Experimental detection of α-particles from the radioactive decay of natural bismuth, Nature 422, 876–878 (2003), doi: 10.1038/nature01541. [150] H. Lind et al., Structure and bonding properties of (Bi2Se3)𝑚 (Bi2)𝑛 stacks by first-principles density functional theory, Phys. Rev. B 72, 184101 (2005), doi: 10.1103/PhysRevB.72.184101. [151] G. P. Voutsas et al., The crystal structure of antimony selenide, Sb2Se3, Zeitschrift für Krist. - Cryst. Mater. 171, 261–268 (1985), doi: 10.1524/zkri.1985.171.14.261. [152] A. C. Stergiou and P. J. Rentzeperis, The crystal structure of arsenic selenide, As2Se3, Zeitschrift für Krist. - Cryst. Mater. 173, 185–192 (1985), doi: 10.1524/zkri.1985.173.14.185. A21 Bibliography [153] S. Nakajima, The crystal structure of Bi2Te3 − 𝑥Se𝑥 , J. Phys. Chem. Solids 24, 479–485 (1963), doi: 10.1016/0022-3697(63)90207-5. [154] H. Zhang et al., Topological insulators in Bi2Se3, Bi2Te3 and Sb2Te3 with a single Dirac cone on the surface, Nat. Phys. 5, 438–442 (2009), doi: 10.1038/nphys1270. [155] O. Ü. Aktürk et al., Single-layer crystalline phases of antimony: Antimonenes, Phys. Rev. B 91, 235446 (2015), doi: 10.1103/PhysRevB.91.235446. [156] G. Wang et al., Atomically Thin Group V Elemental Films: Theoretical Investigations of Anti- monene Allotropes, ACS Appl. Mater. Interfaces 7, 11490–11496 (2015), doi: 10.1021/acsami. 5b02441. [157] R. Flammini et al., Evidence of β-antimonene at the Sb/Bi2Se3 interface, Nanotechnology 29, 065704 (2018), doi: 10.1088/1361-6528/aaa2c4. [158] C. S. Barrett et al., The crystal structure of antimony at 4.2, 78 and 298 °K, Acta Crystallogr. 16, 451–453 (1963), doi: 10.1107/S0365110X63001262. [159] J. C. Y. Teo et al., Surface states and topological invariants in three-dimensional topological in- sulators: Application to Bi1 − 𝑥Sb𝑥 , Phys. Rev. B 78, 045426 (2008), doi: 10.1103/PhysRevB.78. 045426. [160] P. Zhang et al., Topological and electronic transitions in a Sb(111) nanofilm: The interplay between quantum confinement and surface effect, Phys. Rev. B 85, 201410 (2012), doi: 10.1103/ PhysRevB.85.201410. [161] X. Luo et al., First-principles investigations of the atomic, electronic, and thermoelectric proper- ties of equilibrium and strained Bi2Se3 and Bi2Te3 including van der Waals interactions, Phys. Rev. B 86, 184111 (2012), doi: 10.1103/PhysRevB.86.184111. [162] A. S. Hewitt et al., Coexisting Bi and Se surface terminations of cleaved Bi2Se3 single crystals, J. Vac. Sci. Technol. B, Nanotechnol. Microelectron. Mater. Process. Meas. Phenom. 32, 04E103 (2014), doi: 10.1116/1.4873689. [163] T. Förster et al., Two-dimensional topological phases and electronic spectrum of Bi2Se3 thin films from GW calculations, Phys. Rev. B 92, 201404 (2015), doi: 10.1103/PhysRevB.92.201404. [164] T. Förster et al., GW calculations for Bi2Te3 and Sb2Te3 thin films: Electronic and topological properties, Phys. Rev. B 93, 205442 (2016), doi: 10.1103/PhysRevB.93.205442. [165] E. Frantzeskakis et al., Trigger of the Ubiquitous Surface Band Bending in 3D Topological Insu- lators, Phys. Rev. X 7, 041041 (2017), doi: 10.1103/PhysRevX.7.041041. [166] S. H. Kim et al., Atomically Abrupt Topological 𝑝-𝑛 Junction, ACS Nano 11, 9671–9677 (2017), doi: 10.1021/acsnano.7b03880. [167] S. H. Kim et al., Topological phase transition and quantum spin Hall edge states of antimony few layers, Sci. Rep. 6, 33193 (2016), doi: 10.1038/srep33193. [168] K.-H. Jin et al., Band structure engineering of topological insulator heterojunctions, Phys. Rev. B 93, 075308 (2016), doi: 10.1103/PhysRevB.93.075308. [169] J. G. Analytis et al., Bulk Fermi surface coexistence with Dirac surface state in Bi2Se3: A com- parison of photoemission, Phys. Rev. B 81, 205407 (2010), doi: 10.1103/PhysRevB.81.205407. [170] I. Aguilera et al., GW study of topological insulators Bi2Se3, Bi2Te3, and Sb2Te3: Beyond the perturbative one-shot approach, Phys. Rev. B 88, 045206 (2013), doi: 10.1103/PhysRevB.88. 045206. [171] S. LaShell et al., Spin Splitting of an Au(111) Surface State Band Observed with Angle Resolved Photoelectron Spectroscopy, Phys. Rev. Lett. 77, 3419–3422 (1996), doi: 10.1103/PhysRevLett. 77.3419. [172] L. Fu, Hexagonal Warping Effects in the Surface States of the Topological Insulator Bi2Te3, Phys. Rev. Lett. 103, 266801 (2009), doi: 10.1103/PhysRevLett.103.266801. [173] J. Tersoff and D. R. Hamann, Theory of the scanning tunneling microscope, Phys. Rev. B 31, 805– 813 (1985), doi: 10.1103/PhysRevB.31.805. A22 Webpages [174] K.-H. Goldhorn and H.-P. Heinz, Mathematik für Physiker 3: Partielle Differentialgleichungen - Orthogonalreihen - Integraltransformationen, 1st ed., Springer-Lehrbuch (Springer, Berlin, Heidelberg, 2008), doi: 10.1007/978-3-540-76334-5. Webpages [W1] Bald fahren autonome Fahrzeuge in Deutschland, https://www.faz.net/aktuell/wirtschaft/ bald - fahren - autonome - fahrzeuge - auf - deutschen - strassen - 17827948 . html (visited on 24/05/2022). [W2] THE INVENTION OF THE COMPUTER, https : / / www . hnf . de / en / permanent - exhibition / exhibition-areas/the-invention-of-the-computer.html (visited on 14/04/2022). [W3] TIMELINE, https : / / www . computerhistory . org / siliconengine / timeline/ (visited on 15/04/2022). [W4] All Nobel Prizes in Physics, https://www.nobelprize.org/prizes/lists/all-nobel-prizes- in-physics/ (visited on 04/03/2022). [W5] 22 nm process, https://en.wikipedia.org/wiki/22_nm_process (visited on 18/04/2022). [W6] 5 nm process, https://en.wikipedia.org/wiki/5_nm_process (visited on 18/04/2022). [W7] Intel Now Packs 100 Million Transistors in Each Square Millimeter, https://spectrum.ieee. org/nanoclast/semiconductors/processors/intel-now-packs-100-million-transistors- in-each-square-millimeter (visited on 18/04/2022). [W8] Mit Höchstleistungsrechnern und Data Analytics gegen das Coronavirus, https : / / www . gesundheitsindustrie- bw.de/fachbeitrag/aktuell/mit- hoechstleistungsrechnern- und- data-analytics-gegen-das-coronavirus (visited on 13/04/2022). [W9] Storage, https://home.cern/science/computing/storage (visited on 13/04/2022). [W10] PERFORMANCE DEVELOPMENT, https://top500.org/statistics/perfdevel/ (visited on 13/04/2022). [W11] TOP500 Meanderings: Sluggish Performance Growth May Portend Slowing HPC Market, https: //www.top500.org/news/top500-meanderings-sluggish-performance-growth-may-portend- slowing-hpc-market/ (visited on 13/04/2022). [W12] TOP 10 Sites for November 2020, https://top500.org/lists/top500/2020/11/ (visited on 13/04/2022). [W13] Stromverbrauch der privaten Haushalte nach Haushaltsgrößenklassen, https : / / www . destatis.de/DE/Themen/Gesellschaft-Umwelt/Umwelt/UGR/private-haushalte/Tabellen/ stromverbrauch-haushalte.html (visited on 24/05/2022). [W14] Halbleiter-Knappheit – Wie der Chipmangel überwunden werden kann, https://www.faz.net/ aktuell/wirtschaft/digitec/chipmangel- wie- das- problem- ueberwunden- werden- kann- 17682182.html (visited on 24/05/2022). [W15] Ende der Chip-Engpässe könnte noch auf sich warten lassen, https : / / www . faz . net / agenturmeldungen/dpa/ende-der-chip-engpaesse-koennte-noch-auf-sich-warten-lassen- 18027141.html (visited on 24/05/2022). [W16] HPE Apollo (Hawk), https://www.hlrs.de/solutions/systems/hpe-apollo-hawk (visited on 09/06/2022). [W17] La Grande Arche de La Défense, https://fr.wikipedia.org/wiki/Arche_de_la_D%C3%A9fense# /media/Fichier:Grande_Arche_F%C3%A9vrier_2020.jpeg (visited on 24/03/2022). [W18] Heilen mit Antimon: Von der Chemiatrie zur Chemotherapie, https://www.pharmazeutische- zeitung.de/inhalt-10-2000/titel-10-2000/ (visited on 21/05/2022). A23 Publication list [P1] S. Sanna, U. Gerstmann, E. Rauls, Y. Li, M. Landmann, A. Riefer, M. Rohrmüller, N. Vollmers, M. Witte, R. Hölscher, A. Lücke, C. Braun, S. Neufeld, K. Holtgrewe and W. G. Schmidt, Sur- face Charge of Clean LiNbO3 Z-Cut Surfaces, in High perform. comput. sci. eng. ‘14 (Springer International Publishing, Cham, 2015), pp. 163–178, doi: 10.1007/978-3-319-10810-0_12. [P2] A. Lücke, U. Gerstmann, S. Sanna, M. Landmann, A. Riefer, M. Rohrmüller, N. J. Vollmers, M. Witte, E. Rauls, R. Hölscher, C. Braun, S. Neufeld, K. Holtgrewe and W. G. Schmidt, Solving the Scattering Problem for the P3HT On-Chain Charge Transport, in High perform. comput. sci. eng. ´15 (Springer International Publishing, Cham, 2016), pp. 155–170, doi: 10.1007/978-3-319- 24633-8_10. [P3] I. Miccoli, F. Edler, H. Pfnür, S. Appelfeller, M. Dähne, K. Holtgrewe, S. Sanna, W. G. Schmidt and C. Tegenkamp, Atomic size effects studied by transport in single silicide nanowires, Phys. Rev. B 93, 125412 (2016), doi: 10.1103/PhysRevB.93.125412. [P4] S. Sanna, C. Dues, U. Gerstmann, E. Rauls, D. Nozaki, A. Riefer, M. Landmann, M. Rohrmüller, N. J. Vollmers, R. Hölscher, A. Lücke, C. Braun, S. Neufeld, K. Holtgrewe and W. G. Schmidt, Submonolayer Rare Earth Silicide Thin Films on the Si(111) Surface, in High perform. comput. sci. eng. ´16 (Springer International Publishing, Cham, 2017), pp. 163–175, doi: 10.1007/978- 3-319-47066-5_12. [P5] C. Hogan, K. Holtgrewe, F. Ronci, S. Colonna, S. Sanna, P. Moras, P. M. Sheverdyaeva, S. Ma- hatha, M. Papagno, Z. S. Aliev, M. Babanly, E. V. Chulkov, C. Carbone and R. Flammini, Tem- perature Driven Phase Transition at the Antimonene/Bi2Se3 van der Waals Heterostructure, ACS Nano 13, 10481–10489 (2019), doi: 10.1021/acsnano.9b04377. [P6] K. Holtgrewe, S. Appelfeller, M. Franz, M. Dähne and S. Sanna, Structure and one-dimensional metallicity of rare-earth silicide nanowires on Si(001), Phys. Rev. B 99, 214104 (2019), doi: 10. 1103/PhysRevB.99.214104. [P7] K. Holtgrewe, S. K. Mahatha, P. M. Sheverdyaeva, P. Moras, R. Flammini, S. Colonna, F. Ronci, M. Papagno, A. Barla, L. Petaccia, Z. S. Aliev, M. B. Babanly, E. V. Chulkov, S. Sanna, C. Hogan and C. Carbone, Topologization of 𝛽-antimonene on Bi2Se3 via proximity effects, Sci. Rep. 10, 14619 (2020), doi: 10.1038/s41598-020-71624-4. [P8] S. Appelfeller, K. Holtgrewe, M. Franz, L. Freter, C. Hassenstein, H.-F. Jirschik, S. Sanna and M. Dähne, Continuous crossover from two-dimensional to one-dimensional electronic properties for metallic silicide nanowires, Phys. Rev. B 102, 115433 (2020), doi: 10.1103/PhysRevB.102. 115433. [P9] K. Holtgrewe, C. Hogan and S. Sanna, Evolution of Topological Surface States Following Sb Layer Adsorption on Bi2Se3, Materials (Basel). 14, 1763 (2021), doi: 10.3390/ma14071763. [P10] C. Hogan, P. Lechifflart, S. Brozzesi, S. Voronovich-Solonevich, A. Melnikov, R. Flammini, S. Sanna and K. Holtgrewe, Theoretical study of stability, epitaxial formation, and phase trans- formations of two-dimensional pnictogen allotropes, Phys. Rev. B 104, 245421 (2021), doi: 10. 1103/PhysRevB.104.245421. [P11] K. Holtgrewe, F. Ziese, J. Bilk, M. N. Pionteck, K. Eberheim, F. Bernhardt, C. Dues and S. Sanna, Tuning the Conductivity of Metallic Nanowires by Hydrogen Adsorption, in High perform. com- put. sci. eng. ’20 (Springer International Publishing, Cham, 2021), pp. 133–146, doi: 10.1007/ 978-3-030-80602-6_9. [P12] S. Sanna, J. Plaickner, K. Holtgrewe, V. M. Wettig, E. Speiser, S. Chandola and N. Esser, Spectro- scopic Analysis of Rare-Earth Silicide Structures on the Si(111) Surface, Materials (Basel). 14, 4104 (2021), doi: 10.3390/ma14154104. [P13] Z. Mamiyev, C. Fink, K. Holtgrewe, H. Pfnür and S. Sanna, Enforced Long-Range Order in 1D Wires by Coupling to Higher Dimensions, Phys. Rev. Lett. 126, 106101 (2021), doi: 10.1103/ PhysRevLett.126.106101. [P14] C. Fink, F. A. Pfeiffer, K. Holtgrewe and S. Sanna, Au adsorption on stepped Si(ℎℎ𝑘)-Au surfaces, Surf. Sci. 718, 122010 (2022), doi: 10.1016/j.susc.2021.122010. A24 Danksagung Ich möchte mich hiermit bei all denjenigen bedanken, die mich auf meinem Wege bis zum Abschluss der Promotion begleitet haben. Zuallererst gilt ein großer Dank meinem Doktorvater Prof. Simone Sanna, der mich immer kräftig unterstützt hat. Er hat sich nie davor gescheut, mir auch schwierige Aufgaben zu übertragen, und mir gleichzeitig den Freiraum gegeben, sie entspre- chend gründlich zu bearbeiten. Insbesondere die Tatsache, dass er zum Beginn mei- ner Promotion den Ruf an die JLU bekommen und folglich mit mir und Dr. Christof Dues zusammen die Arbeitsgruppe gegründet hat, war eine Herausforderung, auch aus organisatorischer Sicht, die sich im Nachhinein als wertvolle Erfahrung erwie- sen hat. Bei Christof möchte ich mich in diesem Zuge auch sehr herzlich bedanken, ebenso bei allen anderen Mitgliedern der AG Sanna, für gute fachliche Gespräche und ein hervorragendes Arbeitsklima, auch extra laborem. Ich möchte mich sehr herzlich bei Prof. Wolf Gero Schmidt bedanken, der während meines Studiums an der Universität Paderborn mein Interesse an der theoretischen Materialphysik geweckt hat und mich bis zum Masterabschluss und darüber hinaus bestens unterstützt hat. I want to thank Dr. Conor Hogan, who was my supervisor during my ERASMUS+ re- search internship at the university “Tor Vergata”, Rome, Italy. The collaboration with him was very precious and fruitful. Furthermore, I want to thank all other colla- borators with whom I worked on the RESi2 nanowires and the antimony on Bi2Se3 heterostructures. Ein ganz großer Dank gilt meinen Eltern, die schon früh mein Interesse an der Na- turwissenschaft geweckt und mich in meinen Vorhaben immer unterstützt haben. Des Weiteren gilt mein Dank auch dem Rest meiner Familie und meinen Freunden dafür, dass sie mir immer Halt gegeben haben. Ich möchte mich bei der Deutschen Forschungsgemeinschaft (DFG) für die finanzielle Unterstützung bedanken. Ferner möchte ich mich bei den HPC Einrichtungen bedan- ken, deren Rechenzeiten die Ergebnisse dieser Arbeit ermöglicht haben: das Hoch- schulrechenzentrum der JLU Gießen, das Hochschulrechenzentrum der TU Darm- stadt und das Höchstleistungsrechenzentrum Stuttgart. A25