Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
Understanding the evolutionary relationships among species is fundamental to biology, not only as an account of speciation events but also as the basis for comparative analyses of trait evolution. However, for deep phylogenetic relationships, different studies often show incongruence across analyses5,6. Large amounts of data may be required to resolve certain relationships yet others can remain recalcitrant even with genome-scale efforts, particularly for rapid radiations7,8. Phylogenomic incongruence can point to statistical and systematic errors but is also increasingly linked to complex biological processes that accompany rapid diversification9,10. Prime examples of this problem are the phylogenetic relationships among modern birds (Neornithes), which are inconsistently resolved even with large-scale datasets1,2,3,11. The widespread incongruences in evolutionary histories across avian genomes1,12,13 has left the phylogenetic relationships of major extant groups unclear and possibly irresolvable14.
Efforts to resolve high-level avian phylogeny face two major challenges. First, it is difficult to obtain large numbers of orthologous loci with suitable properties for phylogenetic analyses. Many studies have been limited to conserved genomic regions such as protein-coding sequence (exons) and ultraconserved elements (UCEs)2,25. Conserved regions exhibit complex patterns of sequence evolution: for example, selection to maintain protein structure and function places constraints on exon evolution12. Standard models of sequence evolution practical for large datasets exhibit poor fit to these regions, and model misspecifications probably result in topological discrepancies across data types1,12,13. Analysis of large numbers of loci does not remove, but can instead reinforce, biases introduced by model violations1,7. In principle, data types under lower selective pressure such as introns and intergenic regions are preferable; intergenic regions are arguably ideal because they are less probably under strong selection13. The second challenge is collecting genomic data from sufficient numbers of species, given that dense taxon sampling can improve phylogenetic estimation26,27. Thus, the debate in avian phylogenetics has revolved around the trade-off between using diverse loci extracted from entire genomes but for few species (one genome per taxonomic order)1 or using a smaller number of potentially biased loci sampled from more species2,3. Both approaches have shortcomings. The most compelling solution is also the most challenging: to create comprehensive datasets with whole genomes sampled across many taxa that inform on deeper timescales.
a, Topology simplified to orders with higher clade names following ref. 50. Numbers on branches represent local posterior probability if below 1. b, Time tree of all species. Grey bars represent 95% credible intervals for age estimation; dots indicate nodes with fossil calibrations; asterisks mark the three branches lacking full support. A tree with tip labels is shown in Extended Data Figs. 2 and 3.
Within Neoaves we resolve four major clades (Fig. 1a), three of which are Mirandornithes (grebes and flamingos), Columbaves (Columbimorphae (doves, sandgrouse and mesites) and Otidimorphae (cuckoos, bustards and turacos)), in addition to Telluraves (higher landbirds including Afroaves and Australaves). The fourth major clade is new and phenotypically diverse, containing Aequornithes (pelicans, tubenoses, penguins and loons), Phaethontimorphae (kagu, sunbittern and tropicbirds), Strisores (nightbirds, swifts and hummingbirds), Opisthocomiformes (hoatzin) and Cursorimorphae (shorebirds and cranes). This clade was supported in coalescent-based analyses of intergenic regions and UCEs, but not by exons, introns or in concatenated analysis of intergenic regions (Fig. 3d and Extended Data Fig. 4). We name this clade Elementaves because its lineages have diversified into terrestrial, aquatic and aerial niches, corresponding to the classical elements of earth, water and air, and several Phaethontimorphae have names derived from the sun, representing fire.
We found that exonic loci had less phylogenetic information and were more variable in their signal than the other data types (Extended Data Fig. 8d,e). Exons also scored highest in a measure of phylogenetic estimation difficulty (Extended Data Fig. 8f), indicating that their GTs are less reliable than those of other data types. To examine whether exons had a misleading signal, we restricted species-tree inference to GTs with more signal, less gappy alignments, greater clocklikeness and greater total length. Unlike intergenic regions, in which subsampling did not systematically change the species trees, the use of more informative, less gappy and more clocklike exons reduced incongruence between the resulting species trees and the main tree (Fig. 4b and Extended Data Fig. 8g). Thus exons yield phylogenetic trees that are less reliable. This conclusion is consistent with earlier analyses based on fewer genomes1,12,13,29. Our results indicate that the damaging effects of model violation and limited signal of exons are not offset by increased taxon sampling, as one might hope2,43.
We next evaluated how well the new phylogenetic tree reflects avian morphology, testing the expectation that closely related species should resemble one another. We found that our main tree fits morphological traits better than the topology of ref. 2, even when controlling for taxon sampling (Fig. 5a), including the larger number of Passeriformes in our study (supplementary results given in Supplementary Information). Simulations considering the misplacement of taxa and convergent scenarios suggested that the higher phylogenetic signal in this comparison was more probably attributed to topological differences (Extended Data Fig. 11a).
Across the tree we found that rapid evolutionary change occurred at the origin of major clades, throughout the diversification of some clades and along some isolated branches. Passeriformes exhibited a burst of body mass evolution at their most recent common ancestor (Extended Data Fig. 11b). Rates of evolution in relative brain size were more variable, with rapid evolutionary change in some clades (for example, Telluraves, vocal learning lineages such as parrots, corvids and hummingbirds)49. In addition, our data showed that the early burst was followed by sustained varied rates within these groups, especially in Passeri (Extended Data Fig. 11c).
Relationships along the backbone of Neoaves have long been contentious, with various analyses yielding incongruent results. At the heart of the disagreements has been a long-standing question: is it better to sample many taxa at a few loci (typically conserved regions, such as exons and UCEs) or sample many loci widely across the genome, even if available from fewer species? We can finally answer this question because our data provide both dense taxon sampling and many loci across the whole genome. We observed that the number of loci, in addition to sequence types (for example, exons, introns, intergenic regions or chromosome type), had a much greater effect on the inferred tree than taxon sampling. Nevertheless, increased taxon sampling was crucial in inferring more precise dates, and for studying traits, trajectories of population size and substitution rates. By focusing on intergenic regions, a source of data largely unused in the past, we minimized model violations and increased phylogenetic resolution. Nonetheless, our results also showed that several recalcitrant relationships remain, even with this wealth of data, due to challenges imposed by biological processes such as hybridization that are hard to model in deep time using phylogenetics. Overall, our results underscore the complexity of genome evolution and show methodologies that are likely to be useful for future phylogenomic studies focused on deep relationships.
Further details on methods are given in Supplementary Information. No statistical methods were used to predetermine sample size. The experiments were not randomized and investigators were not blinded to allocation during experiments and outcome assessment.
Collapsed GTs were summarized into a coalescent-based species tree using ASTRAL-MP65 v.5.14.5. Support was assessed using posterior probability. We also performed gene-only, multilocus bootstrapping (globalBS) for cases in which uncertainty is not local (for example, two placements many branches away both resulting in high quartet support), a scenario that can mislead local posterior probability support66. In addition we tested polytomy null hypotheses35 and evaluated the quartet score of the three alternative nearest-neighbour interchanges around each branch66. Quartet scores were visualized using DiscoVista67. We evaluated alternative species trees (for example, moving Phaethontimorphae) by scoring these trees against the same input GTs using ASTRAL.
The genome assemblies analysed in this study and their whole-genome alignment were part of a previous study4, and accession numbers are given as part of the Supplementary Data. Alignments, GTs and species trees, in addition to data files produced for their analysis and scripts to plot the figures, are available at -c8e5-4b89-8e8a-fe984ca89e4a (ref. 105). This repository also contains a file detailing contents and commands to use for individual and batch download of files. The study analysed morphological trait data from AVONET94 ( )106 and Dryad ( )97, recombination rates for chicken86 and time-calibrated species trees from ref. 1 ( )107 and ref. 2 (Avian-TimeTree.tre from )108. Source data are provided with this paper.
c80f0f1006