You can combine toolbox functions to support common bioinformatics workflows. You can use ChIP-Seq data to identify transcription factors; analyze RNA-Seq data to identify differentially expressed genes; identify copy number variants and SNPs in microarray data; and classify protein profiles using mass spectrometry data.
Bioinformatics Toolbox provides algorithms and apps for Next Generation Sequencing (NGS), microarray analysis, mass spectrometry, and gene ontology. Using toolbox functions, you can read genomic and proteomic data from standard file formats such as SAM, FASTA, CEL, and CDF, as well as from online databases such as the NCBI Gene Expression Omnibus and GenBank. You can explore and visualize this data with sequence browsers, spatial heatmaps, and clustergrams. The toolbox also provides statistical techniques for detecting peaks, imputing values for missing data, and selecting features.
Bioinformatics Toolbox provides algorithms and visualization techniques for Next Generation Sequencing analysis. The toolbox enables you to analyze whole genomes while performing calculations at a base pair level of resolution. You can use the NGS browser to visualize and investigate short-read alignments using either single-end or paired-end short reads. You can also build custom analysis routines, as shown in the following examples.
The toolbox enables you to visualize sequences and alignments. You can view linear or circular maps of sequences annotated with GenBank features. You can visualize secondary structure diagrams of an RNA sequence. Interactive viewers let you explore and modify pairwise and multiple sequence alignments.
The toolbox enables you to create and edit phylogenetic trees. You can calculate pairwise distances between aligned or unaligned nucleotide or amino acid sequences using a broad range of similarity metrics such as Jukes-Cantor, p-distance, alignment-score, or a user-defined distance method. Phylogenetic trees are constructed using hierarchical linkage with a variety of techniques, including neighbor joining, single and complete linkage, and Unweighted Pair Group Method Average (UPGMA).
The toolbox supports weighting and rerooting trees, calculating subtrees, and calculating the canonical form of trees. The phylogenetic tree viewer lets you prune, reorder, and rename branches; explore distances; and read or write Newick-formatted files. You can also use the annotation tools in MATLAB to create presentation-quality trees.
The toolbox provides protein sequence analysis techniques, including routines for calculating properties of a peptide sequence such as atomic composition, isoelectric point, and molecular weight. You can determine the amino acid composition of protein sequences, cleave a protein with an enzyme, and create backbone plots and Ramachandran plots of PDB data. You can use the Sequence Tool to view the properties of an amino acid sequence or use the Molecule Viewer to display and manipulate 3D molecular structures.