The Genomic Annotation in Livestock for positional candidate LOci (GALLO) is an R package that provides a straightforward environment for gene and QTL annotation and for integrating data from multiple sources. QTL enrichment analyses can be performed directly in GALLO using the output of the QTL annotation step. GALLO also provides a set of functions for graphical visualization of the annotation, comparison, integration, and QTL enrichment results. GALLO is therefore useful for annotation, for identifying hidden patterns across datasets, for mining previously reported associations, and for efficiently investigating the genetic architecture of complex traits.
Depends: R (>= 4.0.0)
Imports: circlize, DT, data.table, doParallel, dplyr, dynamicTreeCut, ggplot2, graphics, grDevices, foreach, lattice, parallel, RColorBrewer, rtracklayer, stats, stringr, unbalhaar, utils, igraph, visNetwork
License: GPL-3
.gtf files for gene annotation can be found, for example, on the Ensembl FTP website: https://www.ensembl.org/info/data/ftp/index.html
.gff files for QTL annotation can be found in the Animal QTLdb: https://www.animalgenome.org/QTLdb/
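Both GTF and GFF are nine-column, tab-delimited text formats, so an annotation file can be inspected with base R before it is passed to GALLO. The following is a minimal sketch, not GALLO's own importer: the file contents, the column names, and the helper `get_attribute()` are made up for illustration.

```r
# Write a tiny, made-up GTF file so the example is self-contained.
gtf_lines <- c(
  "#!genome-build example",
  paste("1", "ensembl", "gene", "1000", "5000", ".", "+", ".",
        "gene_id \"GENE0001\"; gene_name \"abc1\"; gene_biotype \"protein_coding\";",
        sep = "\t"),
  paste("2", "ensembl", "gene", "200", "900", ".", "-", ".",
        "gene_id \"GENE0002\"; gene_biotype \"lncRNA\";",
        sep = "\t")
)
gtf_path <- tempfile(fileext = ".gtf")
writeLines(gtf_lines, gtf_path)

# GTF/GFF records have nine tab-separated columns; header lines start with "#".
# quote = "" keeps the double quotes inside the attribute column intact.
gtf <- read.delim(
  gtf_path,
  header = FALSE,
  comment.char = "#",
  quote = "",
  stringsAsFactors = FALSE,
  col.names = c("chr", "source", "type", "start", "end",
                "score", "strand", "phase", "attributes"),
  colClasses = c("character", "character", "character", "integer", "integer",
                 "character", "character", "character", "character")
)

# Hypothetical helper: extract one key from GTF-style attributes (key "value";).
# Returns NA where the key is absent. A simple matcher, adequate for a sketch.
get_attribute <- function(attributes, key) {
  pattern <- paste0(".*", key, " \"([^\"]*)\".*")
  ifelse(grepl(pattern, attributes), sub(pattern, "\\1", attributes), NA_character_)
}

gtf$gene_id   <- get_attribute(gtf$attributes, "gene_id")
gtf$gene_name <- get_attribute(gtf$attributes, "gene_name")

print(gtf[, c("chr", "start", "end", "strand", "gene_id", "gene_name")])
```

Chromosome names are read as character on purpose, since real annotation files mix numeric names with labels such as X or MT.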
To install the package, the following command can be used in R:
install.packages("GALLO")
import_gff_gtf: Takes a .gtf or .gff file and imports it into a data frame
find_genes_qtls_around_markers: Takes a data frame with candidate markers and/or regions (haplotypes, windows, CNVs, etc.) and searches for genes or QTLs within a specified interval
overlapping_among_groups: Takes a data frame with a column for genes, QTLs (or any other data) and a grouping column, and creates matrices with the overlapping information
plot_overlapping: Takes the output from overlapping_among_groups and creates a heatmap with the overlapping between groups
plot_qtl_info: Takes the output from find_genes_qtls_around_markers and creates plots for the frequency of each QTL type and trait
qtl_enrich: Takes the output from find_genes_qtls_around_markers and performs a QTL enrichment analysis
QTLenrich_plot: Takes the output from qtl_enrich and creates a bubble plot with the QTL enrichment results
relationship_plot: Takes the output from find_genes_qtls_around_markers and creates a chord plot with the relationship between groups
gene_pval: Estimates a gene-level p-value using the weighted Z-score approach and the meta-analysis approach with LD correlation coefficients
NetCen: Computes the centrality metrics for the nodes composing the network generated by the NetVis function
NetVis: Creates a dynamic network representing the relationship between two groups of variables
Nmarkers_SimpleM: Estimates the number of effective markers in a chromosome based on an adapted version of the simpleM methodology
Nseg_chr: Estimates the number of independent segments in a chromosome based on the effective population size
PleioChiTest: Computes a multi-trait test statistic for pleiotropic effects using summary statistics from association tests
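As a rough illustration of the kind of interval search described for find_genes_qtls_around_markers, the sketch below pairs each marker with the features that overlap a window around it. It is written in base R under assumed column names and is not the package's implementation; `find_features_near_markers()` and the toy data are hypothetical.

```r
# Hypothetical toy data: column names are illustrative, not GALLO's required format.
markers <- data.frame(
  snp = c("snp1", "snp2", "snp3"),
  chr = c("1", "1", "2"),
  pos = c(150000, 900000, 50000),
  stringsAsFactors = FALSE
)

genes <- data.frame(
  gene  = c("geneA", "geneB", "geneC", "geneD"),
  chr   = c("1", "1", "2", "2"),
  start = c(100000, 1200000, 10000, 700000),
  end   = c(140000, 1250000, 60000, 750000),
  stringsAsFactors = FALSE
)

# Return every marker/feature pair where the feature overlaps the window
# [pos - interval, pos + interval] on the same chromosome.
find_features_near_markers <- function(markers, features, interval = 0) {
  # All marker/feature combinations that share a chromosome
  pairs <- merge(markers, features, by = "chr")
  # Standard interval-overlap condition
  keep <- pairs$start <= pairs$pos + interval &
    pairs$end >= pairs$pos - interval
  out <- pairs[keep, c("snp", "chr", "pos", "gene", "start", "end")]
  out <- out[order(out$snp, out$gene), ]
  rownames(out) <- NULL
  out
}

# Search 50 kb upstream and downstream of each marker
hits <- find_features_near_markers(markers, genes, interval = 50000)
print(hits)
```

The merge-then-filter approach is easy to read but builds every same-chromosome pair, so it suits small candidate lists rather than genome-wide marker sets. Setting `interval = 0` keeps only features that contain the marker position itself.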
A tutorial for GALLO usage can be found at:
https://rpubs.com/pablo_bio/GALLO_vignette
For more information, suggestions, discussions, and bug reports, contact pfonseca@uoguelph.ca