geposan/R/data.R

#' Information on included species from the Ensembl database.
#'
#' @format A [data.table] with the following columns:
#' \describe{
#' \item{id}{Unique species ID}
#' \item{name}{Human-readable species name}
#' \item{n_chromosomes}{Number of chromosomes}
#' \item{median_chromosome_length}{Median length of chromosomes}
#' }
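#' @examples
#' # Illustrative sketch only: one way the two chromosome summaries could be
#' # derived from a per-chromosome table. `toy_chromosomes`, its columns and
#' # its values are hypothetical and not part of the package.
#' toy_chromosomes <- data.table::data.table(
#'   species = c("a", "a", "a", "b", "b"),
#'   length = c(100, 200, 400, 50, 150)
#' )
#'
#' # One row per species: chromosome count and median chromosome length.
#' toy_chromosomes[, list(
#'   n_chromosomes = .N,
#'   median_chromosome_length = as.numeric(stats::median(length))
#' ), by = species]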
"species"

#' Information on human genes within the Ensembl database.
#'
#' This includes only genes on the primary suggested assembly of the human
#' nuclear DNA.
#'
#' @format A [data.table] with the following columns:
#' \describe{
#' \item{id}{Ensembl gene ID}
#' \item{name}{The gene's HGNC name}
#' \item{chromosome}{The human chromosome the gene is located on}
#' }
"genes"

#' Information on gene positions across species.
#'
#' This dataset contains, for each species, every known distance between a gene
#' and its nearest telomere. The data are sourced from Ensembl.
#'
#' @format A [data.table] with the following columns:
#' \describe{
#' \item{species}{Species ID}
#' \item{gene}{Gene ID}
#' \item{chromosome_name}{Chromosome name from the specified species}
#' \item{start_position}{Start position in base pairs}
#' \item{end_position}{End position in base pairs}
#' \item{distance}{Computed distance to nearest telomere}
#' }
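#' @examples
#' # Illustrative sketch only, not taken from the package's own code. It
#' # assumes the distance to the nearest telomere is the smaller of the two
#' # gaps between the gene and the chromosome ends. The `chromosome_length`
#' # column and all values below are hypothetical.
#' toy <- data.table::data.table(
#'   species = c("s1", "s1"),
#'   gene = c("g1", "g2"),
#'   chromosome_name = c("1", "1"),
#'   start_position = c(1000, 9000),
#'   end_position = c(2000, 9500),
#'   chromosome_length = c(10000, 10000)
#' )
#'
#' # Gap to the chromosome start versus gap to the chromosome end.
#' toy$distance <- pmin(
#'   toy$start_position,
#'   toy$chromosome_length - toy$end_position
#' )
#' toy$distance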
"distances"