create_full_GTF.Rd
This function creates a GTF-style data frame by combining and formatting essential genomic annotation fields. Data must be pre-loaded using load_annotation() followed by create_GTF_df() and / or add_UTR(), create_full_GTF(), or create_reduced_GTF().
create_full_GTF(input)
A data frame containing genomic data with the following columns: - `chr`: Chromosome identifier - `start`: Start position of the annotation - `end`: End position of the annotation - `strand`: Strand information ('+' or '-') - `transcript_name`: Name of the associated transcript - `transcript_id`: Unique identifier for the transcript - `gene_name`: Name of the associated gene - `gene_id`: Unique identifier for the gene - `annotationType`: Type of annotation (e.g., 'EXON', 'TRANSCRIPT') - `source`: Source of the transcript - `gene_id`: Unique identifier for the gene - `gene_name`: Name of the associated gene - `transcript_name`: Name of the associated transcript - `transcript_id`: Unique identifier for the transcript
A GTF-style data frame including the first eight columns of the input and a new `combine` column. The `combine` column contains concatenated annotation fields in the format: `gene_id`, `gene_name`, `transcript_name`, and `transcript_id`.
# Run the function
gtf_data <- create_full_GTF(input_data)
#> Error: object 'input_data' not found