This function creates a GTF-style data frame by combining and formatting essential genomic annotation fields. Data must be pre-loaded using load_annotation() followed by create_GTF_df() and / or add_UTR(), create_full_GTF(), or create_reduced_GTF().

create_full_GTF(input)

Arguments

input

A data frame containing genomic data with the following columns: - `chr`: Chromosome identifier - `start`: Start position of the annotation - `end`: End position of the annotation - `strand`: Strand information ('+' or '-') - `transcript_name`: Name of the associated transcript - `transcript_id`: Unique identifier for the transcript - `gene_name`: Name of the associated gene - `gene_id`: Unique identifier for the gene - `annotationType`: Type of annotation (e.g., 'EXON', 'TRANSCRIPT') - `source`: Source of the transcript - `gene_id`: Unique identifier for the gene - `gene_name`: Name of the associated gene - `transcript_name`: Name of the associated transcript - `transcript_id`: Unique identifier for the transcript

Value

A GTF-style data frame including the first eight columns of the input and a new `combine` column. The `combine` column contains concatenated annotation fields in the format: `gene_id`, `gene_name`, `transcript_name`, and `transcript_id`.

Examples


# Run the function
gtf_data <- create_full_GTF(input_data)
#> Error: object 'input_data' not found