Skip to contents

Input

protpipe_example_se
Example SummarizedExperiment for ProtPipe workflows
create_se()
This function takes a data frame of proteomics data and its corresponding sample metadata to construct a SummarizedExperiment object. It handles detection of intensity columns, validation, and synchronization of metadata.
create_se_from_olink()
Create a SummarizedExperiment Object from Olink NPX Data
create_se_from_soma()
Create a SummarizedExperiment Object from SomaScan Data
detect_intensity_cols()
Identify Numeric Columns by Index
convert_numeric_cols()
Safely Convert Character Columns to Numeric Type
olink_all_output()
Format Olink NPX Data into Data and Condition Tables
soma_all_output()
format soma adat into data and condition dataframes
run_protpipe_shiny()
Run the protpipe Shiny App

Quality Control

get_pg_counts()
Get Protein Group Counts Per Sample
plot_pg_counts()
Plot Protein Group Counts
plot_pg_intensities()
Plot Boxplots of Sample Intensity Distributions
get_CVs()
Calculate Coefficient of Variation (CV) for Protein Groups
plot_CVs()
Plot Coefficient of Variation (CV) Distributions
get_sample_correlation()
Calculate Pairwise Sample Correlations
plot_correlation_heatmap()
Plot a Sample Correlation Heatmap

Pre-Processing

lod_filter()
Filter Assay Based on Limit of Detection (LOD)
apply_min_intenisty()
Apply Limit of Detection Threshold
filter_unique_proteins()
Removes duplicate analytes
filter_overlap()
Retain proteins present in a specified group of a SummarizedExperiment object
filter_proteins_by_percent()
Filter Proteins by Percentage of Valid Values
filter_outlier_samples()
Filter Outlier Samples Based on Protein Counts
log2_transform()
Performs a log2 transform of protein intensity values
z_score()
Z-Score Normalization for Proteins Across Samples
mean_normalize()
Mean Normalization of Proteomics Data
median_normalize()
Median Normalization of Proteomics Data
impute()
Impute Missing Values with a Constant
impute_min()
Impute Missing Values with the Row Minimum
impute_left_dist()
Impute from a Down-Shifted Normal Distribution
batch_correct()
Correct for Batch Effects
generate_preprocessing_report()
Generate Markdown Report of Preprocessing Steps
has_step()
Check if a Processing Step has been Applied

Clustering / Dimensionality Reduction

get_PCs()
Calculate Principal Components Analysis (PCA)
plot_PCs()
Plot Principal Component Analysis Results
plot_hierarchical_cluster()
Plot a Hierarchical Clustering Dendrogram of Samples
get_umap()
Calculate UMAP Dimensionality Reduction
plot_umap()
Plot UMAP Dimensionality Reduction Results

Differential Expression

filter_features()
Helper function to filter out sparse proteins
do_limma_binary()
Perform limma differential expression on a SummarizedExperiment
do_t_test_binary()
Perform t-test differential expression on a SummarizedExperiment
do_comparison_continuous()
Perform limma differential expression for a continuous outcome
plot_volcano()
Plot a Volcano Plot for Differential Expression Results
plot_correlation_volcano()
Plot a Volcano Plot for Correlation Results
add_entrez()
Add Entrez Gene IDs to a Data Frame
read_ontology()
Read a Custom Ontology File
enrich_go()
Perform Gene Ontology (GO) Over-Representation Analysis (ORA)
enrich_kegg()
Perform KEGG Over-Representation Analysis (ORA)
enrich_terms()
Run Over-Representation Analysis for Custom Gene Sets
gse_go()
Perform Gene Ontology (GO) Gene Set Enrichment Analysis (GSEA)
gse_kegg()
Perform KEGG Gene Set Enrichment Analysis (GSEA)
gse_terms()
Run GSEA for Custom Gene Sets
enrich_pathways()
Perform Comprehensive GO and KEGG Pathway Enrichment Analysis

Abundance Profiling

compare_protein()
Creates a ggplot bar chart comparing the intensity of a single protein either across all samples or grouped by a condition.
plot_proteomics_heatmap()
Plot a Proteomics Heatmap