GSR: Editing - sim1000G Simulator

You may request changes to this simulator using the Basic, Details, and Citations/Applications tabs. When you are finished, open the Submit tab. To return to the simulator view, click sim1000G. Please also take note of the GSR simulator privacy policy.
sim1000G
sim1000G integrates fully with R and can simulate existing variation from a single VCF file. It can also simulate arbitrary pedigrees.
We develop a new user-friendly and integrated R package, sim1000G, which simulates genomic regions for unrelated individuals or for families. The only input required is a single raw phased Variant Call Format (VCF) file. Haplotypes are extracted to compute linkage disequilibrium (LD) in the simulated region and then to generate new genotype data for unrelated individuals. The covariance across variants is used to preserve the LD structure of the original population. Pedigrees of arbitrary size are generated by modeling recombination events within sim1000G. Various simulation scenarios are presented, assuming unrelated individuals from a single population or from two distinct populations, or alternatively three-generation family data. sim1000G can capture allele frequency diversity, short- and long-range LD patterns, and subtle population differences in LD structure without the need for any tuning parameters.
simulator variants VCF pedigree
1.19
03-10-2018
03-10-2018
https://github.com/adimitromanolakis/sim1000G
apostolis@live.ca
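
The description above says that pedigrees are generated by modeling recombination events. As a rough illustration of that idea only, the sketch below passes parental haplotypes to a child with crossovers. It is written in plain Python rather than R, it is not sim1000G's code or API, and every function name in it is hypothetical. It also assumes a single uniform crossover probability between adjacent variants, whereas a real simulator would typically use a genetic map.

```python
import random


def sample_crossovers(n_sites, rate, rng):
    """Pick the sites k (1..n_sites-1) at which the copied strand switches.

    Assumption: each gap between adjacent sites recombines independently
    with the same probability `rate`.
    """
    return [k for k in range(1, n_sites) if rng.random() < rate]


def make_gamete(hap_a, hap_b, crossovers, start_with_a=True):
    """Copy alleles from one parental haplotype, switching at each crossover."""
    if len(hap_a) != len(hap_b):
        raise ValueError("parental haplotypes must cover the same sites")
    switch_points = set(crossovers)
    use_a = start_with_a
    gamete = []
    for site in range(len(hap_a)):
        if site in switch_points:
            use_a = not use_a
        gamete.append(hap_a[site] if use_a else hap_b[site])
    return gamete


def make_child(mother, father, rate, rng):
    """Each parent is a pair of haplotypes; the child gets one gamete from each."""
    child = []
    for hap_a, hap_b in (mother, father):
        crossovers = sample_crossovers(len(hap_a), rate, rng)
        child.append(make_gamete(hap_a, hap_b, crossovers, rng.random() < 0.5))
    return tuple(child)


def genotype(individual):
    """Count alternate alleles (0, 1, or 2) at each site."""
    return [a + b for a, b in zip(*individual)]


# Toy founders: alleles coded 0 (reference) and 1 (alternate).
rng = random.Random(1)
mother = ([0, 0, 0, 0, 0, 0], [1, 1, 1, 1, 1, 1])
father = ([0, 1, 0, 1, 0, 1], [1, 0, 1, 0, 1, 0])
child = make_child(mother, father, rate=0.2, rng=rng)
print(genotype(child))
```

Calling `make_child` repeatedly, with earlier children reused as parents, extends the same step to multi-generation pedigrees such as the three-generation families mentioned in the description.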

Attribute Tree Control

Step 1: Use the attribute tree to add new attributes or remove pre-selected attributes to describe the simulator.

  • Target
    • Type of Simulated Data
      • Genotype at Genetic Markers
      • Diploid DNA Sequence
      • Haploid DNA Sequence
      • RNA
      • Gene Expression
      • Sex Chromosomes
      • Mitochondrial DNA
      • Protein Sequence
      • Sequencing Reads
      • Phenotype
      • Single-Cell Sequencing
      • Bulk Sequencing
      • Proteomics
      • Chromatin Conformation
    • Variations
      • Biallelic Marker
      • Multiallelic Marker
      • Single Nucleotide Variation
      • Amino acid variation
      • Microsatellite
      • Insertion and Deletion
      • CNV
      • Inversion and Rearrangement
      • Alternative Splicing
      • Missing Genotypes
      • Genotype or Sequencing Error
      • Ionization
      • Other
  • Simulation Method
    • Standard Coalescent
    • Exact Coalescent
    • Machine Learning
    • Forward-time
    • Resample Existing Data
    • Phylogenetic
    • Gene dropping
    • Neural network
    • Other
  • Input
    • Data Type
      • Allele Frequencies
      • Empirical
      • Ancestral Sequence
      • Saved simulation
      • Reference genome
      • Other
    • File format
      • Arlequin
      • CREATE
      • Fstat
      • GDA
      • Genepop
      • MIGRATE
      • MS
      • SAM or BAM
      • NEXUS
      • Phylip
      • STRUCTURE
      • XML
      • Tree Sequence
      • Program Specific
      • Other
  • Output
    • Data Type
      • Genotype or Sequence
      • Phenotypic Trait
      • Individual Relationship
      • Phylogenetic Tree
      • Demographic
      • Mutation
      • Methylation
      • Gene Expression
      • Protein Expression
      • Linkage Disequilibrium
      • Diversity Measures
      • Fitness
      • Sequencing Reads
        • Illumina
        • Roche 454
        • SOLiD
        • IonTorrent
        • PacBio
        • Nanopore
        • Other
      • Other
    • File Format
      • Arlequin
      • Fasta or Fastq
      • Fstat
      • Genepop
      • Linkage
      • MIGRATE
      • MS
      • PED
      • Phylip
      • NEXUS
      • STRUCTURE
      • VCF
      • SAM or BAM
      • Tree Sequence
      • Program Specific
      • Other
    • Sample Type
      • Random or Independent
      • Sibpairs, Trios and Nuclear Families
      • Extended or Complete Pedigrees
      • Case-control
      • Longitudinal
      • Other
  • Phenotype
    • Trait Type
      • Binary or Qualitative
      • Quantitative
      • Multiple
    • Determinants
      • Single Genetic Marker
      • Multiple Genetic Markers
      • Sex-linked
      • Gene-Gene Interaction
      • Environmental Factors
      • Gene-Environment Interaction
  • Evolutionary Features
    • Demographic
      • Population Size Changes
        • Constant Size
        • Exponential Growth or Decline
        • Logistic Growth
        • Bottleneck
        • Carrying Capacity
        • User Defined
      • Gene Flow
        • Stepping Stone Models
        • Island Models
        • Continent-Island Models
        • Sex or Age-Specific Migration Rates
        • Influenced by Environmental Factors
        • Admixed Population
        • User-defined Matrix
        • Other
      • Spatiality
        • Discrete Models
        • Continuous Models
        • Landscape Factors
    • Life Cycle
      • Discrete Generation Model
      • Age structured
      • Overlapping Generation
      • User-Defined transition matrices
    • Mating System
      • Random Mating
      • Monogamous
      • Polygamous
      • Haplodiploid
      • Selfing
      • Age- or Stage-Specific
      • Assortative or Disassortative
      • Other
    • Fecundity
      • Constant Number
      • Randomly Distributed
      • Individually Determined
      • Influenced by Environment
      • Other
    • Natural Selection
      • Determinant
        • Single-locus
        • Multi-locus
        • Codon-based
        • Fitness of Offspring
        • Phenotypic Trait
        • Environmental Factors
      • Models
        • Directional Selection
        • Balancing Selection
        • Multi-locus models
        • Epistasis
        • Random Fitness Effects
        • Disruptive
        • Phenotype Threshold
        • Frequency-Dependent
        • Other
    • Recombination
      • Uniform
      • Varying Recombination Rates
      • Gene Conversion Allowed
    • Mutation Models
      • Two-allele Mutation Model
      • Markov DNA Evolution Models
      • k-Allele Model
      • Infinite-allele Model
      • Infinite-sites Model
      • Stepwise Mutation Model
      • Codon and Amino Acid Models
      • Indels and Others
      • Heterogeneity among Sites
      • Others
    • Events Allowed
      • Population Merge and Split
      • Varying Demographic Features
      • Population Events
      • Varying Genetic Features
      • Change of Mating Systems
      • Other
    • Other
      • Phenogenetic
      • Polygenic background
  • Interface
    • Command-line
    • Graphical User Interface
    • Integrated Development Environment
    • Script-based
    • Web-based
  • Development
    • Tested Platforms
      • Windows
      • Mac OS X
      • Linux and Unix
      • Solaris
      • Others
    • Language
      • C or C++
      • Java
      • R
      • Python
      • Perl
      • Visual Basic
      • Other
    • License
      • GNU Public License
      • BSD
      • Creative Commons
      • MIT
      • Other
  • GSR Certification
    • Accessibility
    • Documentation
    • Application
    • Support

Summary of Proposed Changes

Step 2: Review list of proposed attribute addition(s) and subtraction(s).

To Add

To Remove

Can't Find the Attribute You Are Looking For?

If you would like to propose an attribute that you cannot find in the tree above, or if you would like to add a clarification to one or more attributes for this simulator (e.g., a specific file format for attribute /Output/File Format/Other), please list them in the Additional Comment box of the Submit tab.

You may add citations by PMID, add citations by direct entry, remove citations (using the recycling bin icon), and edit citations (using the rarely seen edit icon) that were originally entered by direct entry.

Current Citations/Applications

[PubMed ID: 30646839] Dimitromanolakis A, Xu J, Krol A, Briollais L. sim1000G: a user-friendly genetic variant simulator in R for unrelated individuals and family-based designs. BMC Bioinformatics, 01-15-2019. https://www.ncbi.nlm.nih.gov/pubmed/?term=30646839 (Primary Citation)
[PubMed ID: 32912334] Gleason KJ, Yang F, Pierce BL, He X, Chen LS. Primo: integration of multiple GWAS and omics QTL summary statistics for elucidation of molecular mechanisms of trait-associated SNPs and detection of pleiotropy in complex traits. Genome Biol, 09-11-2020. https://www.ncbi.nlm.nih.gov/pubmed/?term=32912334 (Application)
[PubMed ID: 34512212] Choi YH, Briollais L, He W, Kopciuk K. FamEvent: An R Package for Generating and Modeling Time-to-Event Data in Family Designs. J Stat Softw, 03-01-2021. https://www.ncbi.nlm.nih.gov/pubmed/?term=34512212 (Application)
[PubMed ID: 35318325] Zhou D, Gamazon ER. Integrative transcriptomic, evolutionary, and causal inference framework for region-level analysis: Application to COVID-19. NPJ Genom Med, 03-22-2022. https://www.ncbi.nlm.nih.gov/pubmed/?term=35318325 (Application)