Next-generation sequencing (NGS) systems now produce more data than ever before. Additionally, a typical NGS workflow involves manual, time-consuming touchpoints for quality control, analysis setup, and results review. As a result, labs who perform NGS or other complex, high-volume processing of samples can be overwhelmed managing the workflows and data generated. To address these issues and simplify NGS research, we are happy to announce the new version of BaseSpace Sequence Hub. It is designed to enhance your laboratory’s efficiency and support the needs of high-throughput labs.
Included in this update are new features, including a biosample-centric data model that provides tracking of all biosample activity from lab preparation through analysis delivery. We’re also introducing the following features:
- New automation quality control features
- Automated app launches and workflows
- An updated Application Programming Interface (API) to help you streamline your next-generation sequencing (NGS) workflows
- An improved user interface that helps you access your data and perform functions more quickly
Biosample-centric Data Model
Our new biosample-centric data model enables easy tracking of all biosample activity from lab preparation through analysis delivery. Biosamples are the data containers that represent the original DNA source material. They are used to trace all sequencing activities, including lab preparation (with LIMS integration) sequencing runs, data analysis, and delivery of data.
The new data model centers on biosamples, the original source of DNA, so you can easily track all biosample activity from lab preparation, with optional laboratory information management system (LIMS) integration, to delivery of analysis results. Biosamples can be used as inputs to multiple sequencing runs, and they can contain multiple datasets, which can live within separate projects.
Important Note: Biosamples with the same name (Sample ID in the sample sheet) are automatically aggregated. The new features will aggregate all FASTQ data sets with the same Sample ID into a single biosample. It is important to name the samples in your sample sheet uniquely, otherwise they will be aggregated together. Learn more about automatic data aggregation here.
Automated Lane QC, App Launch, and Analysis QC
After sequencing, much of the work required to process biosamples can be automated in bulk. By setting up automation ahead of time using the command line interface (CLI), sequencing runs can be automatically passed or failed based on their sequencing quality, converted to FASTQ datasets, used as inputs in an app, and then be passed or failed based on their app metrics. Automation removes much of the time-consuming and error prone manual work of processing sequencing data into downstream results.
Improved User Interface
The updated interface provides quick access to all of your data from the My Data menu, while the new Action Toolbar contains new and improved app functions such as requeues, QC status changes, workflows, and collaboration tools.
The Analyses page provides a listing of all analyses in your account. The filters on this page help you quickly narrow your search for specific analyses by their current status.
The Projects and Runs pages function the same as before, providing quick access to all of your sequencing projects and instrument runs.
Advanced Automation and Integration Toolset
Alongside our updated data model, we’ve introduced version 2 of the API, which enables you to interact directly with your data and integrate systems together with your BaseSpace Sequence Hub account.
The new automation tools in version 2 of the API:
- Correspond to the new biosample-centric data model
- Improve performance and robustness of the solution
- Include new documentation
Note: The version 1 API is still fully-supported and maintained, although we are actively focusing primarily on version 2 API development. The version 1 API documentation is maintained here.
Version 2 of BaseSpaceCLI has been built using the version 2 API. BaseSpace CLI can be leveraged to read data from your BaseSpace Sequence Hub account and create new data by uploading data and launching apps. In addition, the new BaseSpace CLI can be used to create automated analysis workflows, and import biosamples.
BaseMount is a command-line tool which allows you to explore through runs, projects, biosamples, and datasets, and interact directly with the associated files exactly as you would with any other file system.
We hope the new functionality of BaseSpace Sequence Hub enables your lab to boost productivity and discovery. Visit our updated Support Site to learn more about how to use all the new features and tools. Please contact us at firstname.lastname@example.org if you have any questions or comments.
The BaseSpace Sequence Hub Team
- CLI documentation https://developer.basespace.illumina.com/docs/content/documentation/cli/cli-overview
- CLI automated workflow creation docs https://developer.basespace.illumina.com/docs/content/documentation/cli/cli-examples
- Link to v1 API docs https://developer.basespace.illumina.com/docs/content/documentation/rest-api/v1-api-reference
- Link to v2 API docs https://developer.basespace.illumina.com/docs/content/documentation/rest-api/api-reference
To date, most of what we know about our genome comes from studying populations of cells. Although few would argue with how far we have come to understand our genome, many researchers now realize that it may be just as important to fully examine the heterogeneity that exists within the population of cells. Evidence suggests that bulk sequencing methods can mask the contribution of individual cells. As a result, many researchers are turning to an evolving technique: single-cell sequencing.
Pioneered in the 1990s by James Eberwine2 and made more robust by the analytical sensitivity and specificity of next-generation sequencing (NGS) methods,3 single-cell sequencing enables researchers to examine the heterogeneity of cells, and promises to reveal what role individual cells play in disease and complex biological systems.
How? For every cell sequenced, researchers have a comprehensive map of the transcriptome that can be analyzed in several of different ways to characterize cells at single-cell resolution. Currently, 3 primary applications stand out:
Join us for our BaseSpace® Suite Informatics Summit in Copenhagen, DK on 31 May and 1 June. Immediately after the European Society for Human Genetics (ESHG) annual meeting, attendance at the summit is FREE. Learn more about our informatics tools and how they’re designed to help you transform complex genomic data into meaningful insights quickly and easily.
Why attend a BaseSpace Suite Summit?
- Share your perspectives on applying informatics tools in your lab
- Attend informative sessions and learn how other customers use informatics
- Get important product information for BaseSpace Clarity LIMS, BaseSpace Sequence Hub, BaseSpace Variant Interpreter (Beta), BaseSpace Cohort Analyzer, and BaseSpace Correlation Engine
- Learn best practices, including how an integrated approach to informatics can expedite workflows
- Connect with your peers
Learn more by clicking on the “Summit” dropdown above, or click here.
You’re invited to an exclusive informatics event
Advancing Precision Medicine efforts relies on the ability to make sense of a growing body of genomic data. The need for robust informatics tools and an integrated approach when it comes to acquiring, storing, distributing, and analyzing data is essential.
Join us for our BaseSpace® Suite Summit in Rochester, MN on October 3 and 4. Taking place immediately before the Individualizing Medicine Conference, registration is free. Learn more about our informatics tools and how they’re designed to help you transform complex genomic data into meaningful insights quickly and easily.
- Share your perspectives on applying informatics tools in your lab
- Attend your choice of sessions on informatics topics
- Learn best practices for laboratory information management, including how an integrated approach can expedite workflows
- Connect with your peers
Venue and Format
Lodging and Summit activities take place at the Kahler Grand Hotel in Rochester, MN. All day sessions on October 3 and the morning of October 4 include a variety of hands-on, introductory, and training sessions.
If you have questions, please contact us.
The new Enrichment v3.0 BaseSpace® App (formerly called Isaac Enrichment) introduces major improvements and new features including:
- Improved small variant calling
- Copy number variant (CNV) calling
- Structural variant calling
- Somatic/low-frequency variant calling
- Ability to start from FASTQ or BAM
- GRCh38 reference added
- Variant table CSV file including variant frequencies
- Improved variant annotation engine
- Improved metrics engine
Early next week all BaseSpace Sequence Hub accounts will track storage and compute usage.
Our Basic tier users will continue to enjoy 1 Terabyte of storage for free. On top of that we’re crediting your account with 250 free iCredits to use over 30 days to make sure you have a chance to explore all the apps we offer*.
We offer several compute and storage options to fit your needs. Let us know if you’d like to learn more about BaseSpace Sequence Hub tiers.
* iCredits are used to pay for app (compute and license fees) and additional storage charges. Any unused iCredits will expire at the end of this 30 day period.
Accelerates discovery by demonstrating genetic functional relationships and validating results
Illumina and Elsevier R&D Solutions have announced a collaboration to integrate use of Pathway Studio, Elsevier’s database of experimental data and disease models, with Illumina’s BaseSpace Correlation Engine platform, offering them as one seamless solution. Under the terms of the agreement, all licensed Correlation Engine users have free access to Pathway Viewer, an application that enables researchers to quickly view a snapshot of the functional relationships among genes. In addition, researchers are able to find top scoring canonical pathways that correlate most highly with the genes list.
Correlation Engine is a suite of applications that lets users discover the relationships among their results, with billions of correlations derived from the world’s largest curated library of genomic knowledge. The applications contain over 130,000 experimental comparisons from more than a half million samples, drawn from NCBI’s Gene Expression Omnibus and other open-access or controlled-access databases.