Single-cell RNA-seq (scRNA-seq) technologies can be divided into two categories, tag-based and full-length, based on their capture methods and quantitative nature.
In tag-based scRNA-seq, cells are separated by emulsion/droplets, and individual cells are given a unique cell barcode prior to sequencing. An example of tag-based scRNA-seq is 10x Genomics (Zheng et al. 2017).
In full-length scRNA-seq, cells are physically separated into individual wells of a plate and are often also sorted by other means (e.g., Fluorescence Activated Cell Sorting). With full-length scRNA-seq, each cell is sequenced individually and has its own fastq file. An example of full-length scRNA-seq is Smart-seq2 (Picelli et al. 2014).
For the purposes of this tutorial, we will focus on tag-based scRNA-seq, but it is important to keep in mind that the pre-processing steps and the biases to look out for in post-processing vary based on technology and how the cells are sorted.
For more extensive background on single-cell experimental methods, Predeus et al. also have a very good tutorial for scRNA-seq. We will also refer extensively to the the book Orchestrating Single-Cell Analysis with Bioconductor (Amezquita et al.).
Example: 10x Genomics (Zheng et al. 2017) Individual cells are separated by emulsion/droplets prior to cell lysis. Transcripts from each cell are then tagged with two barcodes: a cell-specific barcode and a Unique Molecular Identifier (UMI) (Islam et al. 2014). All transcripts from all cells are then pooled together and undergo PCR amplification and sequencing as if they are one sample.
Tagging of each transcript with a different UMI before amplification allows the identification of PCR duplicates, allowing control for PCR amplification errors and biases. Individual samples have two fastq files: one for the cell and UMI barcodes (R1) and another with the transcript sequence reads (R2).