
Install
$ npm install --save @gmod/bam
Usage
const { BamFile } = require('@gmod/bam')
const t = new BamFile({
bamPath: 'test.bam',
})
var header = await t.getHeader()
var records = await t.getRecordsForRange('ctgA', 0, 50000)
The bamPath argument only works on nodejs. In the browser, you should pass
bamFilehandle with a generic-filehandle2 e.g. RemoteFile
const { RemoteFile } = require('generic-filehandle2')
const bam = new BamFile({
bamFilehandle: new RemoteFile('yourfile.bam'),
baiFilehandle: new RemoteFile('yourfile.bam.bai'),
})
Input are 0-based half-open coordinates (note: not the same as samtools view
coordinate inputs!)
Usage with htsget
Since 1.0.41 we support usage of the htsget protocol
Here is a small code snippet for this
const { HtsgetFile } = require('@gmod/bam')
const ti = new HtsgetFile({
baseUrl: 'http://htsnexus.rnd.dnanex.us/v1/reads',
trackId: 'BroadHiSeqX_b37/NA12878',
})
await ti.getHeader()
const records = await ti.getRecordsForRange(1, 2000000, 2000001)
Our implementation makes some assumptions about how the protocol is implemented,
so let us know if it doesn't work for your use case
Documentation
BAM constructor
The BAM class constructor accepts arguments
bamPath/bamUrl/bamFilehandle - a string file path to a local file or a
class object with a read method
csiPath/csiUrl/csiFilehandle - a CSI index for the BAM file, required
for long chromosomes greater than 2^29 in length
baiPath/baiUrl/baiFilehandle - a BAI index for the BAM file
recordClass - a custom class extending BamRecord to use for records (see
Custom BamRecord class section below)
Note: filehandles implement the Filehandle interface from
https://www.npmjs.com/package/generic-filehandle2.
This module offers the path and url arguments as convenience methods for
supplying the LocalFile and RemoteFile
async getRecordsForRange(refName, start, end, opts)
Note: you must run getHeader before running getRecordsForRange
refName - a string for the chrom to fetch from
start - a 0-based half open start coordinate
end - a 0-based half open end coordinate
opts.signal - an AbortSignal to indicate stop processing
opts.viewAsPairs - re-dispatches requests to find mate pairs. default: false
opts.pairAcrossChr - control the viewAsPairs option behavior to pair across
chromosomes. default: false
opts.maxInsertSize - control the viewAsPairs option behavior to limit
distance within a chromosome to fetch. default: 200kb
async *streamRecordsForRange(refName, start, end, opts)
This is a async generator function that takes the same signature as
getRecordsForRange but results can be processed using
for await (const chunk of file.streamRecordsForRange(
refName,
start,
end,
opts,
)) {
}
The getRecordsForRange simply wraps this process by concatenating chunks into
an array
async getHeader(opts: {....anything to pass to generic-filehandle2 opts})
This obtains the header from HtsgetFile or BamFile. Retrieves BAM file and
BAI/CSI header if applicable, or API request for refnames from htsget
async indexCov(refName, start, end)
refName - a string for the chrom to fetch from
start - a 0-based half open start coordinate (optional)
end - a 0-based half open end coordinate (optional)
Returns features of the form {start, end, score} containing estimated feature
density across 16kb windows in the genome
async lineCount(refName: string)
refName - a string for the chrom to fetch from
Returns number of features on refName, uses special pseudo-bin from the BAI/CSI
index (e.g. bin 37450 from bai, returning n_mapped from SAM spec pdf) or -1 if
refName not exist in sample
async hasRefSeq(refName: string)
refName - a string for the chrom to check
Returns whether we have this refName in the sample
BamRecord properties
record.fileOffset
record.ref_id
record.start
record.end
record.name
record.seq
record.qual
record.CIGAR
record.flags
record.mq
record.strand
record.template_length
record.tags
record.NUMERIC_MD
record.NUMERIC_CIGAR
record.NUMERIC_SEQ
record.next_refid
record.next_pos
record.isPaired()
record.isProperlyPaired()
record.isSegmentUnmapped()
record.isMateUnmapped()
record.isReverseComplemented()
record.isMateReverseComplemented()
record.isRead1()
record.isRead2()
record.isSecondary()
record.isFailedQc()
record.isDuplicate()
record.isSupplementary()
record.seqAt(idx)
record.toJSON()
Custom BamRecord class
You can provide your own BamRecord class to add custom properties or methods:
import { BamFile, BamRecord } from '@gmod/bam'
class CustomBamRecord extends BamRecord {
get customProperty() {
return `custom-${this.name}`
}
getDoubleStart() {
return this.start * 2
}
}
const bam = new BamFile<CustomBamRecord>({
bamPath: 'test.bam',
recordClass: CustomBamRecord,
})
await bam.getHeader()
const records = await bam.getRecordsForRange('ctgA', 0, 50000)
console.log(records[0].customProperty)
console.log(records[0].getDoubleStart())
License
MIT © Colin Diesh