SeqRecord

  • The SeqRecord object has a few additional attributes beyond the Seq object:

    • seq - The sequence itself

    • id - The primary ID used to identify the sequence - a string. In most cases this is something like an accession number

    • name - A 'common' name/id for the sequence - a string. In most cases this will be the same as the accession number, but it could also be a clone name. Analogous to the LOCUS id in a GenBank record

    • description - A human readable description or expressive name for the sequence - a string

In [1]: import Bio

In [2]: from Bio.Seq import Seq

In [3]: from Bio.SeqRecord import SeqRecord

In [4]: ##create a simple SeqRecord object

In [5]: simple_seq=Seq("GATCAGGATTAGGCC")

In [6]: simple_seq_r=SeqRecord(simple_seq)

In [7]: simple_seq_r.id="AC12345"

In [8]: simple_seq_r.name = "alienClone"

In [9]: simple_seq_r.description="I am not a real sequence"

In [10]: ##print summary

In [11]: print(simple_seq_r.id)
AC12345

In [12]: print(simple_seq_r.description)
I am not a real sequence

In [13]: print(simple_seq_r.seq)
GATCAGGATTAGGCC

In [14]: print(simple_seq_r.seq.translate())
DQD*A

Last updated