Data Item _entity_poly.pdbx_seq_one_letter_code_can

General

Item name
_entity_poly.pdbx_seq_one_letter_code_can
Category name
entity_poly
Attribute name
pdbx_seq_one_letter_code_can
Required in PDB entries
no
Used in current PDB entries
Yes, in about 99.5 % of entries

Item Description

Cannonical chemical sequence expressed as string of one-letter amino acid codes. Modifications are coded as the parent amino acid where possible. A for alanine or adenine B for ambiguous asparagine/aspartic-acid R for arginine N for asparagine D for aspartic-acid C for cysteine or cystine or cytosine Q for glutamine E for glutamic-acid Z for ambiguous glutamine/glutamic acid G for glycine or guanine H for histidine I for isoleucine L for leucine K for lysine M for methionine F for phenylalanine P for proline S for serine T for threonine or thymine W for tryptophan Y for tyrosine V for valine U for uracil

Item Example

 
MSHHWGYGKHNGPEHWHKDFPIAKGERQSPVDIDTHTAKYDPSLKPLSVSYDQATSLRILNNGAAFNVEFD

Data Type

Data type code
text
Data type detail
text item types / multi-line text ...
Primitive data type code
char
Regular expression
[][ \n\t()_,.;:"&<>/\{}'`~!@#$%?+=*A-Za-z0-9|^-]*

Aliases

Alias Item Name Dictionary Name Dictionary Version
_entity_poly.ndb_seq_one_letter_code_can cif_rcsb.dic 1.1