Skip to content

Commit 03505ac

Browse files
committed
Explicitly permit eg N+m in MM tag (PR#799)
The text already states that an unmodified base of N means we count any base type, but base N code N in the table is a little misleading as to the intention. It was intended to mean any unspecified modification, in the same way C+C is any unspecified C mod, but in this case it's against all bases rather than a specific base type. However that doesn't solve the issue of whether we can record specific mods against any "fundamental" source base. Clarified this by adding an extra line to the table and some text. (However note this doesn't necessarily imply downstream processing tools will not do any compatibility assessment and reject N+m when the SEQ base is a T.) Fixes #785
1 parent 5a6f5e9 commit 03505ac

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

SAMtags.tex

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -532,6 +532,7 @@ \subsection{Base modifications}
532532

533533
An unmodified base of `{\tt N}' means count any base in {\sf SEQ}, not only those of `{\tt N}'.
534534
Thus `{\tt N+n,100;}' means the 101st base is Xanthosine (n), irrespective of the sequence composition.
535+
A fundamental base of `{\tt N}' may also be used with a base-specific modification code to force the counting to be applied per base rather than per base-type.
535536

536537
The standard code types and their associated ChEBI values are listed
537538
below, taken from Viner {\it et al.}%
@@ -567,6 +568,7 @@ \subsection{Base modifications}
567568
\hline
568569
N & n & Xao & Xanthosine & 18107 \\
569570
N & N & & Ambiguity code; any mod & \\
571+
N & any & & Mod applied to any base & \\
570572
\end{tabular}
571573
\end{center}
572574

0 commit comments

Comments
 (0)