An algebra for biological sequences

Swapan Raha


This paper attempts an algebraic formulation of biological sequences. An algebraic structure is constructed for a given chromosomal string and its segmentation, and this algebra is shown to represent the most common chromosomal mutational mechanisms. The mathematical study is interpreted in light of biological knowledge, and basic results are derived from the behaviour of the chromosomal segments. This leads to a new way of describing and manipulating chromosomal mutations using mathematical forms and models.


Keywords: DNA, Chromosome, Semi-group, σ-algebra
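
As a rough illustration of the idea summarised in the abstract, the sketch below treats a segmented chromosome as a tuple of segment labels. Concatenation of such tuples is associative, which gives the semigroup structure named in the keywords. The function names (`concat`, `delete`, `duplicate`, `invert`, `transpose`), the half-open index convention, and the unsigned treatment of inversion are all assumptions made for this example. They are not the paper's own definitions, which are not given on this page.

```python
from typing import Tuple

# Hypothetical representation: a chromosome is an ordered tuple of segment labels.
Chromosome = Tuple[str, ...]


def concat(x: Chromosome, y: Chromosome) -> Chromosome:
    """Join two segment strings; this operation is associative."""
    return x + y


def delete(c: Chromosome, i: int, j: int) -> Chromosome:
    """Remove the block of segments c[i:j]."""
    return c[:i] + c[j:]


def duplicate(c: Chromosome, i: int, j: int) -> Chromosome:
    """Insert a tandem copy of the block c[i:j] directly after it."""
    return c[:j] + c[i:j] + c[j:]


def invert(c: Chromosome, i: int, j: int) -> Chromosome:
    """Reverse the order of the block c[i:j] (unsigned: orientation is not tracked)."""
    return c[:i] + c[i:j][::-1] + c[j:]


def transpose(c: Chromosome, i: int, j: int, k: int) -> Chromosome:
    """Swap the adjacent blocks c[i:j] and c[j:k], assuming i <= j <= k."""
    return c[:i] + c[j:k] + c[i:j] + c[k:]


chrom: Chromosome = ("A", "B", "C", "D", "E")
print(delete(chrom, 1, 3))
print(duplicate(chrom, 1, 3))
print(invert(chrom, 1, 4))
print(transpose(chrom, 0, 2, 4))
```

Each mutation here is written purely in terms of slicing and concatenation, so it stays inside the same set of segment strings. A signed model, in which inversion also flips the orientation of each segment, would need a richer segment type than plain labels.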
