The nucleotide and amino-acid distributions are studied for two variants of mRNA of gene that codes for a protein which is involved in multiple myeloid. Some patches and symmetries are singled out, thus, showing some distinctions between the two variants. Fractal dimensions and entropy are discussed as well.
1. Introduction
In some recent papers, the concepts of fractality [1–19] and entropy [19–21] have been considered as fundamental parameters to investigate the existence of correlations [22–36] and simple rules [37] in the DNA sequences. In particular, it has been observed that the increasing fractal dimension [7–13] can be related to a degeneration in sequences, having as a consequence pathological evolution of related diseases. A fundamental role is played by the concept of information entropy [20, 21] so that a change in the nucleotide distribution in DNA implies a corresponding change in the information content and, as a consequence, a variation in the entropy. Since the cell activity is functionally dependent on the nucleotide distribution our task is to understand better about this distribution and/or about the existence of large scale structure [1–6, 15, 22–37]. So that we could relate the functional activity of cells to some epitomizing patches in the nucleotide distribution. We will propose, in the following, also to take into account the information content in the amino acid distribution. In particular, we will see that the amino-acid distribution shows a higher level structure, and some patchiness which are undetectable in the nucleotide distribution. Our statistical approach is based on the transformation of the symbolic string into a numerical string by the Voss indicator function [4, 5] which is a discrete binary function. On this function, the indicator matrix is defined and on this matrix the fractal dimension and entropy can be simply computed. We will compare the fractal dimension and complexity of two mRNA variants of TET 2 (ten-eleven translocation 2) gene downloaded from gene bank [38] (similar data are also available from [39–41]), by showing that these parameters can be used to classify the two variants. Multiple myeloma is a pathology which involves plasma cell, but it can move and spread into the whole body. Some aspects are still unclear; however, it is known that this pathology is characterized by the activation of abnormal genes through chromosomal translocations and other genetic anomalies. One of the genes involved in the birth and progression of multiple myeloma is the TET 2. In fact it is present in some myelodysplastic syndromes, and it seems to play a key role when it is subject to mutation. The gene TET 2 is related with myelopoiesis; in fact it encodes protein that we can find significantly expressed in hematopoietic cells and granulocytes.
2. Multiple Myeloma and the Oncogene TET 2
Multiple myeloma (MM) is a blood cancer of the plasma cell. Myeloma originates in a specific type of cell, the plasma cell, but it can move, so that it spreads by the blood to the whole body. Like other cancers, multiple myeloma will develop in steps. Myeloma begins when the normal plasma cell becomes abnormal. The abnormal cell divides, and the new cells divide again and again, thus proliferating the number of abnormal cells. Myeloma cells collect in the bone marrow and in the solid part of the bone. These malignant plasma cells produce a para protein, an inactive antibody known also as M-protein or Bence Jones protein, that attack bone marrow, bones, blood, and kidneys. As a consequence, there happens extensive destruction within the skeleton involving multiple bones, and resulting in widespread bone pain and multiple fractures; for this reason, such a disease is also called multiple myeloma. Some genetic factors are also involved in this pathology. In absence of other symptoms and clinical signs, this condition is more properly called benign monoclonal gammopathy of uncertain significance (MGUS). In fact, the uncertainty about the future progression, it shows that also benign diseases might evolve into MM. It is likely that the evolution of MGUS into MM depends on many mutations of the MGUS clone. Initially, MM has a low progression, but afterward it becomes more aggressive. The signs that characterize onset of multiple myeloma are mostly high concentration of calcium ions with damages in the kidneys, the weakening of the immune system with abnormal production of immune globulin, and some other signs such as an evident osteoporosis. Both MGUS and MM diseases are characterized by the presence of alterations in gene expression [42–46]. The chromosomes that are more involved are 1,11,13,14, respectively. The alteration at chromosome 1 is found in half of cases of MM patients [47–54]. The same aberrations chromosome seem to be evident both in MM and in MGUS, thus supporting the thesis that these two diseases are closely related [53].
Gene TET 2 is located on the chromosome 4 exactly in 4q24. More precisely, the TET2 gene is located from base pair 106,067,942 to base pair 106,200,957 on chromosome 4, position as shown in Figure 1 [38].
Location of the TET 2 on chromosome 4.
The gene TET 2 plays a key role in the conversion of methylcytosine (5mC) to 5-hydroxymethylcytosine (hmC) moreover is related to myelopoiesis. For the hmC many roles were noted like for example (1) remodeling of chromatin structure (2) recruitment of some factors (3) demethylation of cytosine [55, 56]. The gene TET2 encodes a protein that we find significantly expressed in hematopoietic cells and granulocytes. In almost all patients with myelodysplastic syndromes, the protein is decreased in peripheral blood granulocytes. TET 2 gene is usually mutated in myeloproliferative disorders (MPDs). The MPD is part of a larger group of disorders called myeloproliferative neoplasms (MPNs). The mutation of TET 2 characterizes some disorder known as systemic mast cell disease, but TET 2 is above all mutated in myelodisplastic syndromes [57].
We will see that, by using some parameters defined on the indicator function, we can single out some patches which characterize abnormal functional activity [1–3, 35, 58, 59].
3. DNA Representation
The DNA, as well as the mRNA, of each organism of a given species is a sequence of a specific number of base pairs defined on the 4 elements alphabet of nucleotides:
A=adenine,C=cytosine,G=guanine,T=thymine.
Since the base pairs are distributed along a double helix, when straightened, the helix appears as a complementary double-strand system. The two sequences on opposite strands are complementary in the sense that opposite nucleotides must fulfil the ligand rules (A with T and G with C) of base pairs, between purines A and G and pyrimidines T and C. In a DNA sequence, there are some subsequences, which can be roughly subdivided into coding and noncoding regions, having special meaning. In particular, genes (belonging to coding regions) are characteristic sequences of base pairs, and the genes in turn are made by some alternating subsequences of exons and introns (except Procaryotes where the introns are missing). Each exon region is made of triplets of adjacent bases called codons. There are 64 possible codons, inasmuch the number of combination of the 4 nucleotides into 3 length classes. There are only 20 amino acids, therefore, the correspondence codons to amino acids are many to one. The 20 elements alphabet of amino acids is in Table 1. In the following, we will analyze two mRNA sequences: (H1) and (H2), downloaded from the National Center for Biotechnology Information [38], which represent respectively.
homo sapiens tet oncogene, family member 2 (TET2), transcript variant 1, mRNA, locus NM_001127208 (9796 bp mRNA linear). The accession number is NM_001127208, version NM_001127208.2 GI: 325197189.
homo sapiens tet oncogene family member 2 (TET2), transcript variant 2, mRNA, locus NM_017628 (9236 bp mRNA linear). The accession number is NM_017628, version NM_017628.4 GI: 325197183.
Letter
Name
Codons
1
M
Methionine
ATG
2
E
Glutamic acid
GAA, GAG
3
Q
Glutamine
CAA, CAG
4
D
Aspartic acid
GAT, GAC
5
R
Arginine
CGT, CGC, CGA, CGG, AGA, AGG
6
T
Threonine
ACT, ACC, ACA, ACG
7
N
Asparagines
AAT, AAC
8
H
Histidine
CAT, CAC
9
V
Valine
GTT, GTC, GTA, GTG
10
G
Glycine
GGT, GGC, GGA, GGG
11
L
Leucine
TTA, TTG, CTT, CTC, CTA, CTG
12
S
Serine
TCT, TCC, TCA, TCG, AGT, AGC
13
P
Proline
CCT, CCC, CCA, CCG
14
F
Phenylalanine
TTT, TTC
15
I
Isoleucine
ATT, ATC, ATA
16
C
Cysteine
TGT, TGC
17
A
Alanine
GCT, GCC, GCA, GCG
18
K
Lysine
AAA, AAG
19
Y
Thyroxine
TAT, TAC
20
W
Tryptophan
TGG
Stop
TAA, TAG, TGA
Some differences between two variants are the following: (H2) is different from (H1) in 5′UTR (untranslate region) and in 3′UTR (untranslate region); furthermore, (H2) variant, compared with (H1), is shown to have c-terminal to be distinct and even shorter of (H1), which is also represented by a longer transcript [37].
4. Dot Plot on the Indicator Matrix
In this section, we will define the indicator matrix [4, 5] on which the computation of multifractality and entropy are based.
4.1. Indicator Function for the 4-Symbol Alphabet
Let
ℵ4={A,C,G,T}
be the finite set (alphabet) of nucleotides and x∈ℵ4 any member of the 4 symbols alphabet.
A DNA sequence is the finite symbolic sequence
D(N)=N×ℵ4
so that
D(N)={xh}h=1,…,N,N<∞
being
xh=(h,x)=x(h),(h=1,2,…,N;x∈ℵ4)
the acid nucleic xat the position h.
Let 𝔇1(N),𝔇2(N) be two DNA sequences; the indicator function [4, 5] is the map
u:D1(N)×D2(N)⟶{0,1}
such that
uhk=u(xh,xk)={1ifxh=xk0ifxh≠xk(xh∈D1(N),xk∈D2(N)).
When 𝔇1(N)≡𝔇2(N) the indicator function, it shows the existence of autocorrelation on the same sequence. According to (4.5), the indicator map of the N-length sequence can be easily represented by the N×N sparse matrix of binary values {0,1}, and this matrix can be visualized by the following (autocorrelation) dot-plot:
⋮⋮⋮⋮⋮⋮⋮⋮⋮⋮⋮G0100000001…C0001000010…A1000101100…A1000101100…T0010010000…A1000100100…C0001000010…T0010010000…G0100000001…A1000100100…uhkAGTCATAACG…
4.2. Indicator Function for the 20-Symbols Alphabet of Amino Acids
As a generalization of the 4-symbols alphabet of nucleotides, we can define the 20-symbols alphabet of amino acids as follows:
ℵ20={M,E,Q,D,R,T,N,H,V,G,L,S,P,F,I,C,A,K,Y,W}.
A protein sequence is the finite symbolic sequence
L(N)=N×ℵ20
so that
L(N)={xh}h=1,…,N,N<∞
being
xh=(h,x)=x(h),(h=1,2,…,N;x∈ℵ20)
the amino acid xat the position h.
The indicator function [4, 5] can be extended also to protein sequences as the map
u:L1(N)×L2(N)⟶{0,1}
such that
uhk=u(xh,xk)={1ifxh=xk0ifxh≠xk(xh∈L1(N),xk∈L2(N)).
After a transduction of the two DNA sequences (H1) and (H2) into their amino acids components, we can see that the corresponding dot plots can show some (higher-level) structure on the distribution of nucleotides (see Figure 2). In particular, (H2) shows a special pattern which is more evident in the amino acids dot plot.
Dot plot for the first 50 nucleotides distribution in the (H1)-(H1), (H2)-(H2) DNA sequences (a,b) and corresponding amino acids (d) and (e). In (c) and (f), the cross correlations (H1)-(H2) are given.
5. Probability Distribution5.1. Frequency Distribution
The probability distribution of nucleotides can be defined by the frequencypX(n)=1n∑i=1nuXi,(X∈ℵ4,xi∈D(N);1≤n≤N)
that the acid nucleic X can be found at the position n. This value can be approximated by the frequency count (on the indicator matrix) of the nucleotide distribution before n. So that, for the transcript variant, we have the probability density distribution of Figure 3 which, however, tends to assume some different constant values thus showing that nucleotides are heterogeneously distributed.
Probability density distribution of nucleotides along TET2 oncogene variants (H1) (brown) and (H2) (blue).
5.2. Distribution of the Essential Amino Acids
Analogously to the nucleotides frequency distribution, we can compute also the amino-acid distribution
pX(n)=1n∑i=1nuXi,(X∈ℵ20,xi∈L(N);1≤n≤N)
that the amino-acid X can be found at the position n.
In particular, we have noticed that even if the nucleotides distribution is nearly the same in both sequences (H1) and (H2), the amino acid shows different distributions for the same amino-acid in each sequence. In other words, the “second”-level distribution seems to be organized according to a different distribution law (see Figures 4 and 5).
Distribution of amino acids along (H1), being 1 methionine, 2 glutamic acid, 3 glutamine, 4 aspartic acid, 5 arginine, 6 threonine, 7 asparagine, 8 histidine, 9 valine, 10 glycine, 11 leucine, 12 serine, 13 proline, 14 phenylalanine, 15 isoleucine, 16 cysteine, 17 alanine, 18 lysine, 19 thyroxine, and 20 tryptophan.
Distribution of amino acids along (H2), being 1 methionine, 2 glutamic acid, 3 glutamine, 4 aspartic acid, 5 arginine, 6 threonine, 7 asparagine, 8 histidine, 9 valine, 10 glycine, 11 leucine, 12 serine, 13 proline, 14 phenylalanine, 15 isoleucine, 16 cysteine, 17 alanine, 18 lysine, 19 thyroxine, and 20 tryptophan.
6. Fractal Dimension and Entropy6.1. Fractal Dimension
The frequency distribution implies a corresponding frequency of correlation in the correlation matrix. By using the indicator matrix, it is possible to give a simple formula which enables us to estimate the fractal dimension as the average of the number p(n) of 1 in the randomly taken n×n minors of the N×N correlation matrix uhkD=121N∑n=2Nlogp(n)logn.
If we compare the fractal dimensions of the two mRNA sequences (H1) and (H2), we can see (Figure 6) that the fractal dimension of nucleotide distribution tends, for both variants, to the value 1.26.
Fractal dimension as function of the length, i=10,…,200 for (H1) (red) and (H2) (blue).
It is interesting to notice that the corresponding amino acids of the two sequences have (more or less) the same fractal dimension which tends for both (Figure 7) to the value 1.29.
Fractal dimension as function of the length, i=10,…,200 for the amino acids of (H1) (red) and (H2) (blue).
6.2. Entropy Estimate
As a measure of the information distribution, we consider the normalized Shannon entropy, which is defined, for a distribution over the alphabetℵℓ, as
H(n)=-1logl∑i=1lpXi(n)×{log2pXi(n)ifpXi(n)≠0,0ifpXi(n)=0,
where pXi(n) is given by (5.1) for nucleotides and (5.2) for amino acids.
Since ∑i=1ℓpXi(n)=1, foralln, the main values of this function are the following.
If pXi(n)=1, pXj(n)=0(j≠i), then H(n)=0. This happens when the information is concentrated in only one symbol.
If pXi(n)=pXj(n)=1/ℓ, i≠j then H(n)=1. In this case, the information is equally distributed over all symbols.
Equation 0≤H(n)≤1. In general, the information content is distributed over the range [0,1].
Therefore, the entropy is a positive function ranging in the interval [0,1], the minimum value is obtained when the distribution is concentrated on a single symbol, while the maximum value is obtained when all symbols are equally distributed.
In particular for higher values of n, according to the frequency definition of probability, the entropy tends to the constant value 1 (see Figures 8 and 9) both for nucleotides and amino acids. However, in the first case (Figure 8), the entropy of (H1) is lower than (H2), while on the contrary, for the corresponding amino acids,s the entropy of (H1) is greater than (H2).
Entropy for the first 300 nucleotides of the sequence (H1) (red) and (H2) (blue).
Entropy for the first 300 amino acids of the sequence (H1) (red) and (H2) (blue).
7. Conclusions
In this paper, two variants of mRNA of isoforms TET2 gene have been analyzed through their nucleotide and amino acids distribution. By using the indicator function (and matrix), the fractal dimension and the entropy have been easily computed. We have noticed that, at the amino acid level, some patches can be easily singled out. Moreover, the second variant (H2) of TET 2 shows some more randomness than (H1).
CattaniC.Fractals and hidden symmetries in DNA20102010312-s2.0-7795452576210.1155/2010/507056507056ZBL1189.92015CattaniC.ElloumiM.ZomayaA. Y.Wavelet algorithms for DNA analysis2010chapter 35New York, NY, USAJohn Wiley & SonsWiley Series in BioinformaticsCattaniC.ccattani@unisa.itPierroG.gaetanopierro@hotmail.itComplexity on acute myeloid leukemia mRNA transcript variant201120111637987310.1155/2011/379873VossR. F.Evolution of long-range fractal correlations and 1/f noise in DNA base sequences19926825380538082-s2.0-000020487810.1103/PhysRevLett.68.3805VossR. F.Long-Range Fractal Correlations in DNA introns and exons1992216BuldyrevS. V.GoldbergerA. L.HavlinS.MantegnaR. N.MatsaM. E.PengC. K.SimonsM.StanleyH. E.Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis1995515508450912-s2.0-3374869028910.1103/PhysRevE.51.5084MetzeK.Fractal dimension of chromatin and cancer prognosis2010256016042-s2.0-7814941833610.2217/epi.10.50AdamR. L.SilvaR. C.PereiraF. G.LeiteN. J.Lorand-MetzeI.MetzeK.The fractal dimension of nuclear chromatin as a prognostic factor in acute precursor B lymphoblastic leukemia2006281-255592-s2.0-33646382360MetzeK.Lorand-MetzeI.LeiteN. J.AdamR. L.Goodness-of-fit of the fractal dimension as a prognostic factor20093165035042-s2.0-7194909162810.3233/CLO-2009-0505GoutzanisL.PapadogeorgakisN.PavlopoulosP. M.KattiK.PetsinisV.PlochorasI.PantelidakiC.KavantzasN.PatsourisE.AlexandridisC.Nuclear fractal dimension as a prognostic factor in oral squamous cell carcinoma20084443453532-s2.0-4074911512610.1016/j.oraloncology.2007.04.005MashiahA.WolachO.SandbankJ.UzielO.RaananiP.LahavM.Lymphoma and leukemia cells possess fractal dimensions that correlate with their biological features200811931421502-s2.0-4604910531810.1159/000125551MetzeK.FerroD. P.FalconiM. A.Fractal characteristics of nuclear chromatin in routinely stained cytology are independent prognostic factors in patients with multiple myeloma2009445supplement 1721BedinV.AdamR. L.de SáB. C. S.LandmanG.MetzeK.Fractal dimension of chromatin is an independent prognostic factor for survival in melanoma201010, article 2602-s2.0-7795307362610.1186/1471-2407-10-260LebedevD. V.FilatovM. V.KuklinA. I.IslamovA. K.KentzingerE.PantinaR.TopervergB. P.Isaev-IvanovV. V.Fractal nature of chromatin organization in interphase chicken erythrocyte nuclei: DNA structure exhibits biphasic fractal properties20055796146514682-s2.0-1424427031610.1016/j.febslet.2005.01.052McNallyJ. G.MazzaD.Fractal geometry in the nucleus2010291232-s2.0-7564908570210.1038/emboj.2009.375TakahashiM.A fractal model of chromosomes and chromosomal DNA replication198914111171362-s2.0-002446678010.1016/S0022-5193(89)80012-8DelidesA.PanayiotidesI.AlegakisA.KyroudiA.BanisC.PavlakiA.HelidonisE.KittasC.Fractal dimension as a prognostic factor for laryngeal carcinoma2005253214121442-s2.0-21344443438FerreiraR. C.de MatosP. S.AdamR. L.LeiteN. J.MetzeK.Application of the Minkowski-Bouligand fractal dimension for the differential diagnosis of thyroid follicular neoplasias2006285-63313332-s2.0-33845653252AdamsJ. M.HarrisA. W.PinkertC. A.The c-myc oncogene driven by immunoglobulin enhancers induces lymphoid malignancy in transgenic mice19853186046533538PontrjaginL.SchnirelmannL.Sur une propriété métrique de la dimension19323315616210.2307/1968109ZBL0003.33101KolmogorovA. N.TihomiroffV. M.ε> -Entropy and ε -capacity of sets in functional spaces1949142386Patrick FitchJ.SokhansanjB.Genomic engineering: moving beyond DNA sequence to function20008812194919712-s2.0-000099571510.1109/5.899061GeeH.A journey into the genome: what’s there20012-s2.0-33748811493CristeaP. D.Large scale features in DNA genomic signals20038348718882-s2.0-003739977210.1016/S0165-1684(02)00477-2ZBL1144.62353HerzelH.TrifonovE. N.WeissO.GroßeI.Interpreting correlations in biosequences19982491–44494592-s2.0-003200178210.1016/S0378-4371(97)00505-0LiW.The study of correlation structures of DNA sequences: a critical review19972142572712-s2.0-003129846410.1016/S0097-8485(97)00022-3LiW.KanekoK.Long-range correlations and partial spectrum in a noncoding DNA sequence19921765566010.1209/0295-5075/17/7/014PengC. K.BuldyrevS. V.GoldbergerA. L.HavlinS.SciortinoF.SimonsM.StanleyH. E.Long-range correlations in nucleotide sequences199235663651681702-s2.0-002658515210.1038/356168a0PengC. K.BuldyrevS. V.HavlinS.SimonsM.StanleyH. E.GoldbergerA. L.Mosaic organization of DNA nucleotides1994492168516892-s2.0-3454785620310.1103/PhysRevE.49.1685WeissO.HerzelH.Correlations in protein sequences and property codes199819043413532-s2.0-003255431810.1006/jtbi.1997.0560YuZ. G.AnhV. V.WangB.Correlation property of length sequences based on global structure of the complete genome20016312-s2.0-1774438299010.1103/PhysRevE.63.011903011903VaidyanathanP. P.YoonB. J.The role of signal-processing concepts in genomics and proteomics20043411-21111352-s2.0-154274223510.1016/j.jfranklin.2003.12.001ZBL1094.92044Bernaola-GalvánP.Román-RoldánR.OliverJ. L.Compositional segmentation and long-range fractal correlations in DNA sequences1996535518151892-s2.0-424376425510.1103/PhysRevE.53.5181LiW.BentleyJ.The complexity of DNA: the measure of compositional heterogenity in DNA sequence and measures of complexity199733337KarlinS.BrendelV.Patchiness and correlations in DNA sequences199325950956776802-s2.0-002745884310.1126/science.8430316BennettJ. M.YoungM. L.AndersenJ. W.CassilethP. A.TallmanM. S.PaiettaE.WiernikP. H.RoweJ. M.Long-term survival in acute myeloid leukemia: the Eastern Cooperative Oncology Group experience19978011220522092-s2.0-0030722602SchrödingerE.1948Cambridge, UKCambridge University PressNational Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov/genbank/Universal Protein Resource, http://www.uniprot.org/help/aboutGene Location, Weizmann Institute of Science, http://genecards.weizmann.ac.il/geneloc/e!Ensemble, Ensembl project, EMBL-EBI & Wellcome Trust Sanger Institute, http://www.ensemble.orgAvet-LoiseauH.FaconT.DavietA.GodonC.RappM. J.HarousseauJ. L.GrosboisB.BatailleR.14q32 translocations and monosomy 13 observed in monoclonal gammopathy of undetermined significance delineate a multistep process for the oncogenesis of multiple myeloma19995918454645502-s2.0-0033568514DrachJ.SchusterJ.NowotnyH.AngerlerJ.RosenthalF.FieglM.RothermundtC.GsurA.JagerU.HeinzR.LechnerK.LudwigH.HuberH.Multiple myeloma: high incidence of chromosomal aneuploidy as detected by interphase fluorescence in situ hybridization19955517385438592-s2.0-0029124130FlactifM.ZandeckiM.LaïJ. L.BernardiF.ObeinV.BautersF.FaconT.Interphase fluorescence in situ hybridization (FISH) as a powerful tool for the detection of aneuploidy in multiple myeloma1995912210921142-s2.0-0029417027FonsecaR.AhmannG. J.JalalS. M.DewaldG. W.LarsonD. R.TherneauT. M.GertzM. A.KyleR. A.GreippP. R.Chromosomal abnormalities in systemic amyloidosis199810337047102-s2.0-003176061710.1046/j.1365-2141.1998.01034.xZandeckiM.LaïJ.-L.GenevièveF.BernardiF.Vollo-RémyH.BlanchetO.FrançoisM.CossonA.BautersF.FaconT.Several cytogenetic subclones may be identified within plasma cells from patients with monoclonal gammopathy of undetermined significance, both at diagnosis and during the indolent course of this condition199790936823690DewaldG. W.KyleR. A.HicksG. A.GreippP. R.The clinical significance of cytogenetic studies in 100 patients with multiple myeloma, plasma cell leukemia, or amyloidosis19856623803902-s2.0-0021792205BarlogieB.EpsteinJ.SelvanayagamP.AlexanianR.Plasma cell myeloma—New biological insights and advances in therapy19897348658792-s2.0-0024594241LiangW.HopperJ. E.RowleyJ. D.Karyotypic abnormalities and clinical aspects of patients with multiple myeloma and related paraproteinemic disorders19794426306442-s2.0-0018692802GahrtonG.ZechL.NillssonK.2 Translocations, t(11;14) and t(1;6), in a patient with plasma cell leukaemia and 2 populations of plasma cells19802414246MorganG. J.DaviesF. E.LinetM.Myeloma aetiology and epidemiology20025652232342-s2.0-003663003510.1016/S0753-3322(02)00194-4Avet-LoiseauH.AttalM.MoreauP.CharbonnelC.GarbanF.HulinC.LeyvrazS.MichalletM.Yakoub-AghaI.GarderetL.MaritG.MichauxL.VoillatL.RenaudM.GrosboisB.GuillermG.BenboubkerL.MonconduitM.ThieblemontC.CasassusP.CaillotD.StoppaA. M.SottoJ. J.WetterwaldM.DumontetC.FuzibetJ. G.AzaisI.DorvauxV.ZandeckiM.BatailleR.MinvielleS.HarousseauJ. L.FaconT.MathiotC.Genetic abnormalities and survival in multiple myeloma: the experience of the Intergroupe Francophone du Myélome20071098348934952-s2.0-3384737831410.1182/blood-2006-08-040410SawyerJ. R.WaldronJ. A.JagannathS.BarlogieB.Cytogenetic findings in 200 patients with multiple myeloma199582141492-s2.0-002916293810.1016/0165-4608(94)00284-ILandgrenO.KyleR. A.PfeifferR. M.KatzmannJ. A.CaporasoN. E.HayesR. B.DispenzieriA.KumarS.ClarkR. J.BarisD.HooverR.RajkumarS. V.Monoclonal gammopathy of undetermined significance (MGUS) consistently precedes multiple myeloma: a prospective study200911322541254172-s2.0-6704916218810.1182/blood-2008-12-194241LangemeijerS. M. C.KuiperR. P.BerendsM.KnopsR.AslanyanM. G.MassopM.Stevens-LindersE.van HoogenP.van KesselA. G.RaymakersR. A.P.KampingE. J.VerhoefG. E.VerburghE.HagemeijerA.VandenbergheP.de WitteT.van der ReijdenB. A.JansenJ. H.j.jansen@chl.umcn.nlAcquired mutations in TET2 are common in myelodysplastic syndromes200941783884210.1038/ng.391KoM.HuangY.JankowskaA. M.PapeU. J.TahilianiM.BandukwalaH. S.AnJ.LampertiE. D.KohK. P.GanetzkyR.LiuX. S.AravindL.AgarwalS.MacIejewskiJ. P.maciejj@ccf.orgRaoA.arao@idi.harvard.eduImpaired hydroxylation of 5-methylcytosine in myeloid cancers with mutant TET22010468732583984310.1038/nature09586LorsbackR. B.MooreJ.MathewS.RaimondiS. C.MukatiraS. T.DowningJ. R.TET1, a member of a novel protein family, is fused to MLL in acute myeloid leukemia containing the t(10;11)(q22;23) [3]200317363764110.1038/sj.leu.2402834ChenS. Y.sy@ieee.orgGuanQ.gq@zjut.edu.cnParametric shape representation by a deformable NURBS model for cardiac functional measurements2011583, part 148048710.1109/TBME.2010.2087331ChenS.sy@ieee.orgZhangJ.jzhang@informatik.uni-hamburg.deZhangH.GuanQ.DuY.YaoC.ZhangJ.Myocardial motion analysis for determination of tei-index of human heart20101012114281143910.3390/s101211428