Description of Streptococcus thalassemiae sp. nov., a Bacterium Isolated from Human Blood

Blood is a precious biological liquid that is normally sterile. Therefore, bacteria in the bloodstream are shown a priori anomaly. A blood culture is systematically performed to diagnose the cause of the bacteremia. Indeed, a patient received in our service had a thalassemia major and underwent a genoidentical transplant. Then, a blood test was performed to diagnose a four-day fever. In this context, we have isolated strain Marseille-Q2617 from the blood sample. It revealed a new bacterial strain that belongs to the genus Streptococcus. It is a Gram-positive coccus, nonmotile, and nonspore forming. The major fatty acid found is hexadecanoic acid, with 49.5%. A taxonomic method was used to characterize the strain by studying their phenotypic, phylogenetic, and genomic characteristics. In addition, sequence analysis of the 16S rRNA gene shows that the strain Marseille-Q2617 has 99.94% sequence similarity to Streptococcus mitis. Average nucleotide identity (ANI) analysis for strain Marseille-Q2617T showed the highest similarity of 92.9% with S. mitis. The DNA-DNA hybridization value obtained (50.2%) between strain Marseille-Q2607 and S. mitis, its closest related species, was below the recommended threshold (<70%). Strain Marseille-Q2617T has a genome size of 2.02 Mbp with 40.5 mol% of G + C content. Based on these results, we propose a new species of the genus Streptococcus, for which the name Streptococcus thalassemiae sp. nov., Marseille-Q2617T (=CSUR Q2617 = CECT 30109) was proposed.


Introduction
Streptococci are most frequently found in the commensal fora of the skin and upper respiratory tract [1].However, many streptococci are among the most invasive bacteria groups and appear as opportunistic pathogens in certain infections [1]. Some of these highly pathogenic species such as Streptococcus agalactiae, S. gallolyticus, and S. pneumoniae are involved in meningitis, endocarditis, and pneumonia pathologies [2].
Streptococcus is an important bacterial genus that currently includes 110 child taxa with a validly published and correct name (https://lpsn.dsmz.de/genus/ streptococcus). Formerly classifed into three main groups according to their hemolysis pattern (alpha-hemolytic, beta-hemolytic, and nonhemolytic or gammahemolytic), streptococci are presently subdivided into six groups of species based on a systematic study of the 16S rRNA gene sequence: pyogenic, anginosus, mitis, salivarius, bovis, and mutans [3,4].
Nowadays, the general use of matrix-assisted desorption ionization time-of-fight mass spectrometry (MALDI-TOF MS) in the clinical and research environment, coupled with the rapid development of next-generation sequencing technology, gives us a new and more comprehensive insight into the taxonomy. Terefore, we have used a taxonomic approach as described [5] to provide a complete description of this new bacterium species. Tus, based on the morphological, phenotypic, biochemical, phylogenetic, and genomic features, Streptococcus thalassemiae sp. nov., strain Marseille-Q2617 was proposed as a new member of the genus Streptococcus.

Organism Isolation and Collection.
Within the context of an investigation on the etiology of infectious diseases, samples of blood cultures were taken from patients hospitalized at the IHU mediterranean infection. Te strain Marseille-Q2617 was the only one isolated bacterial strain from a patient who had major thalassemia. He had a fever (38.5°C) on day 4 of a genoidentical transplant when blood was drawn.

Identifcation of the Strain by MALDI-TOF MS and 16S
rRNA Gene Sequencing. Colonies were frst identifed using MALDI-TOF MS on a LT Microfex spectrometer (Bruker Daltonics, Bremen, Germany) as described [6]. Te obtained spectra were imported into the MALDI Biotyper 3.0 software (Bruker Daltonics) and matched against the references in the database (Bruker database incremented with in lab references (https://www.mediterranee-infection.com/accesressources/base-de-donnees/urms-data-base/, accessed February 2021)). Te resulting score enabled (or did not enable) the identifcation of the tested species: colonies were labeled as correctly identifed at the species level with a score ≥ 2, at the genus level with a score ≥ 1.7. Unidentifed species (score < 1.7) using MALDI-TOF MS were identifed using Sanger sequencing of the 16S rRNA gene. Te 16S rRNA gene was amplifed using the universal bacterial primers pairs fD1 and rP2 [7] (Eurogentec, Angers, France), and the resulting amplicons were sequenced using the Big Dye ® Terminator v1.1 Cycle Sequencing Kit and 3500xL Genetic Analyzer capillary sequencer (TermoFisher, Saint-Aubin, France), as described [8]. Te sequences were aligned using MUSCLE with default parameters, and the phylogenetic inferences were obtained using the maximum likelihood method and the MEGA X software [9]. Bootstrap values obtained by repeating the analysis 1,000 times to generate a majority consensus tree are indicated at the nodes. Bartonella quintana was used as outgroup.

Biochemical, Sporulation, and Motility Tests.
Biochemical substrates are tested using API ZYM (enzymatic activities), API 50 CH (carbohydrate fermentation), and API 20Strep (specifc criteria) galleries. We also carried out the ability to form spores using a thermal shock. Te motility assay was performed by directly examining a fresh colony using a DM 1000 optical microscope (Leica, Nanterre, France) at a ×400 magnifcation. Detection of catalase (bioMerieux) and oxidase activities (Becton Dickinson, Franklin Lakes, NJ, USA) was also tested.

Electron Microscopy.
Te morphology of the strain was visualized with the Hitachi SU5000 scanning electron microscope (Hitachi Group, Krefeld, Germany). A colony was retrieved from agar and suspended in a 2.5% glutaraldehyde fxative solution. A drop of the suspension was then directly deposited on a poly-L-lysine coated slide for fve minutes and treated with 1% phosphotungstic acid aqueous solution (pH 2.0) for two minutes to increase scanning electron micrograph (SEM) image contrasting. Te slide was washed in water, air-dried, and examined using a tabletop SU5000 microscope (Hitachi High-Tech, HHT, Japan). Te scale bar and acquisition settings are shown on the micrograph.

Cellular Fatty Acid
Composition. Cellular fatty acid methyl ester (FAME) analysis was performed using gas chromatography/mass spectrometry (GC/MS). Several culture plates were scraped to obtain approximately 50 mg of bacterial biomass per tube. FAMEs were prepared according to Sasser's protocol [12], and the GC/MS analyses were then done as previously described [13]. Te sample was prepared with approximately 15 mg of bacterial biomass per tube harvested from several culture plates. Briefy, fatty acid methyl esters were separated using an Elite 5-MS column and monitored by mass spectrometry (Clarus 500-SQ 8 S, Perkin Elmer, Courtaboeuf, France). Spectral database search was performed using MS Search 2.0 operated with the Standard Reference Database 1A (NIST, Gaithersburg, USA) and the FAMEs mass spectral database (Wiley, Chichester, UK).

Genome Assembly and Analysis.
Te total DNA genome of the strain Marseille-Q2617 was extracted and sequenced as previously described [14]. SPAdes version 3.10.1 software [15] was used for assembling genomic reads. Mapping of reads recovered from MiSeq and MinION technologies for strain Marseille-Q2617 was carried out using CLC genomics7 (https://www. qiagenbioinformatics.com/products/). Genome strain is annotated with Prokka version 1.13.3, available in the Galaxy Australia online server (https://usegalaxy-au. github.io/). GCview software [16] was used for genome visualization. Te digital DNA-DNA hybridization (dDDH) among compared genomes was calculated by the online tool genome-to-genome distance calculator, version 3.0 [17]. Furthermore, the average nucleotide identity was estimated using the Orthologous Average Nucleotide Identity Tool (OAT) [18].

Results and Discussion
3.1. Phylogenetic Analysis. Te spectra were added to the local Microfex database to allow future identifcation using MALDI-TOF MS. Te phylogenetic tree based on the 16S rRNA gene highlighted the position of the Marseille-Q2617 strain in relation to all related streptococcus species whose names have been validly published in the LPSN is presented in Figure 1.
Nonidentifcation of the strain by MALDI-TOF mass spectrometry led us to carry out additional analysis. Currently, the 16S rRNA gene alone cannot discriminate strains of streptococci, and a polyphasic approach should be adopted [19,20]. BLASTn was carried out on the NCBI DNA database with the sequences of the 16S rRNA genes, and strain Marseille-Q2617 shows 99.84% sequence similarity to Streptococcus mitis ATCC 49456 (NR_116207.1). Indeed, the sequence similarity values of the 16S rRNA gene obtained are very high compared to the threshold value recommended to delimit the species barrier [21]. In contrast, the phylogenetic tree constructed with Mega X software [9] based on fve concatenated genes (gyrA, ddl, gdh, rpoB, and sodA)  Sequences were aligned using MUSCLE with default parameters. Phylogenetic inferences were obtained using the maximum likelihood method and the MEGA X software [9]. Bootstrap values obtained by repeating the analysis 1,000 times to generate a majority consensus tree are indicated at the nodes. Bartonella quintana was used as outgroup. Te scale bar represents a 2% nucleotide sequence divergence. sequences revealed distinct positions of the strains. Te phylogenetic analyses deduced from comparisons of concatenated genes position our strain among the streptococci belonging to the Mitis clade [4] and clearly show that it is distinguished from the others (Figure 2). Based on these phylogenetic analyses, we suggest that Marseille-Q2617 is a new member of the genus Streptococcus.

Phenotypic Description. Strain
Marseille-Q2617 is a facultative anaerobe growing under aerobic as well as anaerobic conditions at 28 to 42°C but did not grow at 56°C. Te optimal growth of this strain is observed at 37°C after 24 hours of incubation in aerobic conditions. Te bacterial cells are Gram-positive, coccus shaped, nonmotile, and nonspore forming. Te morphology of bacterial cells was   Te results of the API galleries revealed the biochemical characteristics of the strain Marseille-Q2617. Tus, for API ZYM, the positive reactions obtained were for esterase (C4), naphthol-AS-BI-phosphohydrolase and leucine arylamidase, and esterase lipase (C8). An API 50 CH strip showed that strain Marseille-Q2617 was positive for N-acetyl-glucosamine, esculin ferric citrate, Dgalactose, D-sucrose, D-tagatose, and potassium 5ketogluconate. Te use of API STREP strips yielded positive reactions following these tests: sodium pyruvate, hippuric acid, esculin ferric citrate, pyroglutamic ßnaphthylamide acid, L-leucine-ß-naphthylamide, Larginine, and D-lactose (bovine origin) and starch. Te main phenotypic characteristics of strain Marseille-Q2617 were compared with closely related species (Table 1).

Genome Features. Te draft genome size of strain
Marseille-Q2617 is 2.02 Mbp long, with a 40.5 mol% G + C content. Te genome of strain Marseille-Q2617 consists of 5 contigs and possesses 1 949 predicted genes with 1 779 protein-coding genes and similarly 74 RNA genes (12 rRNAs, 59 tRNAs and 3 other RNA). A total of 96 pseudogenes were detected. Te graphical circular map of the draft genomes of Marseille-Q2617 is presented in Figure 4.
Te DDH and OrthoANI values obtained are below the threshold values recommended for classifying prokaryotic species [22,23]. DDH values obtained after genome analysis varied from 22.6% (between Streptococcus oralis and Streptococcus chenjunshii) to 58.6% (between Streptococcus pneumoniae and Streptococcus pseudopneumoniae). Te strain Marseille-Q2617 shared its highest DDH value (55.4%) with Streptococcus pseudopneumoniae (Table 3). Furthermore, in the Streptococcus species studied, genomic analysis based on average nucleotide identity (ANI) showed

Conclusion
Based on the phylogenetic, genomic, and phenotypic specifcities which give the strain Marseille-Q2617 their unique criteria among the known species belonging to Streptococcus genus, we propose that they be considered as novel species, named Streptococcus thalassemiae sp. nov.

Description of Streptococcus thalassemiae sp. nov
Streptococcus thalassemiae (tha.las.se'mi.ae. N.L. gen. n. thalassemiae, refering to the isolation of the bacterial strain from a patient with thalassemia). It is a Grampositive cocci bacterium, nonmotile, and nonspore forming. Growth is obtained between 25°C and 42°C, with optimal growth at 37°C. Te growth of strain Marseille-Q2617 was observed at diferent pH values (from 5.0 to 8.0) and salt concentrations of up to 2.5% NaCl. Te cells have a diameter varying between 1 to 4 µm: no catalase and no oxidase activities. Te colonies are small, punctiform, and pale grayish, with a mean diameter of 0.5 mm on blood agar. Strain Marseille-Q2617 T exhibits positive reactions for penicillin G, amoxicillin, daptomycin, ciprofoxacin, ceftriaxone, clindamycin, doxycycline, fosfomycin, imipenem, linezolid, oxacillin, rifampicin teicoplanin, and oxacillin. C 16:0 (56.1%), C 18:0 (12.7%), and C 18:1n9 (11.5%) are the major fatty acids detected in the cell wall of Streptococcus thalassemiae sp. nov. Te draft genome of strain Marseille-Q2617 T was 2.02 Mbp with 40.5 mol% of G + C content. Indeed, the 16S rRNA and genome sequences of strain Marseille-Q2617 T are deposited in the GenBank database under accession numbers LR809138 and CAHJXN010000000, respectively.
Te type strain of Streptococcus thalassemiae sp. nov., is Marseille-Q2617 T (�CSUR Q2617 � CECT 30109) and was isolated from a man with major thalassemia.

Data Availability
Te 16S rRNA and genome sequences of strain Marseille-Q2617T are deposited in the GenBank database under accession numbers LR809138 and CAHJXN010000000, respectively

Ethical Approval
Te clinical samples were obtained in the context of diagnostic screening. Patients were informed of the possible use of their samples for research purposes and retained their right to refuse approval at any time. According to the French Jarde Law (Law No. 2012-300 of 5 March 2012 and Decree No. 2016-1537 of 16 November 2016 published in the Offcial Journal of the French Republic), as this study did not involve specifc collection of samples or use medical/personal data from patients, neither institutional ethical approval nor individual patient consent was required for this noninvasive study. As a result, this study was approved by the Ethics committee of the IHU under number No. 2022-004.

Conflicts of Interest
Te authors declare that they have no conficts of interest.

Authors' Contributions
Fatou Samba DIOUF and Cheikh Ibrahima LO wrote the original draft, performed the formal analysis, and acquired the data. Mamadou BEYE, Mapenda GAYE, Mariema SARR, Babacar MBAYE, Stephane ALIBAR, and Gregory DUBOURG participated in data collection. Jean-Christophe LAGIER, Cheikh SOKHNA, Cheikh Ibrahima LO, and Stephane ALIBAR revised the original draft. Pierre-Edouard FOURNIER and Florence FENOLLAR conceptualized the study and edited the review. All authors read and approved the fnal version of the manuscript.