Proximity of Residence to Bodies of Water and Risk for West Nile Virus Infection: A Case-Control Study in Houston, Texas

West Nile virus (WNV), a mosquito-borne virus, has clinically affected hundreds of residents in the Houston metropolitan area since its introduction in 2002. This study aimed to determine if living within close proximity to a water source increases one's odds of infection with WNV. We identified 356 eligible WNV-positive cases and 356 controls using a population proportionate to size model with US Census Bureau data. We found that living near slow moving water sources was statistically associated with increased odds for human infection, while living near moderate moving water systems was associated with decreased odds for human infection. Living near bayous lined with vegetation as opposed to concrete also showed increased risk of infection. The habitats of slow moving and vegetation lined water sources appear to favor the mosquito-human transmission cycle. These methods can be used by resource-limited health entities to identify high-risk areas for arboviral disease surveillance and efficient mosquito management initiatives.


Introduction
Houston, Texas, is a metropolis in the southeastern United States with around four million residents [1]. West Nile virus (WNV) human cases were first reported locally in 2002 [2] and have since become endemic with human cases reported annually [3]. WNV is an arboviral disease from the Flaviviridae family whose main transmission cycle occurs between birds and mosquitoes; humans serve as an incidental host. In southeastern United States, Culex quinquefasciatus mosquitoes have been demonstrated as important vectors of WNV disease transmission [2,4].
In the United States, WNV transmission season traditionally occurs from spring to fall, with a peak in late summer [2]. In warm weather, mosquito larval development occurs within days [5,6] allowing for rapid reproduction of new mosquito populations. Mosquito larval development occurs in water bodies with each species having their own preferential type. Culex quinquefasciatus mosquitoes have a diverse larval habitat range, with high larval counts near human habitation [7,8]. Mosquito control efforts in Houston, Texas, target residential areas where either mosquito pools or dead birds are positive for WNV disease. Targeted areas are identified through random mosquito trapping and reporting of dead birds by residents. The ecological dynamic between vector, reservoir, and human habitats is critical to understand when examining risk for human WNV infection. While this vector's larval habitat preferences are known, no studies to date have examined direct associations between larval water habitats and WNV human disease transmission. This paper presents a novel method for examining disease clustering and its spatial association with water sources.

Methods
A case-control study design was used to determine the association between water sources and the risk of human infection with WNV.
2.1. Case Selection. Cases were defined as WNV-positive patients identified through local surveillance performed by the Houston Department of Health and Human Services (HDHHS), Harris County Public Health and Environmental Services (HCPHES), or the Gulf Coast Regional Blood Center (GCRBC). Local surveillance identified cases either by state mandatory reporting laws or by national blood donation testing guidelines that required laboratory confirmation of WNV human disease. Previous research has shown that the highest rates of WNV human seroprevalence were among those who reported a history of being outside during the hours of dusk and dawn [9]. These hours are concurrent with the peak activity time of Culex quinquefasciatus mosquitoes. Since most people are at home during dusk and dawn, it was resolved that cases are most likely exposed while at home. It was determined appropriate to use cases' home address at time of disease development as their location of mosquito exposure. Cases' home addresses were collected via case investigations performed by HDHHS, HCPHES, or GCRBC during 2002 and 2009. Exclusion criteria included evidence of nonlocally acquired disease as documented in the case investigation nonrecognition of address by MapMarker USA version 14 geocoding software, or home address falling outside the metropolitan's geographic area as determined by the geocoding software. After applying the exclusion criteria, we had 356 residential addresses from cases for final analysis.

Control
Selection. Controls were defined as selected block centroids generated from the United States Census Bureau decennial data (http://www.census.gov/). Controls were selected using two methods: a population proportionate to size sampling method which takes into account varying population densities within the metropolitan city and a random sampling method. There were three selection frames that were used to identify the final control. In descending order the frames were census tract level, block group, and finally block. The population proportionate to size sampling methods was used to select the initial frame: census tract level. It was understood that population distribution was uniform throughout the census tracts selected; therefore, we used a random selection method for the two additional frames: block group and block. Since the smallest defined census level is a block, the centroid of the block level was used as a surrogate for control households. Based on sample size calculations, a 1 : 1 case-control ratio was determined appropriate to satisfy statistical significance using discipline standards; therefore, 356 control addresses were selected for final analysis.

Data
Analysis. Spatial analysis of case and control residential distances' to local water body sources was performed using MapInfo v9.5.1 software. Shapefiles of water sources within the metropolitan's geographic parameters were provided in kind by Dr. Irina Cech, professor at the University of Texas Health Science Center at Houston. The shapefiles were based on United States Geological Survey water source definitions and data. Case and control residential coordinates were superimposed onto the water source shapefile. Water source labels were used to identify the particular water source, that is, Cedar Spring, Lou River, Brays Bayou, and so forth. The water source type was inferred from these labels. Using the software's measurement tool, we measured the distance from each case/control point, to the closest water source, excluding salt water sources since Culex quinquefasciatus mosquitoes do not utilize salt water sources as larval habitats [5]. For each case/control point we recorded the proximity to the closest water source, the type of the particular water source, and the name of the particular water source. We used STATA v11.0 (College Station, Texas) to run all statistical analyses. Chi-squared tables and logistic regression were used to analyze the significance of proximity to a water source between the two populations. Odds ratios, 95% confidence intervals (CIs), and P values were computed to analyze the significance of three factors: specified residential proximity to a water source; proximity to a particular water source type; proximity to a particular water source. Attack rates (number of WNV human cases over total number of households) were calculated for each census tract and mapped to spatially identify areas of high WNV human transmission. A Getis Ord hot spot analysis was performed using ESRI ArcGIS 10.0 to determine concentrations of high and low human disease clustering. The GetisOrd (Gi) hot spot analysis identifies clusters of higher and lower magnitude than would be randomly found and statistical output is in the form of a Z score known as a GiZ score. Areas of high clustering were indicated by a GiZ score of 1.96 or greater, and areas of low clustering were indicated by a GiZ score of −1.96 or less.

Results
On average, cases and controls resided the same proximity from water sources [x 0 (controls) = 892 meters, x 1 (cases) = 931 meters]. Using linear regression, we found no statistical association between residential proximity to water and odds for human WNV infection. However, when we binomiallycoded at varying distances ranging from 50 to 750 meters, we found a significant protective trend from distances ranging from 50 to 200 meters (Table 1). Living less than or equal to 200 meters from a water source (x 2 = 6.67, P < 0.01) was found to be protective from infection by a factor of 0.54.
Water source types were analyzed for association with odds for human WNV infection using odds ratios and chisquared tests, as seen in Table 2. We examined the six most common water source types. Two water source types were statistically associated with odds of human infection. Living near a creek increased one's odds of human infection by a factor of 1.37 (P = 0.09). Living near a spring decreased one's odds of human infection by a factor of 0.55 (P = 0.06). To further analyze these associations, we created two groupings based on slow moving and moderate moving water source types. A grouping of slow moving water bodies (creeks and gullies) was found to increase one's odds of human infection by a factor of 1.45 (P = 0.03). A grouping of narrow moderate moving water bodies (streams and rivers) was found to be protective against human infection by a factor of 0.50 (P = 0.02). Particular water sources were evaluated for association with odds for human WNV infection by odds ratios and chisquared tests, as seen in Table 3. The eleven most common specific water sources were analyzed. Two water body sources were significantly associated with increased odds for human infection. Living close to White Oak Bayou (P = 0.01) increased one's odds of human infection by a factor of 2.25. Additionally, living near Cypress Creek (P = 0.02) was also associated with increased odds of human infection by a factor of 2.54. Since Cypress Creek has several tributaries, an additional category was made that included all feeders for Cypress Creek. This group had the strongest significance of all water bodies (P < 0.01) with increased odds of human infection by a factor of 1.93. We also found that living close to Buffalo Bayou had increased odds of human infection by a factor of 1.59, which neared significance (P = 0.07).
Spatial distribution of WNV attack rates per 10,000 population by census tract illustrates that the highest risk area of transmission is in Northwest Houston as seen in Figure 1. Hot spot analysis confirmed that there were significant clusters of cases in Houston as seen in Figure 2. The areas of highest valued clusters were along the Northwest corner of Harris County, which overlaps Cypress Creek and its feeders. Figure 3 demonstrates the spatial relevance of the Houston area inlaid within Harris County, in relation to the state of Texas, and the United States of America.

Discussion
This is the first known case-control study to perform a spatial analysis of human WNV infection risk with regard to proximity of residences to water sources serving as surrogates for potential aquatic larval habitats. Overall, we found no direct association between proximity of residences to water sources and odds of WNV human infection in Houston, Texas. However, we found a significant trend of decreased risk of infection among people living within 200 meters of a water source. It is conjectured that areas closest to water sources are the primary target of mosquito control programs, therefore decreasing the risk of transmission at closer distances. We did find a pattern of increasing odds ratios as distance increased by 50-meter intervals, suggesting that mosquitoes in Houston have an expansive flight range that is important in the ecology of disease transmission. Culex quinquefasciatus mosquitoes are known to have an expansive flight range with recapture documented up to 1000 meters outside of their release site [10]. One speculation could be that the use of adulticides along water bodies could temporarily suspend adult mosquito activity allowing for higher mosquito activity occurring at greater distances. Although adulticides are the primary mosquito  control method used in this area, it is known that the use of adulticides is random and not associated with specific water bodies. Another speculation is that alternate breeding sites, specifically storm sewers, also play a role in disease transmission. In Houston, Culex quinquefasciatus are the dominate mosquito species collected from storm sewers, and storm sewers have been demonstrated as a preferential site for breeding, larval development, and daytime resting [11]. Unfortunately, we did not have access to sewer blueprints of the metropolitan area to further investigate this theory. When analyzing residential proximity to water source types, we did find a strongly significant association for risk of human infection among residences near creeks and gullies, specifically Cypress Creek. It is believed that the slower movement of water and dense vegetation is preferential for the local transmitting Culex vector species. Due to low numbers of cases per creek, no additional specific creek sources were included in the final analysis. Cypress Creek is a large water source that flows throughout the northwest corner of the metropolitan Houston area.  where Cypress Creek flows. We feel the true association of infection is with the particular water source Cypress Creek. Additional studies should perform mosquito pool testing around Cypress Creek and additional creeks and gullies throughout the metropolitan area to examine WNV field infection rates of mosquitoes in efforts to further validate our findings.
When analyzing residential proximity to water source types, we did find a strong protective association of residences closest to streams; however, no particular stream water sources were identified as being associated with infection. To further investigate these findings, we created a grouping of moderate moving water sources which included streams and rivers. This grouping had the strongest significance of protection from human WNV infection. Additionally, no particular river water sources were identified as being associated with infection. These findings are evidence that residences in closest proximity to moderate moving water sources are significantly protected against WNV human infection.
Houston is prone to flooding, and as part of the flood mitigation program, the city has an extensive network of bayous, which are man-made canals [2]. The surrounding habitats of bayous in Houston are varied with some being cast with concrete walls and others edged with grass, shrubs, and other vegetation. Overall, we did not find an association between the living near bayous and increased odds of infection. However, we did find that White Oak Bayou and Buffalo Bayou were significantly associated with increased odds of infection. These specific bayous are lined with extensive vegetation preferential to mosquito habitats. This is in sharp contradiction to the bayous lined with concrete, such as Brays Bayou, where the data suggested decreased odds of infection. We cogitate that the type of bayou lining and habitat dictates WNV transmission. Future research should incorporate bayou linings and their individual risk for local human habitants.
There are a few limitations of this study that are worth noting. One limitation was the potential for selection bias due to the inability to verify disease status of controls by serum antibody testing. Since WNV is a mandatoryreportable disease in the state of Texas, anyone who tested positive should have been reported to the local health department. The risk of misclassification of controls is possible if a resident at the address never developed symptoms or had mild disease that went undiagnosed as WNV. However, this risk is presumed minimal since current estimates of seroprevalence in Houston are relatively low [12]. Due to financial constraints, we were unable to obtain a serum sample from controls to verify disease status. Lastly, we were unable to test for potential confounders related to human-mosquito transmission, such as socioeconomic status, gender, rainfall, or other seasonal environmental factors. Complete records for these potential confounders were unavailable. Despite the inability to control for these potential confounds, we believe the results are sound considering people do not choose their residence location based on human-mosquito transmission hotspots.
The main strength of the study is the ability to determine high risk areas of WNV transmission around the Houston metropolitan area using minimal resources. The methods we used are simple to perform and could be of benefit to health authorities in other jurisdictions to identify areas with increased risk for WNV transmission. In resourcescarce public health departments, this inexpensive method could greatly increase the effectiveness of mosquito control programs. Our case-control selection methods would be simple to replicate. Since WNV is a reportable disease nationally, case investigations are performed for all patients that test positive. From these case investigations, health departments should have the addresses of the cases in their jurisdiction. Control selection would be easy to execute as census data is readily available from the US Census Bureau website that is updated both annually and decennially. 6 Journal of Biomedicine and Biotechnology In conclusion, we found that living near slow moving water bodies, such as creeks and gullies, or bayous with heavy vegetation increased one's odds of infection with WNV. Most importantly, we identified Cypress Creek as an area of high WNV human infection that should be targeted by future mosquito control efforts. With the recent literature suggestive of increased ranges of arboviral vectors and areas of transmission, this method of spatial analysis could benefit other health authorities in areas experiencing active WNV transmission who need predictive models of exposure risk for targeted education and control efforts for disease prevention.