|
|
||||||||
1From the Hamilton Glaucoma Center and 2Ophthalmic Informatics Laboratory, Department of Ophthalmology, and the 3Institute for Neural Computation, University of California at San Diego, La Jolla, California; the 4Department of Ophthalmology, Hadassah University Hospital, Jerusalem, Israel; and the 5Computational Neurobiology Laboratory, Salk Institute, La Jolla, California.
| Abstract |
|---|
|
|
|---|
METHODS. Standard perimetry thresholds for 52 locations plus age from one eye of each of 156 patients with glaucomatous optic neuropathy (GON) and 189 eyes of healthy subjects were clustered with an unsupervised machine classifier, variational Bayesian mixture of factor analysis (vbMFA).
RESULTS. The vbMFA formed five distinct clusters. Cluster 5 held 186 of 189 fields from normal eyes plus 46 from eyes with GON. These fields were then judged within normal limits by several traditional methods. Each of the other four clusters could be described by the pattern of loss found within it. Cluster 1 (71 GON + 3 normal optic discs) included early, localized defects. A purely diffuse component was rare. Cluster 2 (26 GON) exhibited primarily deep superior hemifield defects, and cluster 3 (10 GON) held deep inferior hemifield defects only or in combination with lesser superior field defects. Cluster 4 (6 GON) showed deep defects in both hemifields. In other words, visual fields within a given cluster had similar patterns of loss that differed from the predominant pattern found in other clusters. The classifier separated the data based solely on the patterns of loss within the fields, without being guided by the diagnosis, placing 98.4% of the healthy eyes within the same cluster and spreading 70.5% of the eyes with GON across the other four clusters, in good agreement with a glaucoma expert and pattern standard deviation.
CONCLUSIONS. Without training-based diagnosis (unsupervised learning), the vbMFA identified four important patterns of field loss in eyes with GON in a manner consistent with years of clinical experience.
The problem with these supervised machine learning classifiers is that we do not know what patterns they are relying on to enable the accurate classification of the visual fields. In supervised learning, each participants data are labeled with a diagnosis (GON or no GON), and the classifiers learn to make the correct diagnosis. What they learn from the training visual fields1 and exactly how they use this knowledge to reach their conclusion for another set of visual field data5 is not known.
The present study addresses these questions by using an unsupervised machine learning method to cluster visual fields from standard automated perimetry. Unsupervised learning means that the classifier had no knowledge of which diagnostic group the visual fields came from or what patterns of loss are associated with a particular diagnosis. Unsupervised learning methods, as used in this study, learn the associations in the data on their own, yielding patterns instead of diagnoses.
Fields obtained with standard perimetry were chosen because it is the best understood of all the perimetric procedures, and the patterns of glaucomatous field loss associated with it are well documented. This study differs from the previous machine learning classification studies that separated visual fields into normal or glaucomatous, because the unsupervised clustering type of classifier chosen is not restricted to two outcomes and its rules for clustering are known. In this case, the classifier provided information about several different patterns of visual field loss present in the data. Patterns that, we assume, contribute in some way to the classification and may help with our understanding of the representation of glaucoma in visual fields.
We present the patterns of visual field loss discovered by an unsupervised machine learning classifier, the relationship of these patterns to those typically associated with glaucoma and early field loss, and a discussion of the potential of this particular unsupervised learning technique, variational Bayesian mixture of factor analysis (vbMFA), to further our understanding of glaucomatous visual field loss.
| Methods |
|---|
|
|
|---|
Exclusion criteria for both groups included unreliable visual field test results (defined as fixation loss, false negative, and false positive errors
33%), angle abnormalities on gonioscopy, any diseases other than glaucoma that could affect the visual fields, and medications known to affect visual field sensitivity. Subjects with a best-corrected visual acuity worse than 20/40, spherical equivalent outside ±5.0 D, and cylinder correction greater than 3.0 D were excluded. Poor-quality stereoscopic photographs of the optic nerve head was also an exclusionary criterion. A family history of glaucoma was not.
The classification of an eye as glaucomatous or normal was based on the appearance of the optic disc through consensus of masked evaluations of two independent graders of a stereoscopic disc photograph and not on visual field defects. Color simultaneous stereoscopic photographs were obtained (model TRC-SS camera; Topcon Instrument Corp. of American, Paramus, NJ) after maximum pupil dilation. These photographs were taken within 6 months of the visual field in the data set. Stereoscopic disc photographs were recorded for all patients with the exception of a subset of normal subjects (95 eyes) for whom photographs were not available. These normal subjects had no evidence of optic disc damage detected by dilated slit lamp indirect ophthalmoscopy with a hand-held 78-D lens. All photograph evaluations were accomplished using a stereoscopic viewer (Pentax Stereo Viewer II; Asahi Optical Co.-Pentax, Tokyo, Japan) illuminated with color-corrected fluorescent lighting. GON was defined by evidence of any of the following: excavation, neuroretinal rim thinning or notching, nerve fiber layer defects, or an asymmetry of the vertical cup-to-disc ratio
0.2 between the two eyes. Inconsistencies between the two graders evaluations were resolved through adjudication by a third evaluator. Because the purpose of the study was to identify patterns of visual field loss and not to assess the effects of risk factors on the presence or absence of a field loss, no specific range of intraocular pressures was required for the patient group.
Inclusion criteria for the normal category required that subjects have normal findings in dilated eye examinations, open angles, and no visible evidence of GON. Normal optic discs had a cup-to-disc ratio asymmetry
0.2, intact rims, and no hemorrhages, notches, excavation, or nerve fiber layer defects. Normal subjects had intraocular pressures
22 mm Hg with no history of elevated IOP.
If both of the eyes met the inclusion criteria, only one of the eyes was selected at random. The final selection totaled 345 eyes, including 189 normal eyes (mean age, 50.0 ± 6.7 years; SD) and 156 eyes with GON (62.3 ± 12.4 years). We did not want to limit the algorithm to working only with certain age-matched data, and so we included age in the model as the first step in reducing it as a variable (see the Results section).
Visual Field Testing
All subjects had automated full-threshold standard visual field testing with the Humphrey Visual Field Analyzer (Carl Zeiss Meditec) with program 24-2 or 30-2. The visual field locations in program 30-2 that are not in program 24-2 were deleted from the data and display.
The input to the classifier included the absolute sensitivity in decibels of each of the 52 test locations (not including two locations near the blind spot) in the 24-2 visual field and age. These were the same input data used in our initial study.1 The 52 thresholds were extracted from the perimeter by computer (Peridata, ver. 6.2; Peridata Software GmbH, Hürth, Germany). Age was included to account for the differences in mean age between the healthy and glaucomatous eyes and because age is an important correction factor used in visual field analysis.
Cluster Analysis with vbMFA
Background.
In this approach, we were interested in using machine learning classifiers that could adapt to or learn the intrinsic distribution or clustering of the data. In this way, they can identify classes of data that belong together due to their data distribution similarities. This principle is known as unsupervised learning, because the data (in our case visual fields) are not labeled with the diagnosis for the learning process.
Unsupervised learning refers to a type of algorithm or technique often used by the machine learning, adaptive filtering, and neural network research community. The main feature of unsupervised learning is that the algorithm seeks information from data without an operator instructing the algorithm about what the data are or where they originate. That is, there are no labels attached to the data points that could teach the algorithm whether it belongs to one certain class or cluster. In unsupervised learning, the algorithm is driven by an objective function (organization of the data) in which the algorithms internal parameters are adjusted so that the objective function is maximized or minimized. There are several different objective functions or models of learning used depending on the specific learning task. These include grouping data according to similarity measures, finding data directions with maximum variance or matching the likelihood of the data with a preconceived model.
The model we chose is an enhancement of the data-generative model, a model that has received significant attention and has been used in many applications.6 The data-generative model uses a probabilistic approach for explaining how the observed data were constructed, by learning a multidimensional probabilistic model that is described by a mean vector and a covariance matrix. It is called a data-generative model because the data can be viewed as being generated by sequentially adding components of the covariance matrix (columns) to the mean vector and finally reproducing the observed data.
We extended the basic generative model to a mixture of Gaussian densities model in which each component in the mixture explains a certain grouping, classification, or clustering of the data with a Gaussian distribution. However, when all are mixed together, the overall generative model can approximate or adapt any skewed or non-Gaussian distribution necessary to fit or represent the observed data. In addition, it allows for unsupervised classification of the data. More specifically, we used the mixture of factor analyzers (MFAs)7 model, which is a specific mixture model that can account for noisy measurements (such as visual field data) and tries to find parameters that explain variations in the data to capture important components in the visual field patterns. The vbMFA is a further extension of the basic MFA algorithm. The vbMFA is able to learn any number of components of the covariance matrix and any number of classes (or clusters). For example, it is not obvious that the data would fall into two clusters (abnormal and normal) or that there are only a few important components of the covariance matrix representative in each class. All the internal parameters (mean vectors, covariance matrix, weighting of clusters, and number of clusters) are inferred by the vbMFA algorithm, making it a powerful method to extract as much information as possible from the observed data alone without overfitting the model to the data. The vbMFA provides results with the least assumptions, interventions, or additional information from the user, so that it is not overfit to training data, as it might be in a supervised model.
Method.
The full details of these methods can be found in previous publications.6 7 8 The steps used with the vbMFA to divide the data into clusters and to characterize each by the cluster mean and variance can be simplified as follows:
The clustering (steps 25) was repeated with many different random initial partitions (approximately 200), each generated independently by the vbMFA without operator input. The results of these
200 different initial partitions were converged to 10 final solutions with varying numbers of clusters (26). vbMFA, which is based on Bayesian techniques, allows computation of marginal likelihood scores.9 These scores reflect the probability of getting the generated data set, assuming the density model and the parameters chosen are correct. Bayesian techniques also compare goodness of fit to the data and an assessment of the degree of overfitting.7 10 Applying these techniques resulted in two clustering solutions with equal scoring: one with four distinct clusters and one with five. The five-cluster solution is reported herein because it removed one GON eye from the cluster containing the normal subjects and it provided an additional visual field loss pattern for assessment. If we had chosen to report the four-cluster solution instead of the five-cluster solution, four of the six eyes in cluster 4 would have moved into cluster 1 and two would move into cluster 2. This advanced-loss group would not have been separated from the other patterns.
| Results |
|---|
|
|
|---|
|
|
The mean grayscale for each cluster may be masking the individuality of a range of patterns within each cluster. To determine this, we visually evaluated the visual fields comprising each cluster. In cluster 2 all fields had a significant superior visual field defect, with a few also having a milder defect in the inferior field (Fig. 3) . In cluster 3 the always-present inferior field defect was accompanied by a superior nasal step in 6 of the 10 fields (Fig. 4) . In cluster 4, five of the six fields showed deep defects in both hemifields, the sixth showed a deep superior central defect (Fig. 5) . It was in the cluster 1 group of fields that the initial conclusion, that this group represented early diffuse defects, might not have been well founded. This purely diffuse defect was present in a very small percentage of these eyes (Fig. 6) with most fields in this group exhibiting small (only a few test locations) circumscribed depressions in sensitivity. The location of this defect could be almost anywhere in the visual field, so that when the group was averaged together, it lead to the misinterpretation of a mild diffuse loss as indicative of the fields in cluster 1.
|
|
|
|
|
As a additional post hoc analysis, we asked the vbMFA to recluster just the fields from clusters 5 and 1, thinking that this might lead to a further breakdown for the 46 GON eyes clustered with the normal subjects.
Post Hoc Results.
Table 2 shows the breakdown for each of the five clusters along with the mean and SD of mean defect and pattern standard deviation for each group. Also shown is the percentage of fields in each group classified on the post hoc analysis as abnormal based on the Glaucoma Hemifield Test result, the pattern standard deviation, and the glaucoma expert. The visual fields from GON eyes with moderate to advanced loss were placed in clusters 2 to 4. The post hoc analysis showed that the fields from cluster 2 (n = 26), cluster 3 (n = 10), and cluster 4 (n = 6) were all considered abnormal by the Glaucoma Hemifield Test, the pattern standard deviation, and the expert with 100% specificity and 100% sensitivity, if the presence of GON is considered the standard for abnormality. Although the fields in clusters 2 to 4 were not difficult to classify as abnormal, each of these three clusters exhibited a pattern of visual field loss distinct from each of the other clusters. The nearly always present, small, localized defects found in fields within cluster 1 explain why the pattern standard deviation and Glaucoma Hemifield Test were also successful in identifying these fields as abnormal. Of the 71 eyes in cluster 1, 68 (96%) had GON.
|
value for agreement between the vbMFA with pattern standard deviation was 0.905, with the Glaucoma Hemifield Test was 0.913, and with the expert it was 0.873.
The post hoc evaluation using only the eyes from clusters 5 and 1 did not yield any additional clusters. Forty-five of the original 46 GON eyes remained with the normal subjects, suggesting that the original clustering was close to optimal for the vbMFA. When only cluster 5 was compared with cluster 1, the
statistic remained high, with 0.863 for pattern standard deviation, 0.875 for Glaucoma Hemifield Test, and 0.829 for the expert.
| Discussion |
|---|
|
|
|---|
Our present study differed from our previous work in the type of classifier (unsupervised versus supervised) and in the goal (identifying patterns of field loss associated with glaucoma or normal, instead of classifying the fields as belonging to an eye with glaucoma or to one that is normal).1 5 The classifier used in this study did not know the diagnosis associated with each visual field. This classifier learned, on its own, to separate the 345 visual fields into clusters. The discernible rules underlying the classification by the vbMFA in this report are that patterns of field loss considered similar are placed within the same cluster, and patterns that differ are in different clusters, each separable by its mean and covariance.
Several "typical" patterns for glaucomatous visual field loss have been described over many years, and examples of each are depicted in Figure 7 .13 14 15 16 17 18 19 20 21 22 These patterns result from the pathways traversed by the axons from the retinal ganglion cells to the optic disc. The axons coming from adjacent locations on the retina tend to form bundles of optic nerve fibers, each of which enters the optic disc at specific locations around its rim. This leads to a topographical relationship between damage at the optic disc and loss of ganglion cell function measured by the visual field test. Certain locations along the optic disc are more likely to be damaged by glaucoma. Hence, certain patterns of visual field loss arise as typical to glaucoma, such as nasal steps23 where the anatomy suggests it is very unlikely that a defect within a nerve fiber bundle would cross the horizontal meridian and analysis of fields supports this.24 This fact is very useful in the interpretation of visual fields. Possibly defective areas in one hemifield can be compared to comparable normal areas in the other hemifield to help determine the probability of a true defect.4 Others patterns include isolated paracentral defects,14 15 temporal wedges,25 altitudinal defects, and arcuate scotomas.16 Combinations are also possible. Figures 3 4, 5 to 6 show how the patterns in the vbMFA clusters 1 to 4 resemble classic glaucomatous patterns, identified over years of experience with standard visual fields.
|
The results also support the notion that a high percentage of standard visual fields remain normal in eyes showing optic nerve damage. This has been shown before with anatomic assessment of nerve fiber number in donor eyes26 and in studies comparing optic disc and nerve fiber layer with visual fields.27 28 29 30 31 In this study, fields in 46 eyes with GON were placed in the same cluster as those in healthy eyes. On post hoc analysis the majority of these fields were also designated as normal by pattern standard deviation (98%), the Glaucoma Hemifield Test (96%), and the expert (87%). Their mean deviations and pattern standard deviations were slightly poorer than those of the healthy eyes in this cluster, but were still well within what is considered the normal range (Table 2) . This may mean that some individuals within the normal range on global and local indices have vision loss that might be discernible if we could only go back in time and collect baseline data for the fields before vision was affected by glaucoma. This is consistent with findings from the Ocular Hypertension Treatment Study (OHTS), which found that pattern standard deviations from normal baseline visual fields were predictive of future visual field loss in both univariate and multivariate models.32
The analyses presented herein also have a direct bearing on an ongoing argument in the perimetric literature: whether the earliest visual field deficits associated with glaucoma are localized, diffuse, or some combination of the two.14 33 34 35 36 37 38 39 40 In the early-stage glaucoma group, cluster 1 (Fig. 2 , top left), a localized component was present in nearly all fields, with 90% (64/71) abnormal on pattern standard deviation and/or the Glaucoma Hemifield Test, both measures of local deviation in visual field sensitivity. This suggests that even with some diffuse component in the field, these eyes could have been identified as abnormal based solely on the localized pattern of visual field loss. This does not mean a diffuse component may not be useful for identifying particular subtypes of glaucomatous damage. However, we did not find a purely diffuse component leading to a separate cluster of fields for any subjects included in this study, even though these subjects covered a range of glaucomatous damage from early to moderately advanced.
The impressive performance of the vbMFA with standard perimetry, the current clinical standard, suggests that it may be very useful for identifying visual field patterns associated with glaucoma in the newer and more sensitive visual-functionspecific tests without the need for years of clinical experience. These tests include frequency-doubling technology perimetry (Carl Zeiss Meditec and Welch-Allyn, Skaneateles, NY) or short-wavelength automated perimetry (Carl Zeiss Meditec and Interzeag, Zurich, Switzerland). These tests target a subset of the retinal ganglion cells, which are sparse and have different temporal and spatial summation properties.41 For example, the most commonly used version of the frequency doubling test for glaucoma, program N-30, has only 18 locations in the peripheral visual field measured with targets 10° of visual angle. For these reasons, it is likely that the patterns associated with GON differs for this test and for short-wavelength perimetry, although this remains to be tested. Two more potential applications for this machine classifier would be to determine whether visual fields in different ocular diseases can be separated based on their typical patterns of loss, as has been suggested with subjective classifications, and to group visual field locations into sectors that are optimum for analysis of visual field data for glaucoma. Finally, this machine classifier may be of some use for training ophthalmologists to differentiate the typical patterns found in various eye diseases.
In summary, we found that certain features were common to all fields within a given cluster and the patterns of visual field loss found within each cluster were representative of patterns typical of glaucoma. vbMFA did this with the least assumptions, interventions or additional information from the user. This is encouraging because it shows that the classifier was somehow using information that has been shown over many years and numerous studies to be useful for classifying standard visual fields. This type of unsupervised learning algorithm, therefore, may be useful to classify fields from other visual function or optic nerve imaging techniques with which we do not have the same level of experience with the results.
| Footnotes |
|---|
Submitted for publication April 3, 2003; revised October 24, 2003, and March 21, 2004; accepted April 6, 2004.
Disclosure: P.A. Sample, Carl Zeiss Meditec (F); K. Chan, None; C. Boden, None; T.-W. Lee, None; E.Z. Blumenthal, None; R.N. Weinreb, Carl Zeiss Meditec (F); A. Bernd, None; J. Pascual, None; J. Hao, None; T. Sejnowski, None; M.H. Goldbaum, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Pamela A. Sample, Department of Ophthalmology, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0946; psample{at}eyecenter.ucsd.edu.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
C. Boden, K. Chan, P. A. Sample, J. Hao, T.-W. Lee, L. M. Zangwill, R. N. Weinreb, and M. H. Goldbaum Assessing Visual Field Clustering Schemes Using Machine Learning Classifiers in Standard Perimetry Invest. Ophthalmol. Vis. Sci., December 1, 2007; 48(12): 5582 - 5590. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-L. Huang, H.-Y. Chen, and J.-C. Lin Rule Extraction for Glaucoma Detection with Summary Data from StratusOCT Invest. Ophthalmol. Vis. Sci., January 1, 2007; 48(1): 244 - 250. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. H. Goldbaum, P. A. Sample, Z. Zhang, K. Chan, J. Hao, T.-W. Lee, C. Boden, C. Bowd, R. Bourne, L. Zangwill, et al. Using Unsupervised Learning with Independent Component Analysis to Identify Patterns of Glaucomatous Visual Field Defects Invest. Ophthalmol. Vis. Sci., October 1, 2005; 46(10): 3676 - 3683. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |