|
|
||||||||
From the Visual Function Laboratory and Hamilton Glaucoma Center, Department of Ophthalmology, University of California at San Diego, La Jolla, California.
| Abstract |
|---|
|
|
|---|
METHODS. One hundred eleven eyes with glaucomatous optic neuropathy (GON), 31 with progressive optic neuropathy (PGON) 53 with ocular hypertension, and 51 with no disease were included (N = 246). Visual field results were not used to classify the eyes. Short-wavelength automated perimetry (SWAP), frequency-doubling technology perimetry (FDT), high-pass resolution perimetry (HPRP), and standard automated perimetry (SAP) were performed. Receiver operating characteristic (ROC) curves were used to compute the areas under the curves (AUC) and sensitivity levels at given specificities for a variety of abnormality criteria. The agreement among tests for abnormality, location, and extent of visual field deficit were assessed.
RESULTS. AUC analysis: When the normal group was compared with the GON group, the FDT pattern SD (PSD) area was larger than the HPRP PSD (P = 0.020), and the FDT area of total deviation (TD) <5% was larger than the HPRP mean deviation (MD; P = 0.004). When the normal group was compared with the PGON group, the FDT area of pattern deviation (PD) <5% was larger than the SWAP PSD (P = 0.020). A difference from previous work was that AUCs for PSD or the best SAP were not significantly poorer than those in the function-specific tests. At set specificities, FDT yielded higher sensitivities than all other tests for all parameters. The agreement among tests for abnormality was fair to moderate (
= 2470.563). When loss was present on more than one test, the quadrant of the visual field affected was the same in 95% (79/83) of eyes. The number of eyes identified and number of abnormal quadrants increased across groups with increasing certainty of glaucoma.
CONCLUSIONS. At equal specificity, no single perimetric test was always affected, whereas others remained normal. Several parameters at suggested criterion values provided good sensitivity and specificity. FDT showed the highest sensitivity overall, with SAP performing better than in prior reports. Of note, the same area of the retina was identified as damaged in all tests.
Initially, it was hypothesized that the blueyellow ganglion cells tested by SWAP were parvocellular.2 3 It was later learned from Dacey4 and others5 6 that the blueyellow cells are small bistratified ganglion cells that project their axons to the koniocellular (interlaminar) layers of the lateral geniculate nucleus (LGN) of the thalamus, rather than to the parvocellular layers.7 Frequency-doubling technology perimetry (FDT)8 9 and various forms of motion perimetry10 11 12 13 14 attempt to target the magnocellular (also known as parasol) ganglion cells that project to the magnocellular layers of LGN, and high-pass resolution perimetry (HPRP)15 16 is thought to assess the parvocellular (also known as midget) ganglion cells that project to the parvocellular layers of LGN. Recent reviews detail the evidence supporting anatomic and functional segregation of three primary pathways (parvocellular, magnocellular, and koniocellular) through the LGN.17 18 Anatomic projections to cortex and functional preferences within cortical layers are much less segregated.18 19 In addition, the relationship of visual function to underlying visual pathways is based primarily on electrophysiology in healthy primates and on lesion studies,17 and so some caveats apply in the application of visual-functionspecific perimetry to ganglion cell assessment (see the Discussion section).
Several studies comparing one visual-functionspecific test to SAP have shown that function-specific tests are superior to SAP for early detection of vision loss associated with glaucoma.20 21 22 23 There are some problems with these studies (see the Discussion section). Very few have compared more than one visual-functionspecific test in the same patient population.24 25 26 We first made such a comparison 6 years ago and evaluated SAP, SWAP, motion automated perimetry (MAP), and FDT in 71 eyes with glaucomatous optic neuropathy, 37 ocular hypertensive eyes, and 28 age-matched normal control eyes.24
For detection of functional loss, it was found that (1) standard visual field testing was not optimal, missing 54% of eyes with glaucomatous optic neuropathy (GON); (2) a combination of two or more tests improved detection of functional loss; (3) in an individual, the same retinal location was damaged, regardless of the visual function tested; and (4) SWAP, MAP, and FDT showed promise as early indicators of function loss in glaucoma.
One limitation of this previous study was that the normal eyes had normal SAP visual fields and we had to adjust specificities based on other published reports. This limitation does not exist in the present study. A second limitation was the testing of only the koniocellular and magnocellular pathways and the lack of a function-specific test preferred by the parvocellular pathway. We have since obtained sufficient data with HPRP, and in this current study, we to compared it to SWAP, FDT, and SAP in the same individuals.
| Methods |
|---|
|
|
|---|
Inclusion Criteria for DIGS.
Participants underwent complete ophthalmic examinations including slit lamp biomicroscopy, intraocular pressure (IOP) measurement, and dilated stereoscopic fundus examination. Simultaneous stereoscopic photographs were obtained for all participants and had to be of adequate quality for the subject to be included. All participants had open angles, a best corrected acuity of 20/40 or better, a spherical refraction within and inclusive of ±5.0 D (transposition allowed), and cylinder correction within ±3.0 D. A family history of glaucoma was allowed.
Exclusion Criteria for DIGS.
Normal and ocular hypertensive participants were excluded if they had a history of intraocular surgery (except for uncomplicated cataract surgery). We also excluded all participants with nonglaucomatous secondary causes of elevated IOP (e.g., iridocyclitis, trauma), other intraocular eye disease, other diseases affecting the visual field (e.g., pituitary lesions, demyelinating diseases, HIV+ or AIDS, or diabetic retinopathy), with medications known to affect visual field sensitivity, or with problems other than glaucoma affecting color vision (as assessed by the Farnsworth D-15 color vision test).
DIGS participants are not chosen based on any criteria other than those specified herein. Most of the patient participants came to the study through the Glaucoma Service at the UCSD Department of Ophthalmology. Normal subjects were recruited from the general population through advertisement as well as from the staff and employees of the University of California San Diego.
The Current Study.
All eligible DIGS participants with reliable visual field results on all four tests defined as
25% false positives, false negatives, and fixation losses were included. All tests were performed in randomized order and completed within a 3-month period. All possible orders of the four tests were determined, and participants were assigned to an order sequentially on arrival for study. One eye was selected randomly from each subject, except in cases in which only one eye met study criteria, and then that eye was included.
Participants.
Two hundred forty-six eyes from 246 participants were evaluated on all three visual-functionspecific perimetry tests as well as on SAP. Criteria for classification are detailed later.
Diagnostic categories were based on simultaneous stereoscopic color photographs (TRC-SS camera; Topcon America Corp., Paramus, NJ), obtained after maximal pupil dilation. All photograph evaluations were taken using a stereoscopic viewer (Stereo Viewer II; Asahi Pentax, Golden, CO) illuminated with color-corrected fluorescent lighting. Two trained and masked graders from the UCSD Optic Disc Reading Center assessed each photograph independently. Inconsistencies between the two graders evaluations were resolved by consensus or through adjudication by a third evaluator. GON was identified by evidence of any of the following: excavation, neuroretinal rim thinning or notching, nerve fiber layer defects, or asymmetry of the vertical cup-to-disc ratio
0.2 between the two eyes.
To identify progressive glaucomatous optic neuropathy, two trained graders independently compared the first and last photographs in each participants series. Graders were masked to all other participant information, photograph date and test results. Photographs were graded A equal to B, A worse than B, or A better than B. Inconsistencies between the two graders evaluations were resolved through adjudication by a third evaluator for each pair of photographs. After consensus was reached, the temporal sequence of the photographs was unmasked. Progression was defined based on evidence of increasing excavation, rim thinning or enlarging of notches or nerve fiber layer defects in the later photograph. Changes in rim color and the presence of disc hemorrhage or progressive parapapillary atrophy were not sufficient for characterization of progression. The time frame for progression on photographs ranged from 2 to 12 years.
| Classification of Study Groups |
|---|
|
|
|---|
Normal Control Subjects.
Normal eyes had IOP <23 mm Hg with no history of increased IOP, normal findings in an ocular examination, and normal optic discs according to the criteria.
Ocular Hypertension.
Ocular hypertensive eyes had normal optic discs and highest IOPs
23 mm Hg on at least two separate occasions.
Glaucomatous Optic Neuropathy.
These participants had evidence of glaucomatous-appearing optic discs on stereophotographs with or without elevated IOP.
Progressive Glaucomatous Optic Neuropathy.
These participants showed progressive GON on evaluation of serial photographs sometime during their follow-up in DIGS before the visual field dates used in this analysis.
| Psychophysical Tests of Function |
|---|
|
|
|---|
Standard Achromatic Automated Perimetry.
In SAP, a small (0.47°) 200-ms flash of white light is presented as the target on a dim background (10 cd/m2 or 31.5 asb). The target was randomly presented to 54 locations within the central 24° of visual field using the Humphrey Visual Field Analyzer II (Carl Zeiss Meditec, Dublin, CA), program 24-2, software version 3.4.7, and the SITA testing algorithm. The two locations just above and below the blind spot were not included in the analysis. Figure 1 shows all target locations in all four tests. Also shown are the abnormal locations for each test in the same example eye with GON.
|
Frequency-Doubling Technology Perimetry.
FDT was measured with the frequency-doubling visual field instrument (Carl Zeiss Meditec, Dublin, CA) using Welch-Allyn technology (Skaneateles Falls, NY) and the N-30 program, software version 3.00.1. The targets consist of a 0.25-cyc/deg sinusoidal grating that undergoes a 25-Hz counterphase flicker. The test involves a modified binary search staircase threshold procedure with stimuli presented for a maximum of 720 ms. FDT measures the contrast needed for detection of the stimulus. Each grating target is a square subtending approximately 10° in diameter. Targets are presented in one of 18 test areas located within the central 20° radius of the visual field temporally and 30° nasally.
High-Pass Resolution Perimetry.
In HPRP, ring-shaped vanishing optotypes which vary in size are used to assess resolution ability in the central 30° of the visual field.15 The optotypes used in HPRP are high-spatial-frequency filtered targets where the inner and outer portions of the rings are darker (15 cd/m2), whereas the center portion of the rings is brighter (25 cd/m2). The space-averaged luminance of the entire ring is equal to the luminance of the background (20 cd/m2). Therefore, when the edges of the ring cannot be resolved, the rings blend into the background, that is, the targets are either resolved (seen) or they are invisible. The target consists of rings of different sizes, presented at 50 locations within the central 30°. No stimuli are presented within the central 5° of the visual field (Fig. 1) . The subject responds when the target is large enough to resolve. We measured HPRP with the Ophthimus High-Pass Resolution Perimeter, version 2.0, software version 2.51 (HighTech Vision, Malmö, Sweden).
| Statistical Analyses |
|---|
|
|
|---|
set at 0.05. The
2 analysis was used to assess agreement of categorical variables. Visual field parameter data were derived by comparison to the manufacturers internal normative database for SAP, FDT, and HPRP and to our laboratorys normative database for SWAP (n = 345, one eye per subject), which was developed in collaboration with Chris Johnson for the SWAP ancillary arm of the Ocular Hypertension Treatment Study after it was suspected that the manufacturers internal normative database for SWAP-full threshold testing was inaccurate. Soliman et al.28 have confirmed the inaccuracy. The inclusionexclusion criteria were the same as for our normal control group, but none of our control subjects were included in the SWAP normative database.
Abnormality.
Receiver operator characteristic (ROC) curves were generated for the visual field parameters shown in Table 1 , by comparing normal and patient eyes with GON and then again comparing normal and patient eyes with PGON. The area under these curves (AUCs) were compared statistically for the best-performing parameter from each test and for the pattern SD (PSD), which performed well in all tests, using the method of DeLong et al.29 Theirs is a nonparametric method used to compare correlated AUC (that is, AUC for different tests obtained in the same group of individuals). It is based on a Mann-Whitney statistic and has been widely used for this purpose in the medical literature.27 30
|
Agreement among Tests.
We evaluated the agreement among the tests to determine the percentage of eyes that were classified the same, by the various combinations of tests, based on the pattern deviation (PD) <1% (or deep dent for HPRP) criterion number of points that yielded 80% specificity. The developer of HPRP, Lars Frisen supplied the following explanation of dents (personal communication, 2005). "Dents for HPRP were identified by sets of linear regressions. One set ran over test locations that could be viewed as members of one and the same field radius. The other set ran over test points that normally share much the same value, i.e., they normally belong to one and the same isopter. Normally, test results should rise monotonically along radii, and should stay constant along isopters. The regressions helped identify test locations that deviated by 1-l.9 dB from expected (= shallow dent), or by 2 dB or more (= deep dent). Hence, the analysis essentially aimed to highlight deviations from the normal shape of the threshold surface."
A
2 analysis was used to determine the statistical significance of this agreement in pairs of tests. Kappa statistics were used and significance assessed by using the method of Landis and Koch.31
Location of Field Defect.
The stimuli and test locations differ among the four tests. For this reason, we assessed the agreement in the location of the defect by quadrant. We first determined whether the field was abnormal based on the criterion number of PD points at 1% determined from the ROC analysis for 80% specificity (shaded criteria in Table 1 ). Once it was determined that a field had a sufficient number of PD locations at
1% to be called abnormal, a quadrant was called abnormal if it contained any one of these PD points (deep dents for HPRP). HPRP presents stimuli centered on the vertical and horizontal meridians. When one of these points (which overlap two quadrants) was abnormal, we attributed it to the most defective quadrant.
Extent of Defect.
The number of quadrants affected on each perimetric test gave the extent of the defect. Overall significant differences among all tests were further assessed with paired comparisons using the Tukey HSD post hoc test.
| Results |
|---|
|
|
|---|
|
|
In addition to the AUC, the shape of the ROC curve is important. There is a tradeoff between sensitivity and specificity. For this reason, we chose two different specificities for analysis. The ROC curve results were used to determine criterion values for various parameters at specificities near 80% and 90%. Using this step-wise approach proceeding from the ROC analysis to determining the desired specificity can assist in developing comparable criteria for abnormality for each of the four tests. The PGON results are shown in Table 1 . Sensitivities were slightly lower in the GON group, but again the relationship among the test results was comparable to that in the PGON group. FDT showed the highest sensitivity in all cases. With this approach, a possible advantage to FDT testing for detection of vision loss in participants with PGON or GON emerges at both specificities and for all parameters. Table 4 shows the percentage of OHT eyes classified as abnormal when using these same criterion values.
|
|
statistics, proportion agreement, and strength of agreement for these relationships.31 Agreement is fair to moderate with better agreement found for some pairs of tests (SWAP and FDT; SWAP and HPRP) when using the PD plot criteria compared with the PSD.
|
|
|
Extent of Defect.
Table 7 shows the number of abnormal quadrants for each participant group and each test based on the PD criteria at 80% specificity. Within diagnostic groups, a significant difference in the normal group was attributed to FDT compared with each of the other three tests. Within a perimetric test the differences were significant (P < 0.0001) and were attributable to differences between normal subjects and the GON and PGON groups for all tests and between the OHT and the GON and PGON groups for all tests. No differences were found between normal and OHT or between GON and PGON.
|
| Discussion |
|---|
|
|
|---|
There were several important improvements in this study compared with those in our original study.24 First, visual fields were not used to classify any of the study participants. This avoids bias in two ways. For example, if SAP is used as a gold standard along with GON to define normal and glaucoma, no other test could ever exceed it in sensitivity and specificity by definition. In our original study, normal participants were required to have normal SAP examinations causing difficulties in setting all tests for equal specificities. We had to use other published data to set criterion values. Not requiring SAP defects to classify participants in the present study allowed SAP deficits to precede those of the function specific tests, or vice versa, without constraint.
The second improvement in this study was the inclusion of HPRP, a possible parvocellular cell test. Although magnocellular and parvocellular cell functions overlap greatly,33 there is a range for both cell types where one is significantly more likely to mediate detection than the other if the parameters of the visual stimulus are properly designed.18 The high-resolution nature of HPRP is most likely handled by the parvocellular cells, especially in the central visual field. The amount of parvocellular cell damage needed before magnocellular cells can take over detection of the HPRP stimulus (the amount of parvocellular cell isolation), however, is unknown.
The third improvement was the inclusion of a group of eyes with progressive GON. Glaucoma is defined as a progressive optic neuropathy. By design we did not use visual field loss to help verify the presence of glaucoma in eyes with recently identified or long-term stable GON. Therefore, evidence of PGON provided a subgroup that most surely had glaucoma.27 The results of this group, even though few in number and therefore lower in power, were consistent with those in the larger GON group, thereby strengthening the conclusions drawn.
When new tests are developed, it is problematic to determine what criteria are best for separating healthy from diseased eyes. A variety of methods have been used. For example, the modified Statpac criteria (Carl Zeiss Meditec, Inc.) for SWAP are based on those developed for SAP.34 Often, different criteria are developed and used by different investigators. An example of this is the number of different criteria for evaluating abnormality on FDT (reviewed in Muskens et al.35 ). A strength of the present study was the use of ROC analysis to set criteria for each perimetric procedure and to equate the tests for specificity. In clinical practice, each instrument uses its own internal normative database. It seemed most clinically relevant to use these databases and to assess specificities for comparisons among tests using an independent group of healthy eyes.
A limitation to our study was that we did not have confirming visual field results for all test types, and so this criterion was not included. If a test is overcalling abnormality, it suggests a higher sensitivity for that test than is the actual case. In this real false-positive situation, a repeat would most likely not confirm the abnormal result. Therefore, requiring two abnormal test results in a row should improve specificity and give a truer measure of the tests sensitivity to glaucomatous loss. In our study, this could influence the results if one test is more likely to overcall abnormality than the others. For example, the percentage of OHT eyes classified as abnormal was higher for FDT (45%) at 80% specificity than for the other tests. This percentage is higher than the number that would be expected to convert to glaucoma, a result similar to our earlier finding.24 At 90% specificity, the result (26%) is more reasonable. There has been a case report showing that FDT results fluctuate with changes in IOP,36 but none of our participants had the high IOP noted in that study. In FDTs favor, Spry et al.26 found little difference in confirmation of field results for FDT (47%) compared with SWAP (42%), and the small difference seen was in the direction opposite that expected if FDT was simply overcalling abnormality in these OHT eyes. To address partially the lack of confirmation on a specific test type, results from two different test types can be used to verify the presence of glaucomatous vision loss. In this situation we found that only a small number of normal eyes were identified as abnormal when at least two tests were required (Table 5) .
Our study also does not address progressive visual field loss. Additional study is needed to determine the relative ability of each test type to follow the disease once visual field loss is already present.
The results of the present study point out a difference from earlier findings,24 where we found that SWAP-FT and FDT-N30 outperformed SAP-FT. In the present study, this difference was not found. It may be that the introduction of the SITA thresholding algorithm for SAP, with its tighter confidence limits, has improved SAPs ability to detect abnormality relative to the longer and more variable SWAP full threshold. It remains to be seen if the newly released SITA version of SWAP will improve its performance in a similar manner. This result also highlights one of the ongoing problems in clinical research. The technologies are changing so rapidly that it is difficult to obtain data and report results on a device before it has moved on to the next generation with improvements. For glaucoma, this problem exists both with measures of visual function and with assessment of the optic nerve and retinal nerve fiber layer. Ongoing re-evaluation with patients transitioning from one version or test to another is critical to giving a complete picture of how these new instruments or procedures will best alter clinical practice.
An additional reason for the improvement in SAP performance relative to SWAP may be that none of the participants in this study were selected based on their SAP visual field results. As we have mentioned, our previous study included only normal control subjects with normal SAP fields, and specificity criterion values were derived from the literature. SAP was not used for this in the present study and the same normal subjects were tested on all four tests and used to set the specificity criterion values, removing any possible bias due to different normal subjects used for the specificity of each test.
This study found that patients could show deficits on any one of the four perimetric procedures, while remaining normal on some or all the other three. In addition, the results were similar to those in earlier work in that agreement on abnormality among tests was only fair to moderate. These two findings together are consistent with the conclusion that no one ganglion cell subtype is always affected first in glaucoma.24 26 37 All optic nerve fibers are damaged, but tests that favor detection of a stimulus by one visual pathway or processing subsystem reduce the ability of the visual system to use other pathways to compensate for the damaged retinal ganglion cell type under test.2 3 These findings are consistent with those from an elegant study in primate animal models of glaucoma. Harwerth et al.38 combined psychophysics, electrophysiology, anatomy, and histochemistry to show that glaucomatous atrophy causes a nonselective reduction of metabolism of magnocellular and parvocellular neurons in the afferent visual pathway. Yucel et al.39 also find no evidence for selective cell loss in glaucoma within the magnocellular, parvocellular, or koniocellular layers of the LGN.39 However, not all eyes with primary open-angle glaucoma or those at risk of the disease are affected in the same way. Perhaps blueyellow ganglion cell function is reduced first in one individual, parvocellular in another individual, and magnocellular ganglion cell function in another.24 40 The inclusion of HPRP did not alter this conclusion. The study also replicated our earlier finding that when two or more test results are abnormal, the same quadrant of the visual field is affected on all. This finding is very important for using two different tests to verify the presence of abnormality and for targeting follow-up by careful monitoring of specific areas of the visual field.
In summary, no one test type was always affected in patients with GON or PGON, whereas the other test types remained normal. Several parameters were identified that provided good sensitivity at set specificities. The FDT N-30 test showed the highest sensitivity for all parameters. SAP performance was equal to or slightly better than SWAP and not significantly different from FDT, a finding that differs from those in previous studies. Consistent with previous findings, the same quadrant of the retina is damaged for all affected tests. A combination of test types may be most efficient in identifying early loss and confirming the area of the retina affected by glaucoma.
| Footnotes |
|---|
Submitted for publication December 5, 2005; revised March 17 and April 10, 2006; accepted June 9, 2006.
Disclosure: P.A. Sample, Carl Zeiss Meditec, Inc., Welch-Allyn, and Haag-Streit (F); F.A. Medeiros, Carl Zeiss Meditec, Inc. (F); L. Racette, None; J.P. Pascual, None; C. Boden, None; L.M. Zangwill, Carl Zeiss Meditec, Inc. (F, R); C. Bowd, None; R.N. Weinreb, Carl Zeiss Meditec, Inc. (F, R)
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Pamela A. Sample, Department of Ophthalmology, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0946; psample{at}glaucoma.ucsd.edu.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. Racette, F. A. Medeiros, L. M. Zangwill, D. Ng, R. N. Weinreb, and P. A. Sample Diagnostic Accuracy of the Matrix 24-2 and Original N-30 Frequency-Doubling Technology Tests Compared with Standard Automated Perimetry Invest. Ophthalmol. Vis. Sci., March 1, 2008; 49(3): 954 - 960. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. McKendrick, G. P. Sampson, M. J. Walland, and D. R. Badcock Contrast Sensitivity Changes Due to Glaucoma and Normal Aging: Low-Spatial-Frequency Losses in Both Magnocellular and Parvocellular Pathways Invest. Ophthalmol. Vis. Sci., May 1, 2007; 48(5): 2115 - 2122. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |