Journal of Deaf Studies and Deaf Education Advance Access originally published online on May 2, 2007
The Journal of Deaf Studies and Deaf Education 2007 12(3):350-361; doi:10.1093/deafed/enm012
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Perception of Suprasegmental Features of Speech by Children With Cochlear Implants and Children With Hearing Aids
Tel-Aviv University, Israel
Correspondence should be addressed to Tova Most, School of Education, Tel Aviv University, Israel 69978 (e-mail: tovam{at}post.tau.ac.il).
Received October 26, 2006; revised March 1, 2007; accepted March 7, 2007
This study assessed perception of suprasegmental features of speech by 30 prelingual children with sensorineural hearing loss. Ten children had cochlear implants (CIs), and 20 children wore hearing aids (HA): 10 with severe hearing loss and 10 with profound hearing loss. Perception of intonation, syllable stress, word emphasis, and word pattern was assessed. Results revealed that the two HA groups significantly outperformed the CI group in perceiving both intonation and stress. Within each group, word pattern was perceived best, and then intonation and emphasis, with syllable stress perceived poorest. No significant correlation emerged between age at implantation and perception of the various suprasegmental features, possibly due to participants' relatively late age at implantation. Results indicated that CI use did not show an advantage over HA use in the perception of suprasegmental features of speech. Future research should continue to explore variables that might improve this perception.
| Introduction |
|---|
|
|
|---|
Speech comprised segmental features relating to the characteristics of the individual phonemes (vowels and consonants) and suprasegmental features such as intonation, stress, and emphasis, which are carried along the syllable, the utterance, or the sentence (Snow, 2001
Perception of suprasegmental features is accomplished through perception of the timeenergy envelope of the speech signal and/or its fundamental frequency information. Listeners perceive a sentence's intonation mainly through the changes in fundamental frequency along the sentence. Along these changes, time and intensity aspects also change, serving as acoustic cues in the perception process (Cohen-Licht & Most, 2000
; Grant, 1987
; Most, 1985
). The perception of a sentence's syllable stress or word emphasis is also accomplished through the perception of the timeenergy envelope of the speech signal and/or its fundamental frequency information. The stressed syllable (as in rebel vs. re'bel) or the emphasized word (as in Tom wants to drink vs. Tom wants to drink) is characterized by a higher fundamental frequency, longer duration, and greater amplitude in comparison to the same but unstressed syllable or unemphasized word (Borden et al., 1994
). By using synthetic stimuli and controlling the changes of each of these parameters, researchers found that each of these comprised an effective perception cue (Fry, 1955
; Isenberg & Gay, 1978
).
Many individuals with severe or profound hearing loss have residual hearing in the lower frequency region. Previous research reported that, with hearing aids (HA), some of them can detect and discriminate the timeenergy envelope of the speech signal, and others can also detect and discriminate fundamental frequency information (Engen, Engen, Clarkson, & Blackwell, 1983
; Erber, 1979
). Inasmuch as the suprasegmental features are cued by duration, amplitude, and fundamental frequency information, various researchers have claimed that such features should be available to many individuals with hearing loss (e.g., Boothroyd, 1984
). Some previous research reported success in perceiving intonation (Most & Frank, 1991
) and stress (Most, 2000
) when using tasks with empirically evidenced validity and interventions that specifically exposed these auditory cues to the children. Yet, other researchers reported difficulties in perceiving stress (Jackson & Kelly, 1986
) or intonation (Stark & Levitt, 1974
). Moore (1998)
claimed that many individuals with hearing loss have deficits in frequency and temporal resolution. Inasmuch as perception of suprasegmental features relies on these abilities, many individuals with severe or profound hearing loss experience difficulties in perceiving them.
For individuals with sensorineural hearing loss, the perception of intonation is considered as the most difficult of the suprasegmental features because it relies mainly on the perception of changes in fundamental frequency, which are not accessible to many individuals. Perception of the other features, on the other hand, relies mainly on the timeenergy envelope, which is more accessible (Borden et al., 1994
). Children with severe and profound hearing loss had more difficulties in the perception of yes/no questions, which are cued by a rising intonation contour, than in the perception of statements, which are cued by a falling intonation contour (Most & Frank, 1991
). Also in an earlier study, Most (1985)
reported successful perception of intonation even in children with profound hearing loss, provided they had received early and intensive intervention.
With regards to the perception of stress, some researchers (Jackson & Kelly, 1986
; Osberger & McGarr, 1982
) reported that individuals with hearing loss often experience difficulties in perceiving the stress pattern. Rubin-Spitz and McGarr (1986)
found that participants with severe and profound hearing loss succeeded in perceiving the stress pattern only in those stimuli where the stressed and unstressed syllables differed in amplitude. In contrast, Most (2000)
reported an 80.3% success rate in perceiving syllable stress among Hebrew speaking participants with severe and profound hearing loss. She claimed that inasmuch as the perception of syllable stress relies mainly on the timeenergy envelope of the speech signal, this feature should be relatively easy to perceive. Most attributed the participants' success in perceiving the stress pattern to the utilization of an appropriate and natural task, comprising meaningful sentences containing minimal pairs of target words that differed in their stress pattern.
The perception of word pattern (the number of syllables in a word) is considered to be very basic and accessible to most individuals, even those with profound hearing loss (Boothroyd, 1984
; Geers & Moog, 1989
). Using the Hebrew version of the Early Speech Perception battery (Geers & Moog, 1989
), Kishon-Rabin et al. (2000)
showed that children with profound hearing loss (poorer than 100 dBHL) had difficulties perceiving one-syllable words; however, they were able to perceive the word pattern.
In summary, studies on the perception of suprasegmental features by children using HA demonstrated that although suprasegmental features are easier to perceive than segmental ones (Boothroyd, 1984
), many individuals with severe and profound hearing loss do experience difficulties in perceiving the former. Studies revealed that word pattern was perceived by most participants, whereas intonation, stress, and emphasis were reported as difficult to perceive by many individuals with severe to profound hearing loss (e.g., Gold, 1987
).
Cochlear implant (CI) technology has opened up new rehabilitation options for spoken language use among individuals with severe and profound hearing loss. Many speech features that were not audible via the acoustic amplification of HA became audible via the electrical stimulation of the CI, thus improving the ability to perceive speech (Waltzman & Hochberg, 1990
). Many studies that compared the perception of speech by children and adults using CI to those with similar hearing loss using HA showed the advantage of the CI over the hearing aid (Boothroyd & Eran, 1994
; Blamey et al., 2001
). However, most of these studies evaluated the perception of segmental features.
Much less research attention, however, has been given to the perception of suprasegmental features. Only a few studies examined the perception of suprasegmental features by participants with CI, and among these, some were conducted several years ago (e.g., Carney, Kienle, & Miyamoto, 1990
; Waltzman & Hochberg, 1990
), a possibly important factor when considering technological developments and modifications in CIs. For example, Carney et al. evaluated the perception of suprasegmental features by participants with a single-channel CI. Intonation was perceived best and syllable stress least well. Waltzman and Hochberg (1990)
found that children with Nucleus 22-channel CI as well as children with HA performed well in perceiving word emphasis and pitch changes. Boothroyd and Eran (1994)
reported that children who used the Nucleus CI did not significantly differ from children with HA in their perception of syllable number, but children with HA did perform better at perceiving intonation. The children in that study revealed hearing loss ranging between 82 and 117 dB (M = 99 dBHL).
Interestingly, research on intonation perception by implanted participants with older processors showed better performance than by those who used more advanced ones. Based on their results for participants with the Clarion CI with CIS speech processing strategy, Green, Faulkner, and Rosen (2004)
claimed that the perception of pitch by today's CI utilizes temporal rather than spectral information. Also, O'Halpin, Falkoner, Rosen, and Viani (2006)
suggested that their study participants who used CI Nucleus 24 (with ACE and SPEAK) did not rely on changes in fundamental frequency when perceiving word emphasis following a presentation of synthetic speech with controlled changes in each of the acoustic parameters (amplitude, duration, and fundamental frequency).
In light of the shortage of research on suprasegmental features, despite their importance for successful communication, this study aimed to assess the perception of suprasegmental features of speech by children with CI in comparison to that of children wearing HA, at different levels of hearing loss.
| Method |
|---|
|
|
|---|
Participants
Thirty children aged 815 years with prelingual bilateral sensorineural hearing loss participated in the study. The CI group comprised 10 children with profound hearing loss (>90 dBHL) who used CI. The HAS group comprised 10 children with severe hearing loss (M = 77 dBHL, standard deviation [SD] = 4.96) who used HA. The HAP group comprised 10 children with profound hearing loss (M = 99 dBHL, SD = 6.73) who used HA. Degree of hearing loss for children in the HAS and HAP groups was determined according to the pure tone average of 500 Hz, 1 KHz, and 2 KHz in the better ear. Hearing loss in the HAS and HAP groups was detected at a young age (between 1 and 3 years old). Children were fitted with digital HA from that time, using a DSL fitting procedure. See Appendices A and B for residual hearing at low frequencies among children in the two HAP and HAS subgroups, respectively.
Participants in the CI group used Nucleus 24 with ACE processing strategy (e.g., Pasanisi et al., 2002
). Their age of implantation ranged from 3:6 to 12:4 years (M = 7:11, SD = 3.14). Duration of CI use ranged from 1 to 8 years (M = 4:2, SD = 2.32). All implantees had well-established maps with dynamic range (M levelT level) between 40 and 60 current levels.
All 30 children had hearing parents, and spoken Hebrew was their native language. They used spoken language as their main mode of communication and were fully integrated, individually, in regular classes with hearing children. None of the participants had additional handicaps. They all received speech and language therapy from the time of hearing loss detection. The participants had no reading difficulties, according to their teachers' reports.
Stimulus Materials
Perception of word pattern, syllable stress, sentence intonation, and word emphasis was evaluated via four closed-format tests where recorded speech materials were presented to the children for identification among printed alternatives.
Word pattern.
The pattern perception test included 12 items, each with three words of differing syllable number. Each of the 36 Hebrew words (12 items x 3 words) was printed on a separate card. Each item contained a one-syllable, a two-syllable, and a three-syllable word. To avoid confusion, the number of syllables was determined according to both the dictionary and the speech envelope, considering that the two could conflict in their criteria for number of syllables, as in the word "lemon," which is two syllables by dictionary but one syllable by speech envelope (Erber, 1979
). The speech envelope was examined using IBM Speechviewer III software, which provides both visual and auditory analyses of speech attributes such as pitch, loudness, and timing. In each item, all three words began with a similar syllable. For example: gir (chalk), gibor (hero), and giborim (heroes) or bat (daughter), batim (houses), and batata (sweet potato). For the word pattern test, the total of 36 words was divided into two lists of 18 words each. Children were asked to identify each verbally presented, recorded word out of 18 printed alternative words. A correct response was considered when a word in the same category (i.e., one syllable, two syllable, or three syllable) was chosen.
Intonation.
The intonation subtest from the Hebrew Speech Pattern Contrasts (HeSPAC) (Kishon-Rabin, Eran, & Boothroyd, 1990
), which was based on Boothroyd's (1984)
SPAC in English, assessed the ability to discriminate between a statement and a yes/no question. Two lists totaling 48 sentences were used. Each list contained 12 sentences, each comprising familiar words that could have two possible intonation curvesa statement or a question (i.e., totaling 24 items in each list). Each sentence was printed twice: once as a statement with a period at the end, for example, cham po. (It's hot here.), and once as a question with a question mark at the end, for example, cham po? (Is it hot here?). For the intonation test, children were asked to listen to each audio-recorded sentence and report whether it was a statement or a question.
Syllable stress.
Word syllable stress was assessed through the use of 12 bisyllabic, meaningful, minimal, familiar pairs differing in their stress pattern; for example, bira (beer) and bi'ra (capital). The original test (Most, 2000
) included 10 minimal pairs, and 2 additional pairs were included for the purpose of this study, for a total of 24 words. Each word was printed within a meaningful sentence; for example, ani shote bira (I drink beer) and Jerusalem ir bi'ra (Jerusalem is the capital). All 12 sentence pairs were printed on the same sheet of paper. For the syllable stress test, children were asked to listen to each of the recorded words, presented in a random order, and to identify which sentence contained that word with that stress pattern out of the pair of sentences.
Word emphasis.
Perception of word emphasis in a sentence was evaluated using two lists of this subtest from the HeSPAC (Kishon-Rabin et al., 1990
). Each list included 12 sentences containing three one-syllable words. Each of the 12 sentences was recorded three times, once with the first word emphasized, once with the second word emphasized, and once with the third word emphasized, thus totaling 36 sentences in each of the two lists. Each of the 12 sentences was printed three times, with the first, second, or last word emphasized through the use of bold print. For example, ten li gir, ten li gir, and ten li gir (Give me a piece of chalk). For the word emphasis test, children were asked to listen to each recorded sentence and identify the appropriate printed sentence out of the three possible alternatives.
Procedure
All the speech materials were first recorded by a native Hebrew-speaking woman with clear voice and articulation using Sony's Sound Forge 7 software. To maintain consistent intensity throughout, the recordings were normalized. The stressed syllables and the emphasized words were normalized separately in order to keep the inherent intensity differences. The difference between the intensity level of these parts and the rest of the sentence was retained, as was measured in the original recording.
To achieve validity, all the recorded materials were introduced in a pilot study to five children with normal hearing, aged 8:815:3 (M = 12, SD = 2.42). The obtained mean correct identification scores for the four tests ranged between 95% and 100%.
In this study, each participant was evaluated alone in the same quiet room using his/her own well-functioning and optimally fitted sensory aids (CI or HA). Sensory aids' functioning was checked by the examiner using a status clip for the hearing aid or a signal checker for the CI, as well as Ling's (1976)
six sounds test.
The test materials were presented at normal conversational level (72 dBSPL) at the child's seat through the use of a JVC CD Portable RC-X501 tape recorder. Before each test, the printed materials were first introduced to the child to ascertain familiarity with the words. The examiner explained each task and introduced a few practice items (not included in the test) to ensure participants' understanding. The child was instructed that each of the stimulus words/sentences may appear more than once, to ensure that children continue listening for that stimulus even after it was presented and thus eliminate attempts to guess according to a process of elimination.
In each of the four tests, the order of presentation of the test stimuli was randomized. The order of presentation of the different tests was randomized among the participants.
| Results |
|---|
|
|
|---|
Each child received a score for the percent of correct responses regarding each of the four suprasegmental features: word pattern, intonation, stress, and emphasis. Inasmuch as closed-set materials were used, the scores were corrected for guessing using Boothroyd's (1988)
![]() |
The mean scores of each of the three groups on the different speech perception tests were compared. Table 1 presents the mean scores (in percentages) and the SDs of the three groups on the four tests. Multivariate analysis of variance revealed significant differences among the groups, F(8, 50) = 4.73, p < .001. One-way analyses of variance (ANOVAs) on each of the tests revealed significant differences between the groups on the perception of syllable stress, F(1, 27) = 12.51, p < .001, and on the perception of intonation, F(1, 27) = 14.22, p < .001. Bonferroni tests, conducted to examine the source of these differences, revealed that in both the intonation and stress tests, the CI group performed significantly poorer than the HAS and HAP groups (p < .05). The performance of the two hearing aid groups did not differ significantly from each other (p > .05). No significant differences emerged among the three groups in the perception of word pattern or in the perception of emphasis (p > .05).
|
To examine differences between the four test scores within each of the groups, two-way ANOVA with repeated measures was conducted. A significant interaction between group and test emerged, F(6, 81) = 9.62, p < .001. Bonferroni tests within each group, conducted to examine the source of these differences, revealed that within the CI group the word pattern was perceived significantly better than all the other tests (p < .05). Word emphasis was perceived significantly better than stress and intonation (p < .05). Stress was significantly more difficult to perceive than the other three features (p < .05).
Among the HAP group, the word pattern was perceived significantly better than the stress, and both emphasis and intonation were perceived significantly better than the stress (p < .05). In the HAS group, the word pattern was perceived significantly better than the stress (p < .05), but no other significant differences emerged among the tests in this group (p > .05). Figure 1 demonstrates the performance of each group on the four different tests.
|
Error Types
We examined children's types of errors on the stress, intonation, and emphasis tests, but not for the word pattern test because its scores were very high for all groups.
In the stress test, there were two possible errors: perceiving a word with initial stress as one with final stress and perceiving a final stress as an initial one. One-way ANOVAs revealed a significant difference between the groups for each error type, F(2, 27) = 4.26, p < .05. Bonferroni tests for each of these two error types revealed a significant difference between the CI group (M = 51.43, SD = 15.10) and the HAS group (M = 21.16, SD = 25.80) in substituting final stress for initial (p > .05). Also, there was a tendency toward a significant difference for this kind of substitution between the HAP group (M = 49.30, SD = 33.34) and the HAS group (p = .06). No significant difference emerged between the CI and HAP groups for this error type (p > .05). There were no significant differences between the groups in the percentage that erroneously perceived a final stress as an initial one (p > .05).
A two-way ANOVA with repeated measures revealed no significant difference in the percent of error type (p > .05). There was a significant interaction between error type (%) and group, F(2, 27) = 4.27, p < .05. Bonferroni tests revealed that the percent of errors perceiving final stress as initial was higher (78.84%) than the percent of errors perceiving initial stress as final (21.16%) in the HAS group (p < .05). In both the CI and the HAP groups, no significant differences emerged in the percentages of the two error types (p > .05).
For the emphasis test, there were three categorical alternatives: substitution of initial emphasis for medial or final emphasis (HAS: M = 49.70, SD = 35.77; HAP: M = 28.76, SD = 20.23; CI: M = 32.40, SD = 15.51), substitution of medial emphasis for initial or final emphasis (HAS: M = 26.85, SD = 25.95; HAP: M = 37.20, SD = 33.70; CI: M = 44.5, SD = 13.22), and substitution of final emphasis for initial or medial emphasis (HAS: M = 23.45, SD = 33.65; HAP: M = 34.03, SD = 20.35; CI: M = 23.1, SD = 18.33). To evaluate differences in percentages of errors, a two-way ANOVA with repeated measures was conducted. This analysis revealed no significant differences in the percent of error types as well as no significant interaction between error type and group (p > .05).
For the intonation test, there were two possible errors: perception of a yes/no question as a statement (HAS: M = 86.90, SD = 21.38; HAP: M = 80.10, SD = 28.20; CI: M = 60.40, SD = 17.76) and perception of a statement as a question (HAS: M = 13.10, SD = 21.38; HAP: M = 19.90, SD = 28.20; CI: M = 39.60, SD = 17.76). One-way ANOVAs revealed no significant differences between the groups for either type of error (p > .05). A two-way ANOVA with repeated measures revealed a significant difference in the error type, F(1, 23) = 30.6, p < .001. Substitution of a statement for a question was significantly higher (75.81%) than substitution of a question for a statement (24.19%). No significant differences emerged between the groups, and no significant interaction emerged between error type and group (p > .05).
Within-Group Correlations
The relationships among the different tests within each group were examined. Within the CI group, Pearson productmoment tests revealed significant correlations between intonation and stress perception (r = .66, p < .05) and between intonation and emphasis (r = .83, p < .01) but insignificant correlations between word pattern and the other tests (p >.05). Within the HAP group, significant correlations emerged between emphasis and word pattern (r = .67, p < .05), between intonation and syllable stress (r = .68, p < .05), and between intonation and emphasis (r = .87, p < .05). Within the HAS group, significant correlations (p < .05) emerged between word pattern and all three other tests: with stress (r = .71), with emphasis (r = .84), and with intonation (r = .82). Also, a significant correlation appeared between stress and intonation (r = .79, p < .05).
Demographic Variables
For CI users, no significant correlations emerged between age at implantation and performance on any of the tests. Also, no significant correlation emerged between duration of implant use and suprasegmental perception performance (p > .05).
For the two hearing aid groups, examination of the correlations between degree of hearing loss and suprasegmental perception yielded significant negative correlations only for the HAP group. In this group (n = 10), greater degree of hearing loss correlated with lower test performance for pattern (r = .67, p < .05), stress (r = .29, p < .05), emphasis (r = .79, p < .01), and intonation (r = .66, p < .05).
| Discussion |
|---|
|
|
|---|
Usage of the CI has been shown to enable more audibility to the speech signal and consequently better perception of speech by children who use CI in comparison to those with a similar degree of hearing loss who use HA (Blamey et al., 2001
The good performance of the two hearing aid groups may be explained by the fact that the changes in fundamental frequency as well as the speech envelope, which serve as cues for the listener in perceiving suprasegmental features (Borden et al., 1994
), are transmitted through the hearing aid and therefore are audible to individuals with HA who have severe and even profound hearing loss (Boothroyd, 1982
; Erber, 1979
; Most & Frank, 1991
). This ability was reported earlier for the perception of different features, such as intonation (Most, 1985
; Most & Frank, 1991
) and stress (Most, 2000
).
The finding that the CI group did not succeed in perceiving these features or in surpassing those with similar degrees of hearing loss who wore HA may find a possible explanation in that the CI does not provide sufficient information to the listener regarding these suprasegmental features of speech. These features are perceived based on low frequency and temporal information and, as Kong, Stickney, and Zeng (2005)
reported, CI listeners exhibit poor pitch perception due to their limited spectral resolution, especially the inaccurate encoding of low-frequency information. These researchers reported that in current CIs, the low-frequency information is neither appropriately represented by the place of stimulation nor by the temporal fine structure of the neural firing pattern. Thus, the CI does not provide adequate information in the low-frequency range. The relatively shallow insertion depth of present electrode arrays severely limits the transfer of low-frequency spectral information. The average insertion depth for the Nucleus implant was estimated to be 20 mm (Ketten et al., 1998
), which corresponds to the acoustic frequency lower limit of about 1000 Hz (Greenwood, 1990
). Even with the latest electrode designs of all the different CIs, which are intended to provide a deeper insertion of up to 30 mm, there is still no guarantee that low-frequency neurons can be stimulated due to both reduced nerve survival in individuals with hearing loss and nontonotopic distribution of low-frequency neurons in the cochlea (Linthicum, Fayad, Otto, Galey, & House, 1991
; Nadol, Young, & Glynn, 1989
). In addition, low-frequency temporal information is not appropriately encoded in current speech processing strategies (Kong et al., 2005
). Thus, it is possible that the participants in this study did not receive the necessary information regarding temporal and spectral information in the low-frequency range.
As mentioned above, HA, in contrast, enable low-frequency spectral information (Kong et al., 2005
). Also Henry and Turner (2003)
reported that the acoustic stimulation provided by a hearing aid might provide the user access to finer spectral and temporal pitch cues in the speech signal that are not resolved well by CIs. Thus, it would be of interest in future research to examine the performance of individuals with CI versus individuals with both CI and a hearing aid in their perception of suprasegmental features.
Another important factor that likely affected the CI group's performance was age at implantation. The current sample of children with CIs comprised relatively "late" implantees (8 of the 10 children were implanted after 6 years of age). Previous reports on children with CI showed the central role of age at implantation, where earlier age at implantation revealed a strong positive effect on the child's speech, language, and hearing performance (Dowell et al., 2002
; Harrison, Gordon, & Mount, 2005
). Harrison et al. compared the perception of words and sentences by children who were implanted between the ages of 2 and 13 years, studying children longitudinally up to 8 years after implantation. The results showed that those who were implanted between the ages of 2 and 3 performed significantly better than those who were implanted later. Also, after a long duration of CI use, the late implantees did not perform as well as the "early" implantees.
Thus, the relatively late implantation after age 6 among most of the present participants (8 of 10) may have had a crucial effect on their performance. The lack of a significant correlation between age at implantation and performance on the different tests supports this notion. As a result of this late age of implantation, the present data should be taken with caution because it is difficult to distinguish between the device's possible technological restrictions and the possible limitations resulting from the late age at implantation (e.g., due to reduced learning time, less exposure during crucial young ages). However, the fact that the CI participants were able to perceive at least some suprasegmental features suggests that the data do have sufficient merit. Future research should assess children who were implanted at a younger age and compare their results to those of this study.
Another explanation for the low performance of the CI group may be that the auditory performance of the participants did not represent their auditory potential. As Boothroyd (1982)
reported, auditory performance is a function of both auditory capacity and learning opportunity. Poor auditory performance can result from either poor auditory capacity or inadequate learning opportunity. Therefore, a child with excellent auditory capacity may underachieve because of poor learning opportunities. Although all the participants in the study received speech and language therapy, the emphasis in their auditory training was on the perception of the segmental features and not so much on the suprasegmental features, based on the clinicians' reports.
Klieve and Jeanes (2001)
reported that children with CI who received specific training in the perception of suprasegmental features, with and without linguistic context, improved in their perception. This improvement was even evident 10 weeks after the end of the intervention. Future research should assess the perception of suprasegmental features before and after intervention programs. It may also be interesting to intervene with production tasks as well as perception tasks and to examine the relations between perception and production of these speech features. Previous research reported a mutual relationship between perception and production of speech. In other words, improved perception skills may contribute to better production skills (Most & Frank, 1994
), and training of production skills may lead to an improvement in perception (Novelli-Olmstead & Ling, 1984
).
It should be noted that all our participants used the Nucleus 24 with ACE processing strategy in order to control for the type of implant and processing strategy. However, implants vary in their components, such as number of electrodes or speech coding strategy; hence, other CIs and/or processing strategies could possibly have yielded different results, calling for further research.
Studies on perception of segmental features revealed differences in auditory perception (Psarros et al., 2002
; Pasanisi et al., 2002
) and subjective feelings (Keifer, Hohl, Stürsebecher, Pfennigdorff, & Gstöettner, 2001
) of people who used different coding strategies. Fu, Hsu, and Horng (2004)
studied the identification of tone among Chinese individuals with different CIs and showed that the use of the ACE and the CIS coding strategies, in comparison to the SPEAK strategy, resulted in better tone identification. Kong et al. (2005)
studied the identification of melodies in a pilot study on a small number of people who used new CIs from all the different companies: Cochlear, Med-El, and Advance Bionics. These authors reported significant differences in participants' performance and suggested that research should continue to search for the source of these differences. Thus, the effect of CI type as well as coding strategy on the perception of suprasegmental features should be further examined in future research.
Another issue examined in the current study was the hierarchy in the perception of the different characteristics within each group. Previous research on hearing aid users reported that pattern perception was easiest to perceive because it is based mainly on the speech envelope (duration and amplitude changes), whereas intonation was the most difficult because it is based mainly on the changes in the fundamental frequency along the utterance. Stress and emphasis lie in between (Gold, 1987
). The present results partly supported these previous findings.
The pattern of the word, that is, its number of syllables, was the easiest to perceive by participants of all three groups who obtained high scores. As mentioned, this is a very basic skill, and even people with profound hearing loss may succeed in such tasks (Geers & Moog, 1989
; Kishon-Rabin et al., 2000
). Next in the hierarchy were emphasis and intonation, and the most difficult was perception of word stress.
The perception of word emphasis by CI users was previously reported by O'Halpin et al. (2006)
. These authors reported that although their participants had difficulties in differentiating between frequencies, most of them succeeded in the perception of word emphasis in a sentence. They suggested that the participants relied on the intensity and duration cues only.
Interestingly, in contrast to hearing aid users, the better perception of intonation in comparison to syllable stress by CI users was already reported (Carney et al., 1990
). These authors, however, examined the perception of a small group of CI users (Nucleus 24) by reading sentences aloud rather than through the use of recorded speech stimuli. This study offered the advantage of more controlled intensity and consistency regarding the stimuli level of presentation.
Also, recall that previously reported results were on English speaking participants, whereas this study was conducted on Hebrew speakers. Differences in the perception hierarchy between this study and previous ones, that is, stress emerging as the most difficult feature to perceive, may perhaps result from linguistic features. In English, the trochaic stress pattern is very common, whereas in Hebrew both the iambic and the trochaic stress are common (Adi-Bensaid & Bat-El, 2004
). Possibly, the difference in stress pattern between the languages could have affected the current outcomes, calling for more detailed future study designs regarding this issue.
Another explanation for the lower performance on the syllable stress test in comparison to the other tests lies in how this feature was presented. First, in listening to the syllable stress, the listener heard just a word, whereas for the other featuresintonation and emphasisthe listener heard a longer stimulus, that is, a full sentence. Also, the syllable stress test required a different and perhaps more demanding task from the listener. The participant was asked to listen to the word and then to choose the related sentence that contained that word. In other words, the listener had to listen to the meaning and not to the form. This specific task was used because previous research showed its usefulness and validity (Most, 2000
). In the other tests, tasks were simpler. In the intonation test, the listener heard a whole sentence and had to decide whether it was a statement or a question. In the emphasis test, the listener heard a whole specific sentence (with one emphasized word) and had to choose that sentence out of three possible alternatives. Thus, for these features, the listener only had to follow the acoustic changes along the sentence. Although, as mentioned, the syllable stress assessment task was reported as more valid than previously used measures in which the listener was required to listen to the word and choose the related written word out of a minimal pair differing in their stress pattern, it is possible that the current stress assessment task was more demanding than the other tasks in the present research.
We wanted to examine whether a certain type of error was more frequent in each of the tests. Specifically, we wished to determine if participants would substitute more initial syllable stress or vice versa, would substitute a question more than a statement, or would substitute initial emphasis more than a medial or a final one. Results showed that for the syllable stress test, no significant differences appeared between the error types for either the CI or HAP groups. In the HAS group, however, children substituted final stress (78.84%) more than initial stress (21.16%). It should be recalled that in general this group made fewer errors than the other two groups. When there were errors, however, they were made as mentioned above. This was quite surprising inasmuch as the frequency of the final stress in two-syllable words is higher than the initial stress. Nevertheless, as Carney et al. (1990)
reported, the trochaic stress pattern is easier to perceive and perhaps affected the results.
In the emphasis test, no significant differences emerged in substitution errors for initial, medial, or final emphasis, in any of the groups. Thus, even if the acoustic cues for the perception of word emphasis changed as a function of the emphasized word's position in the sentence (Weiss, Carney, & Leonard, 1985
), this did not affect success in perceiving it in the present research.
Regarding intonation perception, all the groups succeeded better in the perception of statements than in the perception of yes/no questions. These results supported previous reports on individuals with HA (Most & Frank, 1991
).
In summary, the CI technology opened up new options in the rehabilitation of many individuals with severe and profound hearing loss by enabling a better perception of spoken language. However, it seems that the present CIs are not sufficient in providing acoustic information on the suprasegmental features of speech, which are essential for intelligible communication. It should be noted, though, that the current results should be treated with caution due to the small sample. Future research should continue to examine the different variables with regard to the sensory aid itself, such as the speech processing strategy, and the individual factors such as age at implantation or type of intervention, with the aim of improving perception of suprasegmental speech features through the use of CI.
Appendix A Pure Tone (PT) and Pure Tone Average (PTA) Data on the Subgroup of Children With HA and Profound Hearing Loss (n = 10)
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Appendix B Pure Tone (PT) and Pure Tone Average (PTA) Data on the Subgroup of Children With HA and Severe Hearing Loss (n = 10)
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Acknowledgments |
|---|
The authors would like to express their appreciation to Dee Ankonina for her editorial contribution. No conflicts of interest were reported.
| References |
|---|
|
|
|---|
-
Adi-Bensaid L, Bat-El O. The development of the prosodic word in the speech of a hearing-impaired child with a cochlear implant device. Journal of Multilingual Communication Disorders (2004) 2:187206.[CrossRef]
Blamey PJ, Sarant JZ, Paatsch LE, Barry JG, Bow CP, Wales RJ, et al. Relationships among speech perception, production, language, hearing loss, and age in children with impaired hearing. Journal of Speech, Language & Hearing Research (2001) 44:264285.
Boothroyd A. Hearing impairment in young children (1982) Englewood Cliffs, NJ: Prentice-Hall.
Boothroyd A. Auditory perception of speech contrasts by subjects with sensorineural hearing loss. Journal of Speech & Hearing Research (1984) 27:134144.
Boothroyd A. Perception of speech pattern contrasts from auditory presentation of voice fundamental frequency. Ear & Hearing (1988) 9:313321.[ISI][Medline]
Boothroyd A, Eran O. Auditory speech perception capacity of child implant users expressed as equivalent hearing loss. Volta Review (1994) 96(5):151168.
Borden GJ, Harris KS, Raphael LJ. Speech science primer (1994) 3rd ed. Baltimore, MD: Lippincott Williams & Wilkins.
Carney AE, Kienle M, Miyamoto RC. Speech perception with a single-channel cochlear implant: A comparison with a single-channel tactile device. Journal of Speech & Hearing Research (1990) 33:229237.
Cohen-Licht D, Most T. The contribution of speech envelope to the perception of intonation by hearing and hearing impaired individuals (2000) Unpublished master's thesis, School of Education, Tel-Aviv University. (Hebrew).
Dowell RC, Dettman SJ, Hill K, Winton E, Barker EJ, Clark GM. Speech perception outcomes in older children who use multichannel cochlear implants: Older is not always poorer. Annual Otology, Rhinology & Laryngology (2002) 111:97101.
Engen T, Engen EA, Clarkson RL, Blackwell PM. Discrimination of intonation by hearing-impaired children. Applied Psycholinguistics (1983) 4:149160.[ISI]
Erber NP. Speech perception by profoundly hearing-impaired children. Journal of Speech & Hearing Disorders (1979) 44:255270.
Fu QJ, Hsu CJ, Horng MJ. Effects of processing strategy on Chinese tone recognition by Nucleus-24 cochlear implant users. Ear & Hearing (2004) 25:501508.[CrossRef][ISI][Medline]
Fry DB. Duration and intensity as physical correlates of linguistic stress. Journal of Acoustic Society of America (1955) 17:765768.
Geers AE, Moog JS. Evaluating speech perception skills: Tools for measuring benefits of cochlear implants, tactile aids, and hearing aids. In: Cochlear implants in young deaf childrenOwens E, Kessler DK, eds. (1989) Boston: College Hill. 227256.
Gold TG. Communication skills of mainstreamed hearing-impaired children. In: Development of language and communication skills in hearing-impaired childrenLevitt H, McGarr NS, Geffner D, eds. (1987) 108122. ASHA Monograph, 26.
Grant KW. Identification of intonation contours by normally hearing and profoundly hearing impaired listeners. Journal of the Acoustical Society of America (1987) 82:11721178.[CrossRef][ISI][Medline]
Green T, Faulkner A, Rosen S. Enhancing temporal cues to voice pitch in continuous interleaved sampling cochlear implant. Journal of the Acoustical Society in America (2004) 116:22982310.[CrossRef]
Greenwood DD. A cochlear frequency-position function for several species: 29 years later. Journal of Acoustic Society of America (1990) 87:25922605.[CrossRef]
Harrison RV, Gordon KA, Mount RJ. Is there a critical period for cochlear implantation in congenitally deaf children? Analyses of hearing speech perception performance after implantation. Developmental Psychology (2005) 46:252261.[CrossRef]
Henry BA, Turner CW. The resolution of complex spectral patterns by cochlear implant and normal hearing listeners. Journal of the Acoustical Society of America (2003) 113:28612873.[CrossRef][ISI][Medline]
Isenberg D, Gay T. Acoustic correlates of perceived stress in an isolated synthetic disyllable. Journal of Acoustical Society of America (1978) 64:S21.
Jackson P, Kelly D. The relationship between word and stress pattern recognition ability and hearing level in hearing impaired young adults. Volta Review (1986) 88:279288.[ISI]
Keifer J, Hohl S, Stürsebecher E, Pfennigdorff T, Gstöettner W. Comparison of speech recognition with different speech coding strategies (SPEAK, CIS and ACE) and their relationship to telemetric measures of compound action potentials in the Nucleus 24M cochlear implant system. Audiology (2001) 40:3242.[ISI][Medline]
Ketten DR, Skinner MW, Wang G, Vannier MW, Gates GA, Neely JG. In vivo measures of cochlear length and insertion depth of nucleus cochlear implant electrode arrays. Annals of Otology Rhinology Laryngology Supplement (1998) 175:116.
Kishon-Rabin L, Eran O, Boothroyd A. Hebrew version of the speech pattern contrast (SPAC) test. In: Unpublished Report #42 (1990) New York: Graduate Center, City University of New York.
Kishon-Rabin L, Taitelbaum R, Segal O, Henkin Y, Tene S, Muchnik C, et al. Speech perception: Implanted vs. aided children. In: Cochlear implantsWaltzman S, Cohen N, eds. (2000) New York: Thieme. 212214.
Klieve S, Jeanes RC. Perception of prosodic features by children with cochlear implants: Is it sufficient for understanding meaning differences in language? Deafness and Education International (2001) 3(1):1537.[CrossRef]
Kong Y-Y, Stickney G, Zeng F-G. Speech and melody recognition in binaurally combined acoustic and electric hearing. Journal of Acoustic Society of America (2005) 117:13511361.[CrossRef]
Ling D. Speech and the hearing impaired child: Theory and practice (1976) Washington, DC: A.G. Bell Association for the Deaf.
Linthicum FH, Fayad J, Otto SR, Galey FR, House WF. Cochlear implant histopathology. American Journal of Otolaryngology (1991) 12:245311.
Meyer TA, Svirsky MA, Kirk KI, Miyamoto RT. Improvement in speech perception by children with profound prelingual hearing loss: Effects of device, communication mode, and chronological age. Journal of Speech, Language & Hearing Research (1998) 41:846858.
Moore BCJ. Cochlear hearing loss (1998) London: Whurr.
Most T. Perception of intonation by severely and profoundly hearing impaired children (1985) Unpublished doctoral dissertation, City University of New York.
Most T. Production and perception of syllable stress by children with normal hearning and children with hearing impairment. Volta Review (2000) 101(2):5170.[ISI]
Most T, Frank Y. The relationship between the perception and the production of intonation by hearing impaired children. Volta Review (1991) 93:301309.[ISI]
Most T, Frank Y. The effect of age and hearing loss on tasks of perception and production of intonation. Volta Review (1994) 96:137149.[ISI]
Nadol JB, Young Y-S, Glynn RJ. Survival of spiral ganglion cells in profound sensorineural hearing loss: Implications for cochlear implantation. American Otology Rhinology Laryngology (1989) 98:411416.
Novelli-Olmstead T, Ling D. Speech production and speech discrimination by hearing impaired children. Volta Review (1984) 86:7280.[ISI]
O'Halpin R, Falkoner A, Rosen S, Viani L. The perception of stress and intonation by children with cochlear implants (2006) March. Paper Presented at the 8th European Symposium of Pediatric Cochlear Implantation, Venice, Italy.
Osberger MJ, McGarr N. Speech production characteristics of the hearing impaired. In: Speech and language advances in basic research and practiceLass N, ed. (1982) Vol. 8. New York: Academic Press. 221283.
Pasanisi E, Bacciu A, Vincenzo V, Guida M, Berghenti MT, Barbot A, et al. Comparison of speech perception benefits with SPEAK and ACE coding strategies in pediatric Nucleus CI24M cochlear implant recipients. International Journal of Pediatric Otorhinolaryngology (2002) 64:159163.[CrossRef][ISI][Medline]
Psarros CE, Plant KL, Lee K, Decker JA, Whitford LA, Cowan RSC. Convention from the SPEAK to the ACE strategy in children using the Nucleus 24 cochlear implant system: Speech perception and speech production outcomes. Ear & Hearing (2002) 23(1S):18S27S.[CrossRef][ISI][Medline]
Rubin-Spitz J, McGarr N. Perception of stress contrasts by the hearing impaired. Journal of the Acoustic Society of America (1986) 79(1):10S.
Snow D. Transcription of suprasegmentals. Topics in Language Disorders (2001) 21(4):4151.[ISI]
Stark RE, Levitt H. Prosodic feature reception and production in deaf children. Journal of the Acoustic Society of America (1974) 55(S63).
Waltzman S, Hochberg I. Perception of speech pattern contrasts using a multichannel cochlear implant. Ear & Hearing (1990) 11(1):5055.[ISI][Medline]
Weiss AL, Carney AE, Leonard LB. Perceived contrastive stress production in hearing-impaired and normal-hearing children. Journal of Speech & Hearing Research (1985) 28:2635.
This article has been cited by other articles:
![]() |
T. Most, I. Levin, and M. Sarsour The Effect of Modern Standard Arabic Orthography on Speech Production by Arab Children With Hearing Loss J. Deaf Stud. Deaf Educ., July 1, 2008; 13(3): 417 - 431. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


