AECT Handbook of Research

Table of Contents

28: Auditory Presentations and Language Laboratories
PDF

28.1 Introduction and Historical Overview
28.2 Memory for Auditory Presentations
28.3 Audio Technology
28.4 Audio-Tutorials
28.5 Compressed Speech
28.6 Distance Audio Education
28.7 Auditory Learning vs. Print Presentations
28.8 Language Laboratories
  References









Search this Handbook for:

28.8 Language Laboratories

28.8.1 History

Because of the nature of their task, foreign-language educators have been heavily involved in the use of audio equipment. They welcomed the first audio device, the phonograph, and have immediately adopted other advances in audio technology such as magnetic tape and digital media. Unfortunately, the history of the use of audio technology to teach languages has not been duly noted by historians of educational technology. Paul Saettler, in his definitive The Evolution of American Educational Technology, only makes passing references to foreign-language teaching, and language laboratories are granted merely one paragraph (p. 187). It will be demonstrated that this disregard is startling in view of the extensive use of, and massive investment in, audio equipment by foreign-language educators. Moreover, it will be shown that the research that accompanied these commitments has not been appreciated by the larger educational technology community.

28.8.1.1. History of the Use of Audio Resources in Foreign-Language Education: 1877 to 1945. Leon (1962) and Peterson (1974) have documented the early use of audio recordings by foreign-language educators since the invention of the phonograph by Thomas Edison in 1877. By 1893 there were commercial record sets available for Spanish and English as a foreign language. The phonograph was used in regular classes and for self-study at home, but to what extent is difficult to ascertain. In their 340-page annotated bibliography of "modern" language methodology (the references commence in 1880s), Buchanan and MacPhee (1928) include only nine entries concerning the phonograph. Three of these are listings of recorded courses; none of the six articles is a controlled study of the merit of the phonograph. The 491-page Bagley et al. volume (1930) contains no mention of the phonograph. This paucity of references is surprising when one considers that in the 1880s the field of phonetics was born out of the effort to teach proper foreign-language pronunciation. The literature of the period is full of articles on phonetics, and many pronunciation textbooks and teaching materials were published. One would have expected greater enthusiasm in the language-teaching community for the equipment that could provide native speaker models.

According to a contemporary (Keating, 1936), initial use of the phonograph and other devices such as the stereopticon (an early slide projector) was haphazard, and interest waned because there was no real absorption of modern inventions into the teaching program (p. 678). The Depression may have prohibited a wider use of the phonograph in the 1930s. A definite discouragement to its use was the Carnegie-funded Coleman report of 1929, which stated that the reading skill should be emphasized (Parker, 1961). Nevertheless, it should be noted that the decade saw much interest in the use of radio for foreign-language instruction. From October 1935 (volume 20) through December 1946 (volume 30), the Modern Language Journal had a radio "department."

It is not until 1908 that there is any evidence of a laboratory arrangement of phonographic equipment (Leon, 1962). By this is meant a dedicated facility for foreign-language study. This lab was at the University of Grenoble in France. An American, Frank C. Chalfant, who studied there in the summer of 1909, appears to have been the one who brought the idea back to this country. He installed a "phonetics laboratory" at Washington State College in Pullman during the 1911-1912 academic year. Pictures of this installation in use show students listening via networked earphones. This lab also had a phonograph recording machine so that students could compare their pronunciation with the native-speaker models.

Near the time that Chalfant established his phonetics laboratory, the U.S. Military and Naval Academy set aside rooms for listening to foreign-language records (Clarke, 1918). Another early facility was set up at the University of Utah in 1919 by Ralph Waltz (1930). He moved to Ohio State and built another lab about which he published several articles (Waltz, 1930, 1931, 1932). Waltz is usually credited with coining the term language laboratory in 1930 (Hocking, 1964). In fact, Chalfant had used it as early as 1916 in the Washington State College yearbook, the Chinook and probably in the regional foreign-language education circles of which he was a leader. In any event, the preferred term until after W.W.II was "phonetics lab." During the 1930s many institutions established labs (Gullette, 1932), but, as in the case of the phonograph, discussions of their use did not loom large in the methodological literature. For example, the Modern Language Journal's annual annotated bibliography of monographs and articles only had four entries prior to 1945 besides the three articles by Waltz. The bibliography of the language laboratory for the years 1938-1958 compiled by Sanchez (1959) only added four items to the total for the prewar period. Eddy (1944) and Whitehouse (1945) describe the equipment and use of two labs at the end of the period under consideration.

28.8.1.2. History of the Use of Audio Resources in Foreign-Language Education: 1946 to 1958. The year 1946 is considered to mark the beginning of the Modern Language laboratory movement (Hocking, 1964; Koekkoek, 1959). The labs at Louisiana State University (Hocking, 1964) and the University of Laval in Quebec City, Canada (Kelly, 1969), were built that year. Whether these postwar labs owed anything to the previous phonetics labs is unclear, but probable. Claudel's (1968) use of "predecessor" (p. 221) expresses linkage. However, according to Koekkoek, "the beginning of the language laboratory movement was a new start, albeit with similar means and ends, rather than a direct expansion of the limited phonetics laboratory tradition" (1959, p. 4). Sanchez (1959) is ambiguous on the question. The earliest entry in his annotated bibliography of the "modern" language laboratory is a reference to a phonetics laboratory (Peebles, 1938), but he included the note "not related to the Modern Language lab, as such' (p. 231). The record at the universities of Iowa (Funke, 1949) and Tennessee (Stiefel, 1952) indicate continuity with phonetics labs. It thus appears that Koekkoek's statement must be tempered. Most institutions that built language labs after the war did so for, the first time, whereas a few others updated their prewar phonetics labs.

A point of difference between phonetics labs and language labs were individual booths or carrels. Although the lab at Ohio State had long tables divided into "compartments" (Waltz, 1930, p. 28) by 18-inch-tall boards, these did not provide sufficient acoustic isolation (Schenk, 1930). Levin (193 1) suggested that the facility he described would be improved by the installation of soundproof booths. These became standard equipment in the postwar labs. Labs of the period were principally audio installations, but movie, slide, and filmstrip projectors were sometimes present as well (Newmark, 1948; Hirsch, 1954).

Also at issue is the impulse for the modern lab movement. It is certain that the military's success in language training during the war caught the attention of the foreign-language teaching profession at large. In 1945, the Modern Language Journal's annual bibliography began a separate category for the "Army" (Army Specialized Training Program, ASTP) method. It contained far more entries than any of the other 21 categories. Regarding labs specifically,. Gaudin (1946) demonstrated enthusiasm for the Army experience in her design of a facility. Koekkoek maintained that 'Vie language laboratory and its spread is a postwar development, fostered by a climate of experimentation which was stimulated by the Army language teaching program during the war" (1959, p. 4). Pictures of labs in the 1950s certainly have a military air to them. Students sitting in rows with eyes straight ahead suggests columns of soldiers at attention. The individual student in a booth wearing a headset evokes an image of a navigator or radar technician at his post on a ship or airplane.

Hocking (1964), however, adamantly denied that the ASTP method drove the establishing of labs. He is echoed by Barrutia:

... we have Elton Hocking to thank for almost single-handedly trying to keep the record straight about the fiction of the supposed extended use of recording equipment and aural-oral techniques in the A.S.T.P... the Army Specialized Training Program did not, as is so widely believed, pioneer language laboratories... (1967, p. 89.0).

To what then did Hocking and Barrutia attribute the postwar interest in labs? They cite the availability of magnetic tape and tape-recording machines from 1946. Hitherto, labs were outfitted with phonographs or wire recorders. These had several problems: Their sound fidelity was low, they were fragile, and they were difficult to edit. Plastic disc player/recorders such as the SoundScriber (first advertised in the Modern Language Journal in October 1946) were an improvement, but according to Hocking "the superiority of the tape recorder-reproducer was immediately apparent" (1964 ' p. 18), and the verdict of history confirmed this first impression.

This major technological improvement does not fully account for the language laboratory movement. Roughly concurrent with the invention of magnetic tape was the development of the audiolingual method. It is here that the ASTP can be given some deserved credit. It stressed the listening and speaking skills more than reading and writing - the priorities of prewar methods. The Army method relied much on small-group practice to develop the learners' aural and oral abilities. Another important feature of the ASTP was the preponderate use of native-speaker instructors. Stack connects these developments in equipment and methodology:

The language laboratory owes its existence to the recognition that the spoken form of language is central to effective communication, and that it should have as large a share in instruction as do written forms. In order to implement this new orientation of language teaching, the textbook (which is essentially graphic) was supplemented by sound recordings of native speakers. The coincidental advent of the tape recorder created a fortuitous juncture of technology and pedagogy (197 1, p. 3). By 1958, in the United States there were 64 labs in secondary schools and 240 in colleges and universities (Johnston & Seerley, 1960). The passage of the National Defense Education Act that year ushered in a new phase in language laboratory history.

28.8.1.3. History of the Use of Audio Resources in Foreign-Language Education: 1959 to Present Responding to the challenge to Yankee know-how and American ingenuity by the Soviet's launching of Sputnik on October 4, 1957, Congress passed the National Defense Education Act, which President Eisenhower signed on September 2, 1958. The act sought to strengthen the teaching of mathematics, science, and foreign languages in America's schools. The intent of the foreign-language provisions of this important legislation has been described by Derthick (1959). The history of the language laboratory in the first years following the NDEA has been written by Parker (1961), Hocking (1964), and Diekhoff (1965). There was an explosion in the number of facilities, thanks to generous federal support: $76 million in matching funds by 1963 (Diekhoff, 1965). By 1962 there were approximately 5,000 installations in secondary schools (Hocking, 1964). Another 1,000 secondary schools had labs by 1964 (Diekhoff, 1965). This represents a thousand fold increase in the number of labs at the secondary level from 1958! Most of these were in medium-to-large school districts (Godfrey, 1967). Although colleges and universities were not eligible for equipment funds under the NDEA, they were caught up in the national enthusiasm for language study, and thus committed their own monies to labs. By 1962 there were 900 labs in higher education (Hocking, 1964). More postsecondary labs were built from 1965 when matching funds became available under Title VI-A of the Higher Education Act (Ek, 1974).

Unquestionably, the 1960s were the golden years of the language laboratory. Those involved in these facilities felt an urgent need to gather and compare experiences. William Riley Parker wrote this about the motivation for the first of the Indiana and Purdue universities-sponsored language laboratory conferences in 1960 (the others were in 1961, 1962, and 1965):

... foreign language teachers feel themselves suddenly involved in a technological revolution, suddenly chin-deep in a tide of new demands upon their competencies, and they seek, some almost frantically, enlightenment and practical help (1960, p. v).

B. F. Skinner spoke at the first of these meetings (January 22, 1960) on the use of teaching machines for foreign language instruction. One of the respondents to Skinner's paper was Robert Glaser. Neither of these men were foreign-language educators by training, but both were already well known in the educational technology community. Their presence at this conference is testimony both to the interest the larger educational community of the day had in foreign-language instruction and to the willingness of foreign-language professionals to accept insights from other disciplines, notably psychology. In addition to the Indiana conferences, there were many lab-related presentations at meetings of the various professional associations to which language educators belonged: the Modern Language Association (MLA), the American Association of Teachers of French (AATF), the American Association of Teachers of German (AATG), and the American Association of Teachers of Spanish and Portuguese (AATSP).

A spate of books, articles, and dissertations also accompanied the flow of money and the many installations. Most of the entries in Davison's (1973) 780-item bibliography of the language laboratory from 1950 through 1972 are from the 1960s, and thus post-NDEA. The first edition of Edward Stack's textbook, The Language Laboratory and Modern Language Teaching, appeared in 1960. It should be consulted by those interested in the literature of the period, because it explains the terminology of installations and operations current at the time. Also -appearing in the early 1960s were Hutchinson's monograph concerning labs in high schools (1961), and the technical guide to facilities by Hayes (1963). Leon's book Laboratoire des Langues et Correction Phonitique (1962), although written in French and published in France, circulated widely in this country, as evidenced by the numerous citations of it. The Scherer and Wertheimer (1964) book-length report of an experiment involving language labs will be discussed in the section on research. As for articles, hundreds appeared in all ranges of periodicals from school district newsletters to long-established refereed journals such as The Modern Language Journal, Language Learning, Hispania, The French Review, and The German Quarterly. A publication that focused on language laboratories, The Audio-Visual Language Journal, was founded in Great Britain in 1962. Both The International Review of Applied Linguistics and Foreign Language Annals carried articles about the language laboratory from their inceptions in 1963 and 1967, respectively. The major research articles of the period will be noted in a later section.

During the first half of the decade, language laboratory professionals held caucuses at the conventions of the Modern Language Association and the Department of Audiovisual Instruction, but they soon felt the need for their own organization. The National Association of Language Laboratory Directors (NALLD) was founded in 1965. The NALLD began publishing a newsletter the following year. The inaugural issue reported that at the first NALLD meeting in Chicago in December 1965, there had been much discussion of the lab director's job description and the problem schools face in recruiting qualified applicants. Job openings were featured regularly from the start of this publication.

The major technical development of note during the decade was the audiocassette (Dodge, 1968). The advantages of this were the lower price and smaller, lighter machines that -could play it. However, it did have the drawbacks of lower fidelity and greater difficulty of editing by cutting and splicing. The quality of sound was eventually ameliorated, and the editing problem was not sufficient to prevent the cassette from replacing reel tape in language labs in the 1970s. Another technical advance was the speech compressor-expander (see 28.5). This device allowed a recording to be sped up (compressed) or slowed down (expanded). Articles on this technology were numerous in the general educational literature from the start of the decade. Paradoxically, it was not until 1978 that anything on it appeared in the NALLD Journal (Harvey, 1978). One would have expected a greater enthusiasm among the foreign-language community. The ability to slow down a tape would seem to be a boon to students struggling with a difficult passage. Moreover, variable-speed technology was not unknown in foreign-language teaching, for Hirsch (1954) had commended the use of the sound stretcher (p. 22) in the early 1950s. Machines with a repeat or skip-back function came on the scene at this time as well. This feature permitted students to replay easily a tape segment, and thus was well suited to dictations and audiolingual listen-and-repeat drills. The cassette Canon Repeat-Corder L was first advertised in the NALLD Journal in the October 1970 issue. Aikens, and Ross (1977) wrote an article in the same journal describing a reel-to-reel machine they fabricated. By the end of the decade, the major manufacturers, such as Sony and Tandberg, were producing machines with skip-back capability.

Language laboratories ended the 1960s on a mixed note. On the one hand, federal funding was diminished:

... the amount of equipment funding in Title III-A of the National Defense Education Act (NDEA) and Title VI-A of the Higher Education Act (HEA), two large sources for equipment funds, dropped from an allotment in fiscal year 1968-69 of $91.24 million to nothing in fiscal year 1969-70. The portent of this budgetary reduction is not as black as it might seem: any program for which the federal government is still offering subsidy, e.g., bilingualism, poverty, etc., still has access to equipment funds, but the inflated years of the mid-sixties have come to a close (Dodge, 1968, p. 331).

On the other hand, Smith (1970) did not lament this decline in federal support as entirely negative, because he candidly acknowledged that "the recent years have seen much Professional neglect and misuse of the language laboratory (P. 191). He sensed a positive development in the unanimous agreement that the laboratories should be used to "individualize instruction," in the university community and provide the corresponding "increase in expenditures for equipment and materials for tutorial and individualized instruction' (P. 192). Heinich (1968) also commented on the problems associated with labs and the insights that were gained by both language educators and instructional technologists:

The language laboratory movement threw content and media specialists together in an intimate working relationship that produced very strange and startling experiences. For the first time, language teachers discovered that the mode and materials of instruction interact with instructional behavioral objectives and methods. Many language teachers did not understand that a language laboratory requires a different method of instruction: that print stimulus methods are not audio stimulus methods. On the other hand, the audiovisual specialist was shaken out of a comfortable

bookkeeping-procurement function and introduced, often for the first time, to the rigors of developing curriculum materials to meet specific curricular objectives. The novelty of the roles played by both has caused so many difficulties that the language laboratory has not yet reached its potential value. One of the lessons learned by audiovisual directors in this encounter is the incredible quantity of materials required by technology when media are used for direct instruction. The classroom teacher, at the same time, was experiencing another instance of shared responsibility with media (pp. 50-5 1).

The 1970s and early 1980s were a period of malaise for the language laboratory. Coinciding with the drying up of funds was a sharp drop off in the number of articles published. An index of this change can be seen in the ACTFL yearbooks. The first two volumes contained the articles by Dodge (1968) and Smith (1970), with 84 and 95 citations, respectively. The 1971 volume had one paragraph about labs and two references! From then on until 1983, many volumes contained no mention of labs, and those that did accorded a page at most. Labs had their vocal defenders to be sure (Jarlett, 1971), but frank avowals of their problems (Altamura, 1970; Racle, 1976) and their need for revitalization (Strei, 1977) were prominent.

A turnaround in the decline of the language lab could be seen from the early 1980s. A 3-day colloquium with the theme "A Renaissance for the Language Lab" was held at Concordia University in July of 1981 (Kenner, 1981). McCoy and Weible maintained that the recent "revival of interest in language laboratories" was "directly attributable to the 'domestication' of the tape recorder, made possible through the invention of the audiocassette" (1983, p. 110). What this indicates is that it took nearly 2 decades for the audiocassette to fully work its way into the instructional mores of teachers.

The lab of the 80s was not to be limited to audio technology. The year 1983 saw the founding of the Computer Assisted Learning and Instruction Consortium (CALICO), a group that was (and still is) dominated by language educators. It should not be thought that the invention of the personal computer in the late 70s was solely responsible for the interest in computer-assisted language instruction (known as CALL). Mainframes had already been much used for this purpose, most notably in the PLATO system at the University of Illinois. Computers were welcomed for their potential, but cautions were issued about the need to avoid the unrealistic expectations associated with early language labs and the need to learn other lessons from language lab history (Marty, 1981; McCoy & Weibel, 1983; Lemon, 1986; Pederson, 1987; Otto, 1989).

Ely's Bring the Lab Back to Life was published in 1984. In 1985 the president of the International Association of Learning Laboratories (IALL, the new name for the NALLD as of November 1982), Glyn Holmes, could affirm that the professional group was showing new signs of vitality (Holmes, 1985, p. 5). This rebirth was also indicated by volumes 18 and 19 of the ACTFL Foreign Language Education Series, which were devoted entirely to technology (Smith, 1987, 1989). With new life came a new look. By 1989, Otto could write that "language laboratories have been redefined as multimedia learning centers that deliver computer and video services to faculty and students in addition to familiar audio resources" (1989, p. 38). Since 1988 the reinvigorated IALL has published several monographs dealing with learning-center design and pedagogical use (e.g., Stone, 1988) and has produced several "video tours" of facilities around the country.

A further sign of the broadening of focus of language laboratories is the greater attention placed on reading and writing. A prime example of the former is the popular series of software produced by the Transparent Language Company. The Systeme-D writing assistant program of Heinle & Heinle Publishers is an indication of the latter. Major research has accompanied its usage (Bland et al., 1990).

The use of video is firmly established in foreign-language teaching. A prominent instance is the innovative first- and second-year French course that appeared in 1987, French in Action. Interestingly, an early leader in the post-NDEA labs, Pierre Capretz, was the driving force behind it. French in Action and its subsequent Spanish equivalent, Destinos, received major funding from the Annenberg Foundation and are broadcast on many Public Broadcasting System stations.

28.8.1.4. Conclusion. Surely language laboratories represent the single largest investment and installment of audio resources in education. It is no accident that the foreign-language teaching community has been heavily involved in using audio. Audio has face validity in foreign language instruction simply because much of language use is oral/aural. Granted, there has been concern that the reading and writing skills might be neglected in methodologies that make much use of recordings such as audiolingualism. Nevertheless, for foreign-language educators it has never been an issue of whether to use audio; it has been a question of how.

28.8.2 Research

The preceding historical account detailed the growth and extent of a particular application of audio technology. What has not yet been assessed is the effectiveness of this massive expenditure of effort and money. This is the task of research. This section will give the main currents of research for each period in the language laboratory's history. Details of each study will not be mentioned except insofar as they are crucial to interpreting the chief findings. The bibliography will permit the interested reader to locate and directly consult the reports cited for further information about the design and conditions of each study.

28.8.2.1. Research on the Use of Audio Resources in Foreign-Language Instruction: 1877 to 1946. There appears to have been very little attempt to provide an empirical justification for the use of the phonograph and phonetics laboratories before World War II. This is not entirely surprising, given that before the 1960s very few foreign-language scholars had training in quantitative experimental techniques: They were humanists schooled in literary and philological research methods. There are, however, accounts of problems with the use of phonographs and phonetics labs which can perhaps be classified as observational research. These observations will be noted, for they raise issues that were to be examined more rigorously later. Moreover, these records demonstrate that there was some notion of accountability among those who used early audio resources. That is, the phonograph and phonetics labs were not accepted and used uncritically.

Based on his long experimentation, C. C. Clarke (1918, p. 120) provided the first guidelines to appear in the scholarly literature on the proper use of the phonograph in teaching foreign languages. He granted that some teachers found the mechanism (p. 122) troublesome, time-consuming, and distracting. To this he countered that it afforded learners the opportunity to hear consistent native-speaker models that never suffered fatigue. He concluded that "the true success of the speech record is in teaching pronunciation and that nothing else should be expected of it" (p. 120). The emphasis on pronunciation training certainly became the hallmark of the phonetics laboratories. Waltz, the founder of the lab at Ohio State University, also cited the benefit of having tireless native-speaker models to imitate. By having the "constant control sounding in his ears" (p. 29), the student could exclude the imperfect approximations of his peers and gain confidence in his own speaking ability. However, a colleague of Waltz, Emma Schenk, complained that the earphones did not adequately keep out others' voices (1930). In addition, she complained of the poor audio quality and the lack of supervision in the Ohio State lab. She worried that students would "cultivate errors" (p. 30). She also noted much cheating on time slips and many students who were not on task while in the lab. Levin (1931) was sympathetic to labs and sought to offer constructive criticism of their use. He stressed the need for immediate feedback so as to avoid the problem Schenk had feared, namely, the development of bad speech habits. Gullette (1932) showed that this fear was justified. He noted with consternation that many students working alone in the lab reverted back to the poor pronunciation practices that earlier had been eradicated in class drill sessions. He stressed that imitation was not sufficient; what was needed was ear training such as was done in music classes. This would allow for self-diagnosis and correction.

Waltz's report (1932) of two studies he consulted on, but did not conduct himself, is the first record of an attempt to establish empirically the phonetic/language laboratory's effectiveness. It is ironic, in view of the identification of the language laboratory with foreign languages, that neither investigation involved their teaching! The first experiment had to do with the teaching of the Irish accent; the second was concerned with correct English diction. Both studies can be faulted for the low number of subjects (20 and 24), the apparent nonrandom assignment of subjects to treatments, and the lack of statistical analysis beyond a comparison of group means. Nevertheless, Waltz did note that the groups were equivalent by using scores on standardized tests of intelligence, hearing, and pitch discrimination. In the first study, the lab group's mean was 10. 1 (out of a possible 20 points). The control group's mean was 8.04. In the second study, both the lab and nonlab groups showed similar gains. Waltz argued that the comparable improvement was actually evidence in favor of the efficiency of the lab: Class and instructor time was saved by having students work independently in the lab.

For the sake of comprehensiveness, Peebles' master's thesis (1938) must be mentioned. It was included in the annotated bibliography compiled by 'Sanchez (1959). Students who volunteered to use the Phonetics Laboratory at the University of Colorado and who received one or two French pronunciation tutorial sessions were compared with students who did not avail themselves of these opportunities. Amazingly, she did not specify how much the volunteers used the lab. Neither were the total number of subjects, nor the number of subjects per group, specified. These omissions bespeak a blatant lack of control that invalidates any conclusions that might be drawn from her data, which in fact consisted only of mean numbers of pronunciation mistakes on a posttest.

28.8.2.1.1. Summary. Obviously, no firm conclusions can be drawn about the effectiveness of the phonograph and the prewar phonetics laboratory from these few observations and two cursory investigations. There appears to have been a consensus among practitioners that the best use of this equipment was for pronunciation training. All saw a potential benefit in untiring, consistent, native speaker models for students to imitate. However, complaints were raised about the sound quality of recordings, and it was observed that many learners lacked the self-monitoring ability to profit fully from diem. Just as the next period of language laboratory history saw an increase in the number and sophistication of facilities, so there was similar growth in the inquiries concerning their value.

28.8.2.2. Research on the Use of Audio Resources in Foreign-Language Instruction: 1946 to 1958. Language laboratory research of the postwar and pre-NDEA period may be described as nascent. Certain features of empirical research are seen; some are only partially present, and Others are completely absent. For example, one sees the first use of standardized tests as criterion measures, and this use is universal. On the other hand, only one study (Allen, 1960) randomly assigned subjects to treatments; intact classes were used otherwise. Only two-group designs and t tests were used. The number of subjects, when reported, was uniformly low. There certainly was not an agreed-upon research agenda. In fact, researchers of the day were either unaware of what their peers were doing (there is little citation of others' work) or they simply ignored it. With these limitations in view, the following discussion will list five studies of the period in chronological order and present their conclusions. According to Kelly (1969), more experiments were conducted than this number would suggest, "but we only know of those whose authors had the time and energy to write articles about them" (p. 245). This is corroborated by Johnson and Seerley (1960), who refer to studies done at a high school and two universities (all unnamed) and of research that was planned at the University of Massachusetts.

Stiefel's description (1952) of the language laboratory at the University of Tennessee and its usage is barely beyond the anecdotal level. Yet its mention of the University of Chicago language investigation tests and the cooperative tests (created by the forerunner of Educational Testing Services) does represent the first, inchoate desire of those involved in language labs to have an objective benchmark with which to compare groups of learners who used the lab with those who did not. In this case, Stiefel compared the scores of lab classes on these measures and on an in-house test with classes from previous years. Thus, this is an ex post facto study. He, noted higher scores for lab groups on the in-house tests, but he was hesitant to draw any strong conclusions from these. He found that both groups were comparable on the standardized tests. This he took as heartening evidence that the reading ability (as measured by the cooperative test) of the lab groups did not suffer because of their emphasis on the listening and speaking skills. This last point was of great concern to the scholarly community of the day, as further evidenced by the following study.

Supported by a grant from the Carnegie Foundation for the Advancement of Teaching, Brushwood and Polmantier (1953) at the University of Missouri sought to determine whether dialogue repetition and memorization in the language lab increased learners' aural skills. Although for administrative reasons they were unable to randomly assign subjects to treatments, these researchers did take the trouble to administer the Iowa Foreign Language Aptitude Test to the intact classes that constituted the treatment groups. Moreover, the researchers obtained access to the scores on two English proficiency tests that all the subjects had taken previously. All these tests revealed that the control and experimental groups were matched on these measures, as they were in age.

Four groups were formed: two groups of 19 subjects each who were enrolled in elementary Spanish, and two groups of 23 who were enrolled in elementary French. The control groups simply attended the standard 5-hour per week (I hour daily) course as taught at the University of Missouri. The experimental groups covered the same material (grammar, reading, and composition) as the control groups, but did so in 4 hours instead of 5. The experimental groups also attended two 1-hour laboratory sessions during the first 4 days of the week. In these sessions, they worked with a dialogue written for the experiment that incorporated the grammar and vocabulary that was studied that week. The work consisted of listening to the dialogue via earphones and chorally repeating until it was memorized. A graduate student or upperclassman lab attendant controlled the tape player and thus directed the sessions. His or her only other task was to correct gross pronunciation errors. The experimental group then had a fifth class session in which the regular instructor had the students review and act out the dialogues. The dialogue was then manipulated by changing number, person, tense, object, etc., as a transition to free conversation. This fifth hour was deemed "the crucial point in the achievement of the oral-aural objective" (p. 8).

At the end of the semester the groups were given the cooperative tests on reading, vocabulary, and grammar, and an aural comprehension test created for the experiment. For whatever reason, both t tests and F tests were calculated for the two Spanish and two French groups, but no tests were run on a combination of control and experimental groups across languages. The results showed that there were no significant differences on the cooperative measures. There were significant ts, but not Fs, in favor of the experimental groups on the aural comprehension test.

This study can be faulted on several grounds, but perhaps the most serious flaw may be the lack of control for amount of instruction. Although the authors claimed that the 2 hours of lab practice for the experimental groups were in lieu of homework required of the students in the control groups, it must be noted that the lab sessions were scheduled and monitored. Whether students in the control sections did their work or not is unknown. Moreover, the significant difference between the groups on aural comprehension was measured by a nonstandardized test, the validity and reliability of which is open to question. All of these criticisms aside, Brushwood and Polmentier's study was certainly more rigorous than previous investigations of the use of audio resources in foreign-language teaching.

Next in chronological order are two ex post facto studies that are included here for the sake of completeness. The first is the description by Fotos (1955) of the use of the language laboratory at Purdue University. In direct opposition to the Brushwood and Polmentier study, the lab at Purdue was used for "predrilling [emphasis added] the student on the French text of the basic grammar or reading lesson" (p. 142). Fotos reported that students in first-year French scored 60.1 on the cooperative tests; second-year students scored 71.3. The national averages were 56.7 and 68.8, respectively. Whether this was a significant difference cannot be ascertained.

Mueller and Borglum (1956) looked at correlations between lab attendance and course grade, final exam score, and cooperative test score at Wayne University. They noted that students who voluntarily attended the lab more than the minimum requirement of 30 minutes per week generally did better on these measures. They drew special attention to the heavy lab users' 10% increase on the cooperative reading test: "an unprecedented jump in 8 years of recorded scores" (p. 325). Moreover, they observed that even students who only attended the lab 30 minutes per week scored better than students from previous years who had no lab experience. They also noted a lower drop rate for heavy lab users. One can surmise that greater time-on-task naturally produced greater learning. In their discussion, Mueller and Borglum also acknowledged a significant teacher effect: The lab's director "succeeded in getting the students of his sections to attend the laboratory 2 or 3 times more frequently than other instructors" (p. 322).

Allen (1960) conducted a study during the 1957-58 academic year which represents the last investigation of Language laboratories in the 1946-58 period. The 54 subjects were 15- and 16-year-old students in a high school operated by Ohio State University. Allen created eight groups based on level (elementary or intermediate), language (French or Spanish), and use of the lab (55 minutes per week or none). These divisions made for groups as small as five. He administered three standardized tests in order to have a basis for pairing subjects. Once the pairs were established, he used a random-choice technique to assign students to the lab or nonlab treatments.

The lab groups spent one classroom hour listening to instructor-made tapes of "humorous or suspenseful tales" (p. 355) and answering questions about them in the target language. They recorded their answers and then spent the rest of the period listening to commercially prepared recordings. There was absolutely no written material presented during the lab hour. The nonlab group read the same stories and answered the questions in writing. If any time remained, they did free reading from a collection of books at their level.

At the end of the school year, all groups were given three standardized tests (including the cooperative) that measured reading, vocabulary, grammar, speaking, and listening. Allen only reports means and standard deviations. In all cases except one, the laboratory groups scored identical to or higher than the nonlab groups. The exception was the Intermediate Spanish lab group (n = 5), which scored lower on the speaking test. In several cases, the differences between the means were large, but Allen did not compute any test of significance. In his brief conclusion, however, he claimed that the laboratory groups "achieved significantly higher scores in reading, vocabulary, and grammar" (p. 357), but that there were no differences in speaking or listening. This author calculated a t test on the cooperative French test means for the largest groups, those in Elementary French (n = 10 each). The lab group had a mean of 57 (s.d. = 23); the nonlab group mean was 39.4 (s.d. = 20). This turned out to be significant at the 0.001 level.

It is fitting that the last of the studies of the 1946-58 period should be the one with the highest methodological standards. Yet the number of subjects was quite low for the design chosen, and it is baffling that Allen claimed to have found a significant difference in favor of the lab groups, but did not bother to report any data beyond means and standard deviations. Moreover, it is ironic that reading, grammar, and vocabulary scores were enhanced by listening in the language laboratory, whereas listening scores proper did not reveal any difference between the lab and nonlab groups. Thus, Allen's study gives weak but curious evidence of the language laboratory's contribution to foreign language learning.

28.8.2.3. Summary. Writing in the early 1960s, Carroll (1963) stated that virtually all previous foreign-language research "has only rarely been adequate with respect to research methodology" (p. 1094). For him, language laboratory research was not an exception to this rule. He briefly reviewed three studies concerning labs; these were not included in this section because they did not contain important results, were not widely circulated at the time (two were institutional reports), and were not cited by subsequent researchers. Therefore, what one can conclude from Carroll's review and this summary is that while the research during the 1946-1958 period did not firmly establish the positive value of language laboratories, it did provide circumstantial, and in one case (Allen, 1960) empirical, evidence in favor of this conclusion.

Writing at the close of the period under consideration, Koekkoek (1959) stated that labs were so "firmly established" in language teaching that "no teacher can remain today unaffected and disengaged" (p. 5). He went on to describe the ambivalence about them within the profession and closed his article with the hope that subsequent experience would resolve "basic questions to be expected from the use of laboratory machines and the best methods of obtaining the results" (p. 5). If the nascent body of research could only offer a cautious "thumbs-up" assessment, it also showed that those promoting labs were willing to be held responsible for their use. This was fortunate, for during the next phase of the lab's existence, a period of great growth because of major expenditures, the public would eventually demand an accounting.

28.8.2.4. Research on the Use of Audio Resources in Foreign-Language Instruction: 1959 to the Present. The massive increase in the number of language laboratories, thanks to the NDEA, prompted a comparable increase in the amount of research concerning their effectiveness. In fact, some of the studies were funded by the NDEA under its Title VI provisions. The extent of this research is such that this section cannot detail every investigation that was undertaken. The several dissertations listed by Davison (1973) will not be treated. This discussion will focus on four large-scale studies of labs: three in high schools and one in a university. These all received much attention at the time. Moreover, those studies that have been thoroughly reviewed elsewhere will be only briefly described.

During the 1961-62 school year, Keating (1963) conducted a study of the use of the language laboratory in French classes in New York City high schools. He cited Allen's study (1960) as the "only exception" (he was evidently unaware of the Brushwood & Polmantier study) to the rule that "the literature abounds with articles that describe the benefits of using language laboratories" but " contains virtually no reports upon the empirical validation" (p. 8) of them. He called Allen's results "quite interesting" but noted a possible Hawthorne effect, which he felt 11 severely compromised" (p. 8) them. Keating knew of the research being simultaneously conducted in New York City by Lorge (to be described later).

Keating's was a large-scale study involving approximately 5,000 subjects in 21 school districts. Schools were divided between laboratory and non-laboratory users based on a questionnaire filled out by each district's foreign-language coordinator. Besides this factor, groups were formed according to year of study (first through fourth years) and IQ scores (five levels). The dependent measures were reading comprehension, listening comprehension, and speech production. The cooperative test was used to test the fast two skills; however, first-year students were not given the listening portion because it was designed for intermediate and advanced students. The French speech production test was used to evaluate speaking. This instrument was constructed specifically for the study. Of note is that it was not administered to all subjects: only 519 students from 12 of the participating school districts were given it. The results showed a sole significant finding in favor of the lab groups, on speaking among first-year students. Otherwise, there were several cases of the nonlab groups scoring significantly higher.

Keating's findings were promptly and vehemently disputed. The April 1964 issue of the Modern Language Journal included four rebuttals (Anderson, Grittner, Porter & Porter, Stack). The criticisms showed much overlap. Keating was taken to task for numerous methodological flaws: failure to define what was meant by language laboratory and the activities that went on there, failure to control for amount of time spent in the lab, failure to control for the socioeconomic level of the schools and the quality of their lab installations, use of t tests when ANOVAs were called for, and sloppy reporting of results (the number of subjects per group was not consistent). Keating was also criticized for using several different IQ tests, rather than one, to group subjects. The validity of his speaking test was challenged for being in fact only a pronunciation measure. Keating was shown no mercy: Despite the disclaimers he gave about the generalizibility of his results, he was accused of spreading anti-lab propaganda by Grittner.

Because the literature of the period contains no defense of Keating's study, it can be concluded that it was dismissed by the scholarly community of the day. Unfortunately, the public was of another mind. It seized on the notion that if language laboratories are not useful, then the massive investment of tax dollars in facilities was a waste. An example of this attitude was a newspaper editorial about the Keating study entitled "Backwards Via 'Aid"' that was reprinted in the Modern Language Journal issue containing the four rebuttals. Such a response gives credence to the propaganda charge made by Grittner. He and Stack and Anderson pointed out, with great dismay, that the Institute of Administrative Research of Columbia Teacher's College, which had sponsored Keating's study, mailed out a five-page preliminary report to school administrators across the country. They viewed such an action as unprofessional; it was clearly inflammatory in its impact.

Lorge (1964) conducted two experiments in New York City high schools. The first took place during the 1961-62 school year, and the second was done the following year. Thus, the first study coincided with Keating's investigation. Whether there was any overlap of subjects between the two studies is unknown, but could hardly be problematic given that only two schools were involved in Lorge's first inquiry; Keating's entailed 21 districts. Lorge described the purpose of her study thus:

The object of the study was not to compare what a student learns from a teacher alone as opposed to what he learns from laboratory work alone. The question was whether the teacher improves the teaching-learning situation by using the laboratory as a teaching aid. The research was intended not to give the laboratory a passing or failing mark-if it passes, use it; if it fails, rip it out-but rather to determine in which areas it had proved to be successful, and how its use could be made more effective (p. 409).

The first study compared first-, second-, and third-year French classes. Unfortunately, the number of classes and subjects is not specified in the article, and the full report of the study is not available for consultation; by 1965 it was already out of print (Lorge, 1965). All that is known is that the classes were determined to be comparable based on the Stanford reading test and the Gallup-Thorndike vocabulary test. Half of the classes had 60 minutes a week of supervised lab practice in lieu of a fifth class period. The other half had five class meetings. The course content was the same for both groups. At the end of the school year, all classes were given the cooperative French test to gauge reading, vocabulary, and grammar skills. A speaking test and a listening test, both written by the experimenters, were also administered. All the tests contained subtests for which separate statistics were calculated. There were no differences between the groups on the cooperative test. The first and second-year laboratory groups tested significantly higher than the control groups on the fluency component of the speaking test. The second-year laboratory group also scored significantly higher on the intonation component. The third-year laboratory group was significantly superior in listening.

The second experiment compared two types of laboratory equipment: audio-active and recording-playback. The first was a headset with earphones and a microphone; the second was an identical headset plus a tape recorder for each student. The other factor was time. Daily usage of 20 minutes was compared to a once-a-week 60-minute session. Five groups of second-year French students were formed. It should be stressed that none of the subjects had previous laboratory experience. Moreover, during the study, the control group did not use any equipment. The other four groups were formed by crossing equipment type and usage time. The dependent measures were the same as in the first study, with the addition of a mimicry test.

The t test results from the 14 components are difficult to interpret. Some differences are reported at a .01 level of significance, others at a .05 level, but it is impossible to determine whether one group was significantly higher than all the other groups or only some of them. The rankings that were also reported are more helpful, for they allow trends to be detected. On measures of enunciation, the order was thus: (1) daily record-playback, (2) daily audio-active, (3) weekly record-playback, (4) weekly audio-active, and (5) control. Thus greater time, frequency, and more elaborate equipment favor one aspect of the speaking skill. However, as regards lexical and syntactic features of speech, the control group was ranked first, with the daily record-playback group coming in second. This finding should be considered along with the result from the composite score on the cooperative test. Here, the daily record-playback group ranked first and the control group was second. The difference between the two groups was not significant, but both groups were significantly higher than the other three groups. What emerges is this: The daily record-playback group and the control group scored similarly, and significantly better than the other groups, on both oral and written measures of vocabulary and grammar.

From the above findings, one is tempted to draw an "all or nothing" conclusion: Either use a fully equipped lab daily or dispense with it altogether. It seems that certain outcomes will be the same in either case. The corollary is that infrequent usage of a modest lab actually appears to be detrimental to the lexical and syntactic aspects of language learning! However, Lorge does not make such a counterintuitive deduction. She noted that in the first study, there were no differences between the lab and nonlab groups on the vocabulary and grammar tests. In the second study, she maintained that any measure showing statistically significant differences showed at least one laboratory group that equaled or exceeded the gains made by the control group. This appears to indicate that time spent in the laboratory contributes to conventional learnings as well as to listening and speaking skills (p. 419).

The last sentence is crucial. Taken together, these studies indicated an overall advantage for the language lab. Lorge also noted that a higher percentage of students in lab sections continued studying French beyond the 3 years required for high school graduation and college admission.

Lorge's study appears to have been well received by the scholarly community. Stack (1964) praised Lorge's work in his critique of the Keating study. Only Green (1965) ventured criticisms. Some of his complaints had to do with the manner in which the results were reported. He was more concerned with the apparent addition of another group after the study was underway. Lorge (1965) answered these objections easily in her rebuttal, which was included in the same issue of the Modern Language Journal as Green's piece.

In 1966, Philip D. Smith began an investigation of beginning high school French and German teaching and learning, which lasted through 1969. It was sponsored by the Federal Office of Education under Titles VI and VII of the NDEA and is commonly referred to in the literature as the Pennsylvania project because all the participating schools were in that state. Smith summarized his findings in 1969 articles in Foreign Language Annals (Smith, 1969a) and the French Review (Smith, 1969b), which are more accessible than the technical reports he submitted as part of the grant's requirements. The October 1969 issue (volume 53, number 6) of the Modern Language Journal contained six articles critiquing the Pennsylvania studies. The December 1969 issue (volume 3, number 2) of Foreign Language Annals contained the summary article by Smith and two review articles. Contemporary synopses of the project and its reviews by D. L. Lange (1968) and W. F. Smith (1970) will be relied on for this discussion.

In the first year of the study, 2,171 students participated. Three teaching strategies and three language laboratory systems were compared. The strategies were: traditional, functional skills, and functional skills with grammar. By traditional was meant that an emphasis was placed on vocabulary acquisition, reading and writing skills, translation, and grammatical analysis. Functional skills was a synonym for the audiolingual method; the command of a core vocabulary and key syntactic patterns was emphasized, as were the speaking and listening skills. Functional skills with grammar was, as the name indicates, the addition of grammatical explanations to the audiolingual method. The three language laboratory systems were: audio-active, audio-active record, and tape recorder in the classroom. The first consisted of two, 25-minute practice sessions each week in which a 10-minute drill tape was played twice. The second arrangement differed from the first in that the students recorded their first practice with the tape and then listened to their own responses. Both of the audio-active groups also practiced in the classroom with a tape recorder each day under the supervision of the instructor for one-fifth of the period. The tape recorder in the classroom group did no lab practice. What they did was at least 10 minutes of guided practice with the tape each day in class.

The results from the first year indicated no significant differences between the teaching strategies, except for reading, where the traditional group outperformed the two audio-active groups. There were no significant differences detected between laboratory systems. During the second year of the project, 639 first-year students participated in a replication study, and 1,090 of the original 2,171 subjects were observed in their second year of language study. The results from this second year of the investigation were in line with those of the first. In the third, year the number of subjects (third-year students) dropped to 277, and by the fourth Year it was down to 144 fourth-year students. The findings from these last 2 years showed the traditional students faring significantly better than the audio-active students in both reading and listening. In none of the 4 years of the study was a significant difference in outcomes found according to the laboratory system.

Although the Pennsylvania project generally received higher marks for its methodology than did the Keating report with which it was often compared, there were nevertheless several critiques leveled and questions raised. Some of these involved control issues, such as the degree of teacher adherence to experimental guideline, the consistency of laboratory installations and maintenance between schools, and the lack of data as to the amount of time the labs where actually used. Carroll (1969b) detected stowaway variables and practice effects. Perhaps the most serious criticism was the claim (Valette, 1969) that the cooperative test was an inappropriate measure of listening achievement. It was maintained that the vocabulary in this test was closer to what was in the textbook used by traditional groups than the one used by the lab groups. Moreover, evidence from other sources was cited which indicated that the cooperative test was simply too difficult for students in their first 3 years of foreign-language study. This second criticism had broad implications: It cast doubt on the instrument that had been used in all previous language laboratory studies and in many other studies of foreign-language teaching.

Carroll (1969b-) and Smith (1970) assessed the implications of the Pennsylvania project. For them, the supposed findings in favor of the traditional groups did not warrant a return to former means of teaching. Rather, they viewed the report, despite its faults, as a credible demonstration that the enthusiastic adoption of new approaches and accompanying materiel does not guarantee success. "The Pennsylvania studies have removed us from our tower of false security" (Sirdth, 1970, p. 208). For Carroll, the specific lessons to be learned were that audiolingual textbooks needed more linguistic content and that less emphasis should be placed on drills and other "habit formation" activities (1969; p. 235). Smith ended his review on an upbeat note: "It is time to meet the challenge of a new decade" (1970, p. 208). But such a positive attitude did not prevail. As was noted in the historical section above, language laboratories were in the doldrums in the 1970s and early 1980s. Davies (1982) singled out the Pennsylvania project for making complete the growing disillusionment of the period with labs. Moreover, it appears that the study discouraged other research, for it was the last of the large scale inquiries into the language laboratory's effectiveness.

The only major inquiry of the language laboratory involving postsecondary students will now be discussed. Scherer and West (1964) described in a 246-page book the 2-year NDEA-sponsored investigation they conducted from September 1960. Their goal was to compare the audiolingual approach to the traditional grammar-reading method. Thus, this was not an examination of the language laboratory per se; rather, it was an inquiry similar to the Pennsylvania project (not yet conducted), which was interested in the language lab because of its intimate connection to the audiolingual method. The subjects were beginning German students at the University of Colorado. Intact classes were used, and these were determined to be similar on measures of general academic ability, language learning aptitude, and motivation, as well as sex, age, and year in school.

All of the teaching staff received a week of training in the respective methods prior to the start of the experiment. In addition, there were weekly meetings and frequent observations by the principal investigators and outside consultants to ensure that the instructors adhered to the experiment's guidelines. The traditional approach is only scantily described, but the audiolingual procedures are elaborately detailed in Scherer and Wertheimer's book. The essence of the latter was dialogue memorization and related drill and practice in class. The frequency and duration of the lab sessions were unfortunately not specified; they were for "overlearning" (p. 83) the material presented in class. It is stated that the lab sessions were unmonitored and were of the "library-type" (p. 83), which presumably means the students attended at their convenience. Of note is the postponement of reading for the audiolingual group until the 12th week of the semester. To be specific, the audiolingual group saw absolutely no written German until that point. When reading began, it consisted of the dialogues that had been previously memorized and recombinations of the vocabulary contained in them.

The investigators claimed that they conducted a "persistent and continuous search" (p. 108) for standardized tests to use to measure the outcomes of the two teaching approaches. They were not satisfied with what they found, because "nothing that the major test distributors had to offer seemed to meet the requirements of our situation" (p. 108). They therefore constructed tests of the four language skills and two for translation: German-to-English and vice versa. The t test statistic was used for comparisons. At the end of the first year, the audiolingual students were significantly superior to the traditional students in speaking and listening. The superiority in speaking was maintained in the second year, but the advantage for listening was not. On the other hand, the traditional students significantly outperformed the audiolingual students on reading and writing during the first year, and maintained their edge on the latter skill during the second year. The traditional students also were higher in German-to-English translation during both years, and better in English-to-German translation in the first year.

In addition to these measures of linguistic proficiency, Scherer and Wertheimer also used standardized scales and questionnaires they constructed to evaluate the subjects' motivation to study German and their attitude to it and its speakers. They were also concerned with "habituated direct association." By this was meant the ability of the students to think in German, their inclination to translate or not, and their sensitivity to semantic nuances -between the two languages. Numerous intercorrelations between these and measures of affective constructs such as anomie, social inhibition, and desire for further German study were calculated. The researchers summarized their work thus:

The experiment has demonstrated that the two methods, while yielding occasionally strong and persisting differences in various aspects of proficiency in German, result in comparable overall proficiency. But the audiolingual method, whether its results are measured objectively or estimated by the students themselves, appears to produce more desirable attitudes and better habituated direct association (p. 245).

John B. Carroll (1969a) characterized the Scherer and Wertheimer study as "ambitious" (p. 869) and more rigorously designed than any previous examination of the audiolingual approach. He accepted the investigators' conclusions as valid, but offered the following:

The conclusion that emerges from this experiment is that the differences between the audiolingual and traditional methods are primarily differences of objectives; not surprisingly, students learn whatever skills are emphasized in the instruction (pp. 869-870).

Besides the large-scale and well-publicized studies of Keating, Lorge, Smith, and Scherer and Wertheimer, there have been many smaller investigations since 1959. Eight studies that appeared in major journals have been selected for inclusion here according to chronological order. Only 'their main findings will be given, since these studies in general did not generate the interest of the larger studies that were described above.

Bauer (1964) found that university students who used the language laboratory in a supervised group-practice condition performed significantly better on oral and dictation measures, but not on a writing measure, than students who studied individually and were not supervised. Two drawbacks to the study were the low number of subjects (N = 24) and the use of nonstandardized tests. Moreover, a close examination of the data reveals that the supervised subjects as a group used the lab 125 minutes more over a 3.5-week period than the unsupervised subjects, so the observed differences could possibly be attributed to greater time-on-task.

Young and Choquette's NDEA-sponsored study (1965) was a series of seven experiments that sought to determine whether any of four language laboratory equipment configurations made a difference in the subjects' abilities to self-monitor their pronunciation. The systems were characterized by the feedback options they presented: (1) passive, (2) active, (3) long-delayed comparison, and (4) short-delayed comparison. The first three systems were standard options for language laboratory installations at the time. An apparatus for the fourth condition was specially fashioned for the study by the investigators. In the passive arrangement, the subjects repeated after taped prompts, but they could not clearly hear their responses because the headsets muffled their voices. In the active arrangement, subjects could bear their responses amplified through their headsets as they spoke. In third option, subjects could record their answers for later comparison. In the fourth setup, the students could hear their recorded response within I 1h seconds of making them. Subjects in the active feedback configuration were found to have slightly superior pronunciation than subjects in the other arrangements. However, the authors qualified this finding on several grounds. Of note was the lower sound quality of the fabricated equipment used in the short-delay condition. The authors admitted that this hampered a true comparison with the other three conditions.

Buka, Freeman, and Locke (1962) and Freeman and Buka (1965) conducted experiments that sought to establish psycho-acoustic parameters for language laboratory equipment The first study determined that a high-frequency cutoff of less than 7,300 cps hindered subjects (high school students) from perceiving certain phonemic contrasts in German and French. The second study found that a low-frequency cutoff of 500 cps caused subjects (again high school students) to make significantly more errors in German phoneme discrimination than a 50-cps cutoff. However, no significant differences were found between these two levels for French phoneme discrimination. It was also found that consonant distinctions were more affected than vowel distinctions by the degradation of sound quality brought on by filtering.

Benathy and Jordan (1969) reported on a post hoc comparison of achievement scores in Bulgarian courses at the Defense Language Institute. The scores of 13 classes (87 students) that completed the course between August 1959 and September 1963 were compared to the scores of 15 classes (103 students) that finished between November 1963 and July 1967. The difference between these classes was the introduction in the fall of 1963 of the Classroom Laboratory Instructional System (CLIS): CLIS is a designed interaction of live instruction and a set of different kinds of learning experiences that make use of prepared and recorded instructional materials, delivered through the electronic media (p. 473).

The authors stressed that the CLIS system kept the learners on task much more than in a typical classroom. This was because the earphones both isolated each learner from the erroneous responses and pronunciations of others and provided quality native-speaker models. Moreover, the learner did not wait to be called on as in a regular class; it was always his or her "turn." The equipment used appeared to be that of a typical audio-active language laboratory, although the authors do not use the term in their article. Curiously, they do not cite any language laboratory literature in their discussion, yet their description and justification for CLIS are identical to those commonly found in language laboratory writings.

The two groups were found to be very similar in ages and scores of the Army Language Aptitude Test. Class sizes were nearly identical, and the same textbooks and proficiency test were used throughout the 8-year period. It was found that the CLIS classes scored significantly higher than the pre-CLIS classes on the two skills measured by the test, namely, reading and listening. The differences were especially pronounced in the case of the latter skill.

Despite the many experimental controls and the marked differences between the groups, there are three questions that may be raised about this study. First of all, as no mention of instructors is made, one wonders whether teacher effects were held constant. Secondly, the generalizability of the results to high school and university students is doubtful, given that the subjects were all adults studying for specific career purposes at the Defense Language Institute. A third consideration is a question: Why did Benathy and Jordan not more fully report on the synchronous study that Preceded the longitudinal one? They claimed similar significant results from it in favor of the CLIS. More information (i.e., number of subjects, a figure showing t values) about it would give greater credibility to their overall conclusion.

The Chomei and Houlihan (1970) study compared three language laboratory systems: instant playback, long-delay playback, and audio-active. The instant playback option allowed the subjects to have their recorded response to the program stimulus echoed back within half a second. The long-delay group had to rewind the tape to hear their recordings. The audio-active group did not record their responses. It can thus be seen that this study closely resembled what had been done by Young and Choquette (1965), but, surprisingly, this earlier work was not cited. The subjects in the Chomei and Houlihan investigation were 140 Japanese l0th-graders, who were all taught by the same instructor. It was found that the instant-playback group performed significantly better than the other groups on one out of five translation tests and on four out of five speaking tests that had been specially created for the experiment.

Sisson (1970) did a study that was sponsored by the U.S. Office of Education. Its aim was to settle the controversy among language educators as to the benefit (or lack thereof) of delayed comparison on students' ability to perceive and produce the phonemes of another language. Thus, this study shared the same goal as the work of Young and Choquette (1965) and Chomei and Houlihan (1970). That Sisson did not cite the latter is understandable, since it was contemporary to his own. What is surprising is that he ignored the former, yet did cite 39 other articles. In this oversight he followed Chomei and Houlihan, as pointed out before. Why a major study published in a leading journal was so ignored is an unanswered question in the record.

Sisson claimed that "die variables of learning environment were controlled as closely as possible with respect to identity of instructors, scheduling of laboratory lessons, and use of classroom and laboratory materials" (p. 82). The special equipment used in the study, the Plurilingua language laboratory, was thoroughly described. The subjects were 24 students of English as a second language at the University of Michigan. They were in three intact classes of eight students each. The classes were matched on the basis of a modified version of the test of Aural Perception for Latin American Students. This instrument had a phoneme discrimination section and two phoneme production portions.

Two conditions were compared. Half of the students (four from each of the three classes) listened to a taped stimulus and recorded their answer. On completion of an exercise, these subjects rewound the tape and repeated the exercise in the same manner. These subjects formed the "active group." The other group of subjects recorded their responses, as did the active group. However, at the completion of the exercise, these subjects rewound their tape and listened to their first responses rather than record them a second time. This was the "delayed-comparison group." Both groups spent I hour per week in the language laboratory during the 8-week term. The modified version of the test of Aural Perception for Latin American Students, which had been used as the pretest was also used as the posttest. Sisson found no significant difference between the two groups on either discrimination or production.

Smith (1980) conducted a study to determine whether the slowing down of recorded material had a beneficial effect on listening comprehension. The reader will recall from the section on history that, during the 1960s, equipment became available which was capable of slowing down (expanding) or speeding up (compressing) recordings without distortion. Smith claimed that his search of the literature turned up no reference to studies addressing the specific application of this technology to foreign-language instruction. This claim was proved incorrect in Driscoll's article (1981), which listed several such studies. However, it should be pointed out that Driscoll was also guilty of oversight; he omitted Smith's study even though it was in the same outlet, the NALLD Journal, as his article.

Smith's subjects were second-semester students of French at West Chester State College in Pennsylvania. The control group had 11 members, and the experimental, 12. The cooperative test was administered as a pretest, and the control group was found to be significantly better in reading ability than the experimental group, but both groups were equal in listening comprehension, the skill at issue in the investigation. The study stretched over the fall 1978 semester. The control group covered 12 audio lessons that were recorded at normal speed. The experimental group listened to four lessons that were slowed by 20%, four that were slowed by 10%, and four that were at normal speed. At the end of semester, the students were again given the cooperative tests. Contrary to expectations, the ANCOVA and Finney t test procedures showed that the control group scored significantly higher on listening comprehension than the experimental group who listened to expanded material.

Despite such a clear-cut albeit counterintuitive finding, Smith cautioned that the study needed to be replicated with a larger number of subjects and for other languages before it could be reasonably concluded that expanded speech was not beneficial, or perhaps even harmful, for the acquiring of listening proficiency in a foreign language. Unfortunately, there is no record of replications by Smith or others. Whether the magnitude of Smith's findings squelched any other initiatives can only be conjectured. It should be pointed out that manufacturers continued to include expansion and compression capabilities in the "deluxe" models of their equipment. It can only be concluded that many practitioners appreciated these features and purchased them, although they had no independent, empirical confirmation of their effectiveness.

28.8.2.4.1. Summary. Twelve studies conducted since the passage of the NDEA in 1958 were discussed in this section. They differed considerably in scale, populations, and methodology. Although all concerned language laboratories in some way, they did not all seek to answer the same questions other than the general one of effectiveness. For these reasons, it is difficult to draw a conclusion. This body of research does not offer clear-cut confirmation of the utility of language laboratories, yet neither does it suggest that they are detrimental to language learning. Perhaps the inconclusiveness of the record is because the investigations that were conducted were not following an agreed-upon agenda, The larger educational technology community began the period with such an agenda (Allen, 1959; Meierhenry, 1962). This lack of focus was costly: Pederson (1987) claimed that it was the lack of solid research concerning courseware that led to the decline of language laboratories.

It would be hasty, however, to dismiss all language laboratory research. It can readily be determined that the use of audio resources within the foreign-language community has differed significantly from that of the larger educational technology community. Not surprisingly, this different use fostered different research. What was unique to the utilization and study of audio resources within foreign-language circles? One can first note the interest in psychophysics and the acoustic parameters of equipment. Besides Buka, Freeman, and Locke (1962) and Freeman and Buka (1965), who were discussed above, Hayes (1963) should be mentioned. He culled a wide range of human factors literature in order to offer standards to be used in laboratory purchase specifications. At this time, the broader educational technology community was more concerned with visual rather than auditory perception. A clear example of this pictorial bias is the fifth issue of volume 10 of the Audio-Visual Communication Review (1962), which was entitled "Perception Theory and AV Education." It contained no mention of the aural sense. Such a slanting of interest belies the "audio" component in the name of the flagship journal of the educational technology field at the time. Saettler's The Evolution of American Educational Technology shows that this inclination persists; visual media are accorded much more attention than are audio media.

Related to acoustic and perceptual matters are equipment features. Some of the studies reviewed in this section of the chapter (e.g., Young & Choquette, 1965; Chomei & Houlihan, 1970) were concerned with this issue. This is also unique to the body of language laboratory research. Only the studies of compressed and expanded speech showed an interest in machine capabilities.

The reader must be aware that in foreign language teaching, audio materials are not used merely to present declarative information in an alternative medium; rather audio is the medium. What is meant is that language as normally used in everyday fife is predominantly oral and aural rather than visual. Knowledge of a language is more properly considered to be procedural; such knowledge must be inculcated and learned by doing. This perhaps explains why the foreign-language community showed little, if any, interest in the channel effectiveness literature and audio-tutorial instruction (ATI). That the latter was not pursued is somewhat surprising when one considers that Smith (1970), in his essay in the prominent ACTFL yearbook, had proposed that ATI held promise for better foreign-language courseware.

The record shows that periodically foreign-language practitioners have been diligently involved in the use of audio technology. The record also shows that this enthusiasm has waned, and the blame was put on the lack of adequate research. In order to avoid a repeat of this mistake, it would appear that research is required. Audio technology is still in use: less and less alone, but more and more in computer-based multimedia systems. These systems permit the easy collecting of usage data. One can hope that audio technology research will enter a new phase with this new equipment.

28.8.3 Conclusions about Language Labs

According to Last (1989), "language teachers as a body have been more ready than most to accept and explore the pedagogical potential of new technologies as they have emerged" (p. 15). Yet Richards and Nunan (1992) judge that "technology at present is underexploited in language teaming and teaching" (p. 1203). It is not the purpose of this chapter to resolve this contradiction. Suffice it to say that there is evidence for both positions, and one has reason to be optimistic about the future. In 1990 the U.S. Department of Education funded the first National Foreign Language Resource Center (by ruiz at tforge). Two centers, the University of Hawaii and San Diego State University, have been offering workshops on the use of technology. The FLAME (Foreign Language Applications in Multimedia Environment) project at the University of Michigan is another reason for hope.

At the outset of this portion of the chapter, it was stated that the larger educational technology community has not fully appreciated the history of language laboratories. The scant attention paid to them in Saettler's The Evolution of American Educational Technology was cited to support this point. Nor has the history of the research that accompanied language laboratories been acknowledged heretofore. The proof of this contention can be seen in Allen's (1971) review of past research. This essay in the AVCR by its longtime editor contained no mention of what were then recent studies, some of which had attracted much attention in the popular press. It is hoped that this summary will contribute to setting the record straight.


Updated August 3, 2001
Copyright © 2001
The Association for Educational Communications and Technology

AECT
1800 North Stonelake Drive, Suite 2
Bloomington, IN 47404

877.677.AECT (toll-free)
812.335.7675

AECT Home Membership Information Conferences & Events AECT Publications Post and Search Job Listings