Phonetics

Phonetics is a branch of linguistics that studies how humans produce and perceive sounds or, in the case of sign languages, the equivalent aspects of sign.[1] Linguists who specialize in studying the physical properties of speech are phoneticians. The field of phonetics is traditionally divided into three sub-disciplines based on the research questions involved: how humans plan and execute movements to produce speech (articulatory phonetics), how various movements affect the properties of the resulting sound (acoustic phonetics), and how humans convert sound waves to linguistic information (auditory phonetics). Traditionally, the minimal linguistic unit of phonetics is the phone, a speech sound in a language, which differs from the phonological unit of the phoneme; the phoneme is an abstract categorization of phones.

Phonetics deals with two aspects of human speech: production—the ways humans make sounds—and perception—the way speech is understood. The communicative modality of a language describes the method by which a language is produced and perceived. Languages with oral-aural modalities such as English produce speech orally (using the mouth) and perceive speech aurally (using the ears). Sign languages, such as Australian Sign Language (Auslan) and American Sign Language (ASL), have a manual-visual modality, producing speech manually (using the hands) and perceiving speech visually (using the eyes). ASL and some other sign languages also have a manual-manual dialect for use in tactile signing by deafblind speakers, in which signs are both produced and perceived with the hands.

Language production consists of several interdependent processes which transform a non-linguistic message into a spoken or signed linguistic signal. After identifying a message to be linguistically encoded, a speaker must select the individual words—known as lexical items—to represent that message in a process called lexical selection. During phonological encoding, the mental representations of the words are assigned their phonological content as a sequence of phonemes to be produced. The phonemes are specified for articulatory features which denote particular goals such as closed lips or the tongue in a particular location. These phonemes are then coordinated into a sequence of muscle commands that can be sent to the muscles, and when these commands are executed properly the intended sounds are produced.

These movements disrupt and modify an airstream which results in a sound wave. The modification is done by the articulators, with different places and manners of articulation producing different acoustic results. For example, the words tack and sack both begin with alveolar sounds in English, but differ in how far the tongue is from the alveolar ridge. This difference has large effects on the air stream and thus the sound that is produced. Similarly, the direction and source of the airstream can affect the sound. The most common airstream mechanism is pulmonic—using the lungs—but the glottis and tongue can also be used to produce airstreams.

Language perception is the process by which a linguistic signal is decoded and understood by a listener. In order to perceive speech the continuous acoustic signal must be converted into discrete linguistic units such as phonemes, morphemes, and words. In order to correctly identify and categorize sounds, listeners prioritize certain aspects of the signal that can reliably distinguish between linguistic categories. While certain cues are prioritized over others, many aspects of the signal can contribute to perception. For example, though oral languages prioritize acoustic information, the McGurk effect shows that visual information is used to distinguish ambiguous information when the acoustic cues are unreliable.

Modern phonetics has three branches:

  • Articulatory phonetics, which studies the way sounds are made with the articulators
  • Acoustic phonetics, which studies the acoustic results of different articulations
  • Auditory phonetics, which studies the way listeners perceive and understand linguistic signals

History

Antiquity

The first known phonetic studies were carried out as early as the 6th century BCE by Sanskrit grammarians.[2] The Hindu scholar Pāṇini is among the most well known of these early investigators, whose four-part grammar, written around 350 BCE, is influential in modern linguistics and still represents "the most complete generative grammar of any language yet written".[3] His grammar formed the basis of modern linguistics and described several important phonetic principles, including voicing. This early account described resonance as being produced either by tone, when vocal folds are closed, or noise, when vocal folds are open. The phonetic principles in the grammar are considered "primitives" in that they are the basis for his theoretical analysis rather than the objects of theoretical analysis themselves, and the principles can be inferred from his system of phonology.[4]

The Sanskrit study of phonetics is called Shiksha. The Taittiriya Upanishad, dated to the 1st millennium BCE, defines Shiksha as follows:

Om! We will explain the Shiksha.
Sounds and accentuation, Quantity (of vowels) and the expression (of consonants),
Balancing (Saman) and connection (of sounds), So much about the study of Shiksha. || 1 ||

Taittiriya Upanishad 1.2, Shikshavalli, translated by Paul Deussen.[5]

Modern

Advancements in phonetics after Pāṇini and his contemporaries were limited until the modern era, save some limited investigations by Greek and Roman grammarians. In the millennia between the Indic grammarians and modern phonetics, the focus shifted away from the difference between spoken and written language, which had been the driving force behind Pāṇini's account, toward the physical properties of speech alone. Sustained interest in phonetics began again around 1800 CE, with the term "phonetics" first used in the present sense in 1841.[6][2] With new developments in medicine and the development of audio and visual recording devices, phoneticians were able to use and review new and more detailed data. This early period of modern phonetics included the development of an influential phonetic alphabet based on articulatory positions by Alexander Melville Bell. Known as visible speech, it gained prominence as a tool in the oral education of deaf children.[2]

Before the widespread availability of audio recording equipment, phoneticians relied heavily on a tradition of practical phonetics to ensure that transcriptions and findings were consistent across phoneticians. This training involved both ear training—the recognition of speech sounds—and production training—the ability to produce sounds. Phoneticians were expected to learn to recognize by ear the various sounds of the International Phonetic Alphabet, and the IPA still tests and certifies speakers on their ability to accurately produce the phonetic patterns of English (though they have discontinued this practice for other languages).[7] As a revision of his visible speech method, Melville Bell developed a description of vowels by height and backness resulting in 9 cardinal vowels.[8] As part of their training in practical phonetics, phoneticians were expected to learn to produce these cardinal vowels in order to anchor their perception and transcription of these phones during fieldwork.[7] This approach was critiqued by Peter Ladefoged in the 1960s based on experimental evidence in which he found that cardinal vowels were auditory rather than articulatory targets, challenging the claim that they represented articulatory anchors by which phoneticians could judge other articulations.[9]

Production

Language production consists of several interdependent processes which transform a nonlinguistic message into a spoken or signed linguistic signal. Linguists debate whether the process of language production occurs in a series of stages (serial processing) or whether production processes occur in parallel. After identifying a message to be linguistically encoded, a speaker must select the individual words—known as lexical items—to represent that message in a process called lexical selection. The words are selected based on their meaning, which in linguistics is called semantic information. Lexical selection activates the word's lemma, which contains both semantic and grammatical information about the word.[10][a]

After an utterance has been planned,[b] it then goes through phonological encoding. In this stage of language production, the mental representations of the words are assigned their phonological content as a sequence of phonemes to be produced. The phonemes are specified for articulatory features which denote particular goals such as closed lips or the tongue in a particular location. These phonemes are then coordinated into a sequence of muscle commands that can be sent to the muscles, and when these commands are executed properly the intended sounds are produced.[12] Thus the process of production from message to sound can be summarized as the following sequence:[c]

  • Message planning
  • Lemma selection
  • Retrieval and assignment of phonological word forms
  • Articulatory specification
  • Muscle commands
  • Articulation
  • Speech sounds

Place of articulation

Sounds which are made by a full or partial constriction of the vocal tract are called consonants. Consonants are pronounced in the vocal tract, usually in the mouth, and the location of this constriction affects the resulting sound. Because of the close connection between the position of the tongue and the resulting sound, the place of articulation is an important concept in many subdisciplines of phonetics.

Sounds are partly categorized by the location of a constriction as well as the part of the body doing the constricting. For example, in English the words fought and thought are a minimal pair differing only in the organ making the constriction rather than the location of the constriction. The "f" in fought is a labiodental articulation made with the bottom lip against the teeth. The "th" in thought is a linguodental articulation made with the tongue against the teeth. Constrictions made by the lips are called labials while those made with the tongue are called linguals.

Constrictions made with the tongue can be made in several parts of the vocal tract, broadly classified into coronal, dorsal and radical places of articulation. Coronal articulations are made with the front of the tongue, dorsal articulations are made with the back of the tongue, and radical articulations are made in the pharynx.[13] These divisions are not sufficient for distinguishing and describing all speech sounds.[13] For example, in English the sounds [s] and [ʃ] are both coronal, but they are produced in different places of the mouth. To account for this, more detailed places of articulation are needed based upon the area of the mouth in which the constriction occurs.[14]

Labial

Articulations involving the lips can be made in three different ways: with both lips (bilabial), with one lip and the teeth (labiodental), and with the tongue and the upper lip (linguolabial).[15] Depending on the definition used, some or all of these kinds of articulations may be categorized into the class of labial articulations. Bilabial consonants are made with both lips. In producing these sounds the lower lip moves farthest to meet the upper lip, which also moves down slightly,[16] though in some cases the force from air moving through the aperture (opening between the lips) may cause the lips to separate faster than they can come together.[17] Unlike most other articulations, both articulators are made from soft tissue, and so bilabial stops are more likely to be produced with incomplete closures than articulations involving hard surfaces like the teeth or palate. Bilabial stops are also unusual in that an articulator in the upper section of the vocal tract actively moves downwards, as the upper lip shows some active downward movement.[18] Linguolabial consonants are made with the blade of the tongue approaching or contacting the upper lip. Like in bilabial articulations, the upper lip moves slightly towards the more active articulator. Articulations in this group do not have their own symbols in the International Phonetic Alphabet, rather, they are formed by combining an apical symbol with a diacritic implicitly placing them in the coronal category.[19][20] They exist in a number of languages indigenous to Vanuatu such as Tangoa.

Labiodental consonants are made by the lower lip rising to the upper teeth. Labiodental consonants are most often fricatives while labiodental nasals are also typologically common.[21] There is debate as to whether true labiodental plosives occur in any natural language,[22] though a number of languages are reported to have labiodental plosives including Zulu,[23] Tonga,[24] and Shubi.[22]

Coronal

Coronal consonants are made with the tip or blade of the tongue and, because of the agility of the front of the tongue, represent a variety not only in place but in the posture of the tongue. The coronal places of articulation represent the areas of the mouth where the tongue contacts or makes a constriction, and include dental, alveolar, and post-alveolar locations. Tongue postures using the tip of the tongue can be apical if using the top of the tongue tip, laminal if made with the blade of the tongue, or sub-apical if the tongue tip is curled back and the bottom of the tongue is used. Coronals are unique as a group in that every manner of articulation is attested.[19][25] Australian languages are well known for the large number of coronal contrasts exhibited within and across languages in the region.[26] Dental consonants are made with the tip or blade of the tongue and the upper teeth. They are divided into two groups based upon the part of the tongue used to produce them: apical dental consonants are produced with the tongue tip touching the teeth; interdental consonants are produced with the blade of the tongue as the tip of the tongue sticks out in front of the teeth. No language is known to use both contrastively though they may exist allophonically. Alveolar consonants are made with the tip or blade of the tongue at the alveolar ridge just behind the teeth and can similarly be apical or laminal.[27]

Crosslinguistically, dental consonants and alveolar consonants are frequently contrasted, leading to a number of generalizations of crosslinguistic patterns. The different places of articulation also tend to be contrasted in the part of the tongue used to produce them: most languages with dental stops have laminal dentals, while languages with alveolar stops usually have apical alveolars. Languages rarely have two consonants in the same place with a contrast in laminality, though Taa (ǃXóõ) is a counterexample to this pattern.[28] If a language has only one of a dental stop or an alveolar stop, it will usually be laminal if it is a dental stop, and the stop will usually be apical if it is an alveolar stop, though for example Temne and Bulgarian[29] do not follow this pattern.[30] If a language has both an apical and a laminal stop, then the laminal stop is more likely to be affricated, as in Isoko, though Dahalo shows the opposite pattern, with alveolar stops being more affricated.[31]

Retroflex consonants have several different definitions depending on whether the position of the tongue or the position on the roof of the mouth is given prominence. In general, they represent a group of articulations in which the tip of the tongue is curled upwards to some degree. In this way, retroflex articulations can occur in several different locations on the roof of the mouth including alveolar, post-alveolar, and palatal regions. If the underside of the tongue tip makes contact with the roof of the mouth, it is sub-apical, though apical post-alveolar sounds are also described as retroflex.[32] Typical examples of sub-apical retroflex stops are commonly found in Dravidian languages, and in some languages indigenous to the southwestern United States the contrastive difference between dental and alveolar stops is a slight retroflexion of the alveolar stop.[33] Acoustically, retroflexion tends to affect the higher formants.[33]

Articulations taking place just behind the alveolar ridge, known as post-alveolar consonants, have been referred to using a number of different terms. Apical post-alveolar consonants are often called retroflex, while laminal articulations are sometimes called palato-alveolar;[34] in the Australianist literature, these laminal stops are often described as 'palatal' though they are produced further forward than the palate region typically described as palatal.[26] Because of individual anatomical variation, the precise articulation of palato-alveolar stops (and coronals in general) can vary widely within a speech community.[35]

Dorsal

Dorsal consonants are those consonants made using the tongue body rather than the tip or blade and are typically produced at the palate, velum or uvula. Palatal consonants are made using the tongue body against the hard palate on the roof of the mouth. They are frequently contrasted with velar or uvular consonants, though it is rare for a language to contrast all three simultaneously, with Jaqaru as a possible example of a three-way contrast.[36] Velar consonants are made using the tongue body against the velum. They are incredibly common cross-linguistically; almost all languages have a velar stop. Because both velars and vowels are made using the tongue body, they are highly affected by coarticulation with vowels and can be produced as far forward as the hard palate or as far back as the uvula. These variations are typically divided into front, central, and back velars in parallel with the vowel space.[37] They can be hard to distinguish phonetically from palatal consonants, though are produced slightly behind the area of prototypical palatal consonants.[38] Uvular consonants are made by the tongue body contacting or approaching the uvula. They are rare, occurring in an estimated 19 percent of languages, and large regions of the Americas and Africa have no languages with uvular consonants. In languages with uvular consonants, stops are most frequent followed by continuants (including nasals).[39]

Pharyngeal and laryngeal

Consonants made by constrictions of the throat are pharyngeals, and those made by a constriction in the larynx are laryngeals. Laryngeals are made using the vocal folds, as the larynx is too far down the throat to reach with the tongue. Pharyngeals, however, are close enough to the mouth that parts of the tongue can reach them.

Radical consonants either use the root of the tongue or the epiglottis during production and are produced very far back in the vocal tract.[40] Pharyngeal consonants are made by retracting the root of the tongue far enough to almost touch the wall of the pharynx. Due to production difficulties, only fricatives and approximants can be produced this way.[41][42] Epiglottal consonants are made with the epiglottis and the back wall of the pharynx. Epiglottal stops have been recorded in Dahalo.[42] Voiced epiglottal consonants are not deemed possible due to the cavity between the glottis and epiglottis being too small to permit voicing.[43]

Glottal consonants are those produced using the vocal folds in the larynx. Because the vocal folds are the source of phonation and located below the oro-nasal vocal tract, a number of glottal consonants are impossible, such as a voiced glottal stop. Three glottal consonants are possible: a voiceless glottal stop and two glottal fricatives, and all are attested in natural languages.[19] Glottal stops, produced by closing the vocal folds, are notably common in the world's languages.[43] While many languages use them to demarcate phrase boundaries, some languages like Arabic and Huautla Mazatec have them as contrastive phonemes. Additionally, glottal stops can be realized as laryngealization of the following vowel in this language.[44] Glottal stops, especially between vowels, do not usually form a complete closure. True glottal stops normally occur only when they are geminated.[45]

The larynx

 
A top-down view of the larynx.

The larynx, commonly known as the "voice box", is a cartilaginous structure in the trachea responsible for phonation. The vocal folds (cords) are held together so that they vibrate, or held apart so that they do not. The positions of the vocal folds are achieved by movement of the arytenoid cartilages.[46] The intrinsic laryngeal muscles are responsible for moving the arytenoid cartilages as well as modulating the tension of the vocal folds.[47] If the vocal folds are not close or tense enough, they will either vibrate sporadically or not at all. If they vibrate sporadically it will result in either creaky or breathy voice, depending on the degree; if they do not vibrate at all, the result will be voicelessness.

In addition to correctly positioning the vocal folds, there must also be air flowing across them or they will not vibrate. The difference in pressure across the glottis required for voicing is estimated at 1–2 cm H2O (roughly 98–196 pascals).[48] The pressure differential can fall below levels required for phonation either because of an increase in pressure above the glottis (supraglottal pressure) or a decrease in pressure below the glottis (subglottal pressure). The subglottal pressure is maintained by the respiratory muscles. Supraglottal pressure, with no constrictions or articulations, is equal to about atmospheric pressure. However, because articulations—especially consonants—represent constrictions of the airflow, the pressure in the cavity behind those constrictions can increase, resulting in a higher supraglottal pressure.[49]
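The figure of roughly 98 pascals per centimetre of water follows from the hydrostatic pressure of a 1 cm water column (taking ρ ≈ 1000 kg/m³ and g ≈ 9.81 m/s²):

    ΔP = ρ g h ≈ 1000 kg/m³ × 9.81 m/s² × 0.01 m ≈ 98 Pa

so a 1–2 cm H2O threshold corresponds to roughly 98–196 Pa.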

Lexical access

According to the lexical access model, two different stages of cognition are employed; thus, this concept is known as the two-stage theory of lexical access. The first stage, lexical selection, provides information about lexical items required to construct the functional-level representation. These items are retrieved according to their specific semantic and syntactic properties, but phonological forms are not yet made available at this stage. The second stage, retrieval of wordforms, provides information required for building the positional-level representation.[50]

Articulatory models

When producing speech, the articulators move through and contact particular locations in space resulting in changes to the acoustic signal. Some models of speech production take this as the basis for modeling articulation in a coordinate system that may be internal to the body (intrinsic) or external (extrinsic). Intrinsic coordinate systems model the movement of articulators as positions and angles of joints in the body. Intrinsic coordinate models of the jaw often use two to three degrees of freedom representing translation and rotation. These face issues with modeling the tongue which, unlike joints of the jaw and arms, is a muscular hydrostat—like an elephant trunk—which lacks joints.[51] Because of the different physiological structures, movement paths of the jaw are relatively straight lines during speech and mastication, while movements of the tongue follow curves.[52]

Straight-line movements have been used to argue articulations as planned in extrinsic rather than intrinsic space, though extrinsic coordinate systems also include acoustic coordinate spaces, not just physical coordinate spaces.[51] Models that assume movements are planned in extrinsic space run into an inverse problem of explaining the muscle and joint locations which produce the observed path or acoustic signal. The arm, for example, has seven degrees of freedom and 22 muscles, so multiple different joint and muscle configurations can lead to the same final position. For models of planning in extrinsic acoustic space, the same one-to-many mapping problem applies as well, with no unique mapping from physical or acoustic targets to the muscle movements required to achieve them. Concerns about the inverse problem may be exaggerated, however, as speech is a highly learned skill using neurological structures which evolved for the purpose.[53]

The equilibrium-point model proposes a resolution to the inverse problem by arguing that movement targets are represented as the positions of the muscle pairs acting on a joint.[d] Importantly, muscles are modeled as springs, and the target is the equilibrium point for the modeled spring-mass system. By using springs, the equilibrium-point model can easily account for compensation and response when movements are disrupted. They are considered a coordinate model because they assume that these muscle positions are represented as points in space, equilibrium points, where the spring-like action of the muscles converges.[54][55]
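As an illustration of the idea, the following minimal sketch (not drawn from the cited models; all parameter values are invented for illustration) treats a single articulator as a damped mass–spring system whose equilibrium point has been shifted to a target position. The articulator then converges on the target from any starting position, which is how the model accounts for compensation after perturbations.

    import numpy as np

    # Minimal equilibrium-point sketch: one articulator modeled as a damped
    # mass-spring system. Shifting the spring's rest position to the target
    # makes the articulator settle there regardless of where it starts.
    mass = 0.01       # kg (illustrative)
    stiffness = 50.0  # N/m, combined stiffness of the opposing muscle pair
    damping = 1.0     # N*s/m
    target = 0.02     # m, the new equilibrium point (movement goal)

    dt = 0.001        # s, integration step
    position, velocity = 0.0, 0.0
    for _ in range(300):                     # simulate 0.3 s of movement
        force = -stiffness * (position - target) - damping * velocity
        velocity += (force / mass) * dt      # semi-implicit Euler update
        position += velocity * dt

    print(f"position after 0.3 s: {position:.4f} m (target {target} m)")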

Gestural approaches to speech production propose that articulations are represented as movement patterns rather than particular coordinates to hit. The minimal unit is a gesture that represents a group of "functionally equivalent articulatory movement patterns that are actively controlled with reference to a given speech-relevant goal (e.g., a bilabial closure)."[56] These groups represent coordinative structures or "synergies" which view movements not as individual muscle movements but as task-dependent groupings of muscles which work together as a single unit.[57][58] This reduces the degrees of freedom in articulation planning, a problem especially in intrinsic coordinate models, which allows for any movement that achieves the speech goal, rather than encoding the particular movements in the abstract representation. Coarticulation is well described by gestural models as the articulations at faster speech rates can be explained as composites of the independent gestures at slower speech rates.[59]

Acoustics

 
A waveform (top), spectrogram (middle), and transcription (bottom) of a woman saying "Wikipedia" displayed using the Praat software for linguistic analysis.

Speech sounds are created by the modification of an airstream which results in a sound wave. The modification is done by the articulators, with different places and manners of articulation producing different acoustic results. Because the posture of the vocal tract, not just the position of the tongue, can affect the resulting sound, the manner of articulation is important for describing the speech sound. The words tack and sack both begin with alveolar sounds in English, but differ in how far the tongue is from the alveolar ridge. This difference has large effects on the air stream and thus the sound that is produced. Similarly, the direction and source of the airstream can affect the sound. The most common airstream mechanism is pulmonic—using the lungs—but the glottis and tongue can also be used to produce airstreams.

Voicing and phonation types

A major distinction between speech sounds is whether they are voiced. Sounds are voiced when the vocal folds begin to vibrate in the process of phonation. Many sounds can be produced with or without phonation, though physical constraints may make phonation difficult or impossible for some articulations. When articulations are voiced, the main source of noise is the periodic vibration of the vocal folds. Articulations like voiceless plosives have no acoustic source and are noticeable by their silence, but other voiceless sounds like fricatives create their own acoustic source regardless of phonation.

Phonation is controlled by the muscles of the larynx, and languages make use of more acoustic detail than binary voicing. During phonation, the vocal folds vibrate at a certain rate. This vibration results in a periodic acoustic waveform comprising a fundamental frequency and its harmonics. The fundamental frequency of the acoustic wave can be controlled by adjusting the muscles of the larynx, and listeners perceive this fundamental frequency as pitch. Languages use pitch manipulation to convey lexical information in tonal languages, and many languages use pitch to mark prosodic or pragmatic information.
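As a simple illustration, the sketch below (the 120 Hz fundamental and the 1/n amplitude fall-off are arbitrary choices, not measured values) builds a periodic waveform from a fundamental frequency and its first few harmonics; the waveform repeats every 1/f0 seconds, and that repetition rate is what listeners hear as pitch.

    import numpy as np

    sample_rate = 16000                         # samples per second
    f0 = 120.0                                  # fundamental frequency in Hz
    t = np.arange(0, 0.05, 1 / sample_rate)     # 50 ms of signal

    # Periodic source: the fundamental plus harmonics at integer multiples of f0,
    # with amplitudes falling off as 1/n as a crude stand-in for a glottal source.
    waveform = sum((1 / n) * np.sin(2 * np.pi * n * f0 * t) for n in range(1, 6))

    print(f"period = {1000 / f0:.2f} ms, harmonics at", [n * f0 for n in range(1, 6)], "Hz")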

For the vocal folds to vibrate, they must be in the proper position and there must be air flowing through the glottis.[48] Phonation types are modeled on a continuum of glottal states from completely open (voiceless) to completely closed (glottal stop). The optimal position for vibration, and the phonation type most used in speech, modal voice, exists in the middle of these two extremes. If the glottis is slightly wider, breathy voice occurs, while bringing the vocal folds closer together results in creaky voice.[60]

The normal phonation pattern used in typical speech is modal voice, where the vocal folds are held close together with moderate tension. The vocal folds vibrate as a single unit periodically and efficiently with a full glottal closure and no aspiration.[61] If they are pulled farther apart, they do not vibrate and so produce voiceless phones. If they are held firmly together they produce a glottal stop.[60]

If the vocal folds are held slightly further apart than in modal voicing, they produce phonation types like breathy voice (or murmur) and whispery voice. The tension across the vocal ligaments (vocal cords) is less than in modal voicing, allowing for air to flow more freely. Both breathy voice and whispery voice exist on a continuum loosely characterized as going from the more periodic waveform of breathy voice to the more noisy waveform of whispery voice. Acoustically, both tend to dampen the first formant, with whispery voice showing more extreme deviations.[62]

Holding the vocal folds more tightly together results in a creaky voice. The tension across the vocal folds is less than in modal voice, but they are held tightly together resulting in only the ligaments of the vocal folds vibrating.[e] The pulses are highly irregular, with low pitch and frequency amplitude.[63]

Some languages do not maintain a voicing distinction for some consonants,[f] but all languages use voicing to some degree. For example, no language is known to have a phonemic voicing contrast for vowels with all known vowels canonically voiced.[g] Other positions of the glottis, such as breathy and creaky voice, are used in a number of languages, like Jalapa Mazatec, to contrast phonemes while in other languages, like English, they exist allophonically.

There are several ways to determine if a segment is voiced or not, the simplest being to feel the larynx during speech and note when vibrations are felt. More precise measurements can be obtained through acoustic analysis of a spectrogram or spectral slice. In a spectrographic analysis, voiced segments show a voicing bar, a region of high acoustic energy in the low frequencies.[64] In examining a spectral slice (the acoustic spectrum at a given point in time), a model of the vowel being pronounced is used to reverse the filtering of the mouth, producing the spectrum of the glottis. A computational model of the unfiltered glottal signal is then fitted to the inverse-filtered acoustic signal to determine the characteristics of the glottis.[65] Visual analysis is also available using specialized medical equipment such as ultrasound and endoscopy.[64][h]
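A rough spectrographic check of voicing can be sketched as follows; the file name is only a placeholder, the 300 Hz cut-off is an arbitrary choice, and this is not a substitute for the inverse-filtering method described above.

    import numpy as np
    from scipy.io import wavfile
    from scipy.signal import spectrogram

    # Read a mono recording (placeholder file name) and compute its spectrogram.
    rate, samples = wavfile.read("speech.wav")
    freqs, times, power = spectrogram(samples.astype(float), fs=rate, nperseg=512)

    # A voicing bar appears as concentrated energy in the low frequencies, so the
    # share of energy below ~300 Hz gives a crude frame-by-frame voicing measure.
    low_energy = power[freqs < 300].sum(axis=0)
    voicing_ratio = low_energy / (power.sum(axis=0) + 1e-12)
    print(np.round(voicing_ratio, 2))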

Vowels

Vowels are broadly categorized by the area of the mouth in which they are produced, but because they are produced without a constriction in the vocal tract their precise description relies on measuring acoustic correlates of tongue position. The location of the tongue during vowel production changes the frequencies at which the cavity resonates, and it is these resonances—known as formants—which are measured and used to characterize vowels.
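A common way to measure these formants from a recording is linear predictive coding (LPC); the sketch below (the frame, sampling rate, and model order are assumed inputs, and the helper name is invented for illustration) fits an all-pole model to a short vowel frame and reads candidate formant frequencies from the angles of the model's poles.

    import numpy as np
    from scipy.linalg import solve_toeplitz

    def estimate_formants(frame, rate, order=10):
        """Rough formant estimates for a short vowel frame via autocorrelation LPC."""
        frame = frame * np.hamming(len(frame))        # window to reduce edge effects
        autocorr = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
        # Solve the Yule-Walker equations for the linear-prediction coefficients.
        lpc = solve_toeplitz(autocorr[:order], autocorr[1:order + 1])
        roots = np.roots(np.concatenate(([1.0], -lpc)))
        roots = roots[np.imag(roots) > 0]             # keep one root per conjugate pair
        freqs = np.sort(np.angle(roots) * rate / (2 * np.pi))
        return freqs                                   # lowest values approximate F1, F2, ...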

Vowel height traditionally refers to the highest point of the tongue during articulation.[66] The height parameter is divided into four primary levels: high (close), close-mid, open-mid and low (open). Vowels whose height is in the middle are referred to as mid. Slightly opened close vowels and slightly closed open vowels are referred to as near-close and near-open respectively. The lowest vowels are articulated not just with a lowered tongue but also by lowering the jaw.[67]

While the IPA implies that there are seven levels of vowel height, it is unlikely that a given language can minimally contrast all seven levels. Chomsky and Halle suggest that there are only three levels,[68] although four levels of vowel height seem to be needed to describe Danish, and it is possible that some languages might even need five.[69]

Vowel backness is divided into three levels: front, central and back. Languages usually do not minimally contrast more than two levels of vowel backness. Some languages claimed to have a three-way backness distinction include Nimboran and Norwegian.[70]

In most languages, the lips during vowel production can be classified as either rounded or unrounded (spread), although other types of lip positions, such as compression and protrusion, have been described. Lip position is correlated with height and backness: front and low vowels tend to be unrounded whereas back and high vowels are usually rounded.[71] Paired vowels on the IPA chart have the spread vowel on the left and the rounded vowel on the right.[72]

Together with the universal vowel features described above, some languages have additional features such as nasality, length and different types of phonation such as voiceless or creaky. Sometimes more specialized tongue gestures such as rhoticity, advanced tongue root, pharyngealization, stridency and frication are required to describe a certain vowel.[73]

Manner of articulation

Knowing the place of articulation is not enough to fully describe a consonant; the way in which the stricture happens is equally important. Manners of articulation describe how exactly the active articulator modifies, narrows or closes off the vocal tract.[74]

Stops (also referred to as plosives) are consonants where the airstream is completely obstructed. Pressure builds up in the mouth during the stricture, which is then released as a small burst of sound when the articulators move apart. The velum is raised so that air cannot flow through the nasal cavity. If the velum is lowered and allows for air to flow through the nose, the result is a nasal stop. However, phoneticians almost always refer to nasal stops as just "nasals".[74] Affricates are a sequence of a stop followed by a fricative at the same place of articulation.[75]

Fricatives are consonants where the airstream is made turbulent by partially, but not completely, obstructing part of the vocal tract.[74] Sibilants are a special type of fricative where the turbulent airstream is directed towards the teeth,[76] creating a high-pitched hissing sound.[77]

Nasals (sometimes referred to as nasal stops) are consonants in which there is a closure in the oral cavity and the velum is lowered, allowing air to flow through the nose.[78]

In an approximant, the articulators come close together, but not to such an extent that allows a turbulent airstream.[77]

Laterals are consonants in which the airstream is obstructed along the center of the vocal tract, allowing the airstream to flow freely on one or both sides.[77] Laterals have also been defined as consonants in which the tongue is contracted in such a way that the airstream is greater around the sides than over the center of the tongue.[79] The first definition does not allow for air to flow over the tongue.

Trills are consonants in which the tongue or lips are set in motion by the airstream.[80] The stricture is formed in such a way that the airstream causes a repeating pattern of opening and closing of the soft articulator(s).[81] Apical trills typically consist of two or three periods of vibration.[82]

Taps and flaps are single, rapid, usually apical gestures where the tongue is thrown against the roof of the mouth, comparable to a very rapid stop.[80] These terms are sometimes used interchangeably, but some phoneticians make a distinction.[83] In a tap, the tongue contacts the roof in a single motion whereas in a flap the tongue moves tangentially to the roof of the mouth, striking it in passing.

During a glottalic airstream mechanism, the glottis is closed, trapping a body of air. This allows for the remaining air in the vocal tract to be moved separately. An upward movement of the closed glottis will move this air out, resulting in an ejective consonant. Alternatively, the glottis can lower, sucking more air into the mouth, which results in an implosive consonant.[84]

Clicks are stops in which tongue movement causes air to be sucked into the mouth; this is referred to as a velaric airstream.[85] During the click, the air becomes rarefied between two articulatory closures, producing a loud 'click' sound when the anterior closure is released. The release of the anterior closure is referred to as the click influx. The release of the posterior closure, which can be velar or uvular, is the click efflux. Clicks are used in several African language families, such as the Khoisan and Bantu languages.[86]

Pulmonary and subglottal system

The lungs drive nearly all speech production, and their importance in phonetics is due to their creation of pressure for pulmonic sounds. The most common kinds of sound across languages are pulmonic egressive, where air is exhaled from the lungs.[87] The opposite is possible, though no language is known to have pulmonic ingressive sounds as phonemes.[88] However, a number of genetically and geographically diverse languages, such as Swedish, use them for paralinguistic articulations such as affirmations.[89] Both egressive and ingressive sounds rely on holding the vocal folds in a particular posture and using the lungs to draw air across the vocal folds so that they either vibrate (voiced) or do not vibrate (voiceless).[87] Pulmonic articulations are restricted by the volume of air able to be exhaled in a given respiratory cycle, known as the vital capacity.

The lungs are used to maintain two kinds of pressure simultaneously in order to produce and modify phonation. To produce phonation at all, the lungs must maintain a pressure 3–5 cm H2O higher than the pressure above the glottis. However, small and fast adjustments are made to the subglottal pressure to modify speech for suprasegmental features like stress. A number of thoracic muscles are used to make these adjustments. Because the lungs and thorax stretch during inhalation, the elastic forces of the lungs alone can produce pressure differentials sufficient for phonation at lung volumes above 50 percent of vital capacity.[90] Above 50 percent of vital capacity, the respiratory muscles are used to "check" the elastic forces of the thorax to maintain a stable pressure differential. Below that volume, they are used to increase the subglottal pressure by actively exhaling air.

During speech, the respiratory cycle is modified to accommodate both linguistic and biological needs. Exhalation, usually about 60 percent of the respiratory cycle at rest, is increased to about 90 percent of the respiratory cycle. Because metabolic needs are relatively stable, the total volume of air moved in most cases of speech remains about the same as quiet tidal breathing.[91] An increase in speech intensity of 18 dB (a loud conversation) has relatively little impact on the volume of air moved. Because their respiratory systems are not as developed as those of adults, children tend to use a larger proportion of their vital capacity compared to adults, with deeper inhales.[92]

Source–filter theory

The source–filter model of speech is a theory of speech production which explains the link between vocal tract posture and the acoustic consequences. Under this model, the vocal tract can be modeled as a noise source coupled onto an acoustic filter.[93] The noise source in many cases is the larynx during the process of voicing, though other noise sources can be modeled in the same way. The shape of the supraglottal vocal tract acts as the filter, and different configurations of the articulators result in different acoustic patterns. These changes are predictable. The vocal tract can be modeled as a sequence of tubes, closed at one end, with varying diameters, and by using equations for acoustic resonance the acoustic effect of an articulatory posture can be derived.[94] The process of inverse filtering uses this principle to analyze the source spectrum produced by the vocal folds during voicing. By taking the inverse of a predicted filter, the acoustic effect of the supraglottal vocal tract can be undone giving the acoustic spectrum produced by the vocal folds.[95] This allows quantitative study of the various phonation types.
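In the simplest case, a uniform tube closed at the glottis and open at the lips, the resonance frequencies fall at odd multiples of the quarter-wavelength frequency:

    F_n = (2n − 1) c / (4L)

With a speed of sound c of roughly 350 m/s and a vocal tract length L of roughly 17.5 cm (an often-cited figure for an adult male, used here only for illustration), this predicts resonances near 500 Hz, 1500 Hz, and 2500 Hz, close to the formants measured for a neutral, schwa-like vowel.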

Perception

Language perception is the process by which a linguistic signal is decoded and understood by a listener.[i] In order to perceive speech the continuous acoustic signal must be converted into discrete linguistic units such as phonemes, morphemes, and words.[96] In order to correctly identify and categorize sounds, listeners prioritize certain aspects of the signal that can reliably distinguish between linguistic categories.[97] While certain cues are prioritized over others, many aspects of the signal can contribute to perception. For example, though oral languages prioritize acoustic information, the McGurk effect shows that visual information is used to distinguish ambiguous information when the acoustic cues are unreliable.[98]

While listeners can use a variety of information to segment the speech signal, the relationship between acoustic signal and category perception is not a perfect mapping. Because of coarticulation, noisy environments, and individual differences, there is a high degree of acoustic variability within categories.[99] This is known as the problem of perceptual invariance: listeners are able to reliably perceive categories despite the variability in their acoustic instantiation.[100] In order to do this, listeners rapidly accommodate to new speakers and will shift their boundaries between categories to match the acoustic distinctions their conversational partner is making.[101]

Audition

How sounds make their way from the source to the brain

Audition, the process of hearing sounds, is the first stage of perceiving speech. Articulators cause systematic changes in air pressure which travel as sound waves to the listener's ear. The sound waves then hit the listener's ear drum causing it to vibrate. The vibration of the ear drum is transmitted by the ossicles—three small bones of the middle ear—to the cochlea.[102] The cochlea is a spiral-shaped, fluid-filled tube divided lengthwise by the organ of Corti which contains the basilar membrane. The basilar membrane increases in thickness as it travels through the cochlea causing different frequencies to resonate at different locations. This tonotopic design allows for the ear to analyze sound in a manner similar to a Fourier transform.[103]

The differential vibration of the basilar membrane causes the hair cells within the organ of Corti to move. This causes depolarization of the hair cells and ultimately a conversion of the acoustic signal into a neuronal signal.[104] While the hair cells do not produce action potentials themselves, they release neurotransmitter at synapses with the fibers of the auditory nerve, which does produce action potentials. In this way, the patterns of oscillations on the basilar membrane are converted to spatiotemporal patterns of firings which transmit information about the sound to the brainstem.[105]

Prosody

Besides consonants and vowels, phonetics also describes the properties of speech that are not localized to segments but to larger units of speech, such as syllables and phrases. Prosody includes auditory characteristics such as pitch, speech rate, duration, and loudness. Languages use these properties to different degrees to implement stress, pitch accents, and intonation. For example, stress in English and Spanish is correlated with changes in pitch and duration, whereas stress in Welsh is more consistently correlated with pitch than duration, and stress in Thai is correlated only with duration.[106]
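As an illustration of how one of these properties can be measured, the sketch below (a deliberately crude method with an invented function name and arbitrary search limits; practical analyses typically use dedicated tools such as Praat) estimates the pitch of a short voiced frame from the lag of the strongest autocorrelation peak.

    import numpy as np

    def estimate_pitch(frame, rate, fmin=75.0, fmax=400.0):
        """Crude autocorrelation-based pitch (f0) estimate for a voiced frame."""
        frame = frame - frame.mean()
        autocorr = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
        lo, hi = int(rate / fmax), int(rate / fmin)   # restrict to plausible pitch lags
        lag = lo + np.argmax(autocorr[lo:hi])         # lag of the strongest repetition
        return rate / lag                             # pitch estimate in Hz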

Theories of speech perception

Early theories of speech perception such as motor theory attempted to solve the problem of perceptual invariance by arguing that speech perception and production were closely linked. In its strongest form, motor theory argues that speech perception requires the listener to access the articulatory representation of sounds;[107] in order to properly categorize a sound, a listener reverse engineers the articulation which would produce that sound and by identifying these gestures is able to retrieve the intended linguistic category.[108] While findings such as the McGurk effect and case studies from patients with neurological injuries have provided support for motor theory, further experiments have not supported the strong form of motor theory, though there is some support for weaker forms of motor theory which claim a non-deterministic relationship between production and perception.[108][109][110]

Successor theories of speech perception place the focus on acoustic cues to sound categories and can be grouped into two broad categories: abstractionist theories and episodic theories.[111] In abstractionist theories, speech perception involves the identification of an idealized lexical object based on a signal reduced to its necessary components and normalizing the signal to counteract speaker variability. Episodic theories such as the exemplar model argue that speech perception involves accessing detailed memories (i.e., episodic memories) of previously heard tokens. The problem of perceptual invariance is explained by episodic theories as an issue of familiarity: normalization is a byproduct of exposure to more variable distributions rather than a discrete process as abstractionist theories claim.[111]

Subdisciplines

Acoustic phonetics

Acoustic phonetics deals with the acoustic properties of speech sounds. The sensation of sound is caused by pressure fluctuations which cause the eardrum to move. The ear transforms this movement into neural signals that the brain registers as sound. Acoustic waveforms are records that measure these pressure fluctuations.[112]

Articulatory phonetics

Articulatory phonetics deals with the ways in which speech sounds are made.

Auditory phonetics

Auditory phonetics studies how humans perceive speech sounds. Because the anatomical features of the auditory system distort the speech signal, humans do not experience speech sounds as perfect acoustic records. For example, the auditory impression of volume, measured in decibels (dB), does not linearly match the difference in sound pressure.[113]
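The decibel scale itself is logarithmic in sound pressure: sound pressure level is defined relative to a reference pressure p0 of 20 µPa as

    L_p = 20 log10(p / p0) dB

so doubling the sound pressure raises the level by only about 6 dB, and a tenfold increase raises it by 20 dB.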

The mismatch between acoustic analyses and what the listener hears is especially noticeable in speech sounds that have a lot of high-frequency energy, such as certain fricatives. To reconcile this mismatch, functional models of the auditory system have been developed.[114]

Describing sounds

Human languages use many different sounds, and in order to compare them linguists must be able to describe sounds in a way that is language-independent. Speech sounds can be described in a number of ways. Most commonly speech sounds are referred to by the mouth movements needed to produce them. Consonants and vowels are two gross categories that phoneticians define by the movements in a speech sound. More fine-grained descriptors are parameters such as place of articulation. Place of articulation, manner of articulation, and voicing are used to describe consonants and are the main divisions of the International Phonetic Alphabet consonant chart. Vowels are described by their height, backness, and rounding. Sign languages are described using a similar but distinct set of parameters to describe signs: location, movement, hand shape, palm orientation, and non-manual features. In addition to articulatory descriptions, sounds used in oral languages can be described using their acoustics. Because the acoustics are a consequence of the articulation, both methods of description are sufficient to distinguish sounds, with the choice between systems dependent on the phonetic feature being investigated.

Consonants are speech sounds that are articulated with a complete or partial closure of the vocal tract. They are generally produced by the modification of an airstream exhaled from the lungs. The respiratory organs used to create and modify airflow are divided into three regions: the vocal tract (supralaryngeal), the larynx, and the subglottal system. The airstream can be either egressive (out of the vocal tract) or ingressive (into the vocal tract). In pulmonic sounds, the airstream is produced by the lungs in the subglottal system and passes through the larynx and vocal tract. Glottalic sounds use an airstream created by movements of the larynx without airflow from the lungs. Click consonants are articulated through the rarefaction of air using the tongue, followed by releasing the forward closure of the tongue.

Vowels are syllabic speech sounds that are pronounced without any obstruction in the vocal tract.[115] Unlike consonants, which usually have definite places of articulation, vowels are defined in relation to a set of reference vowels called cardinal vowels. Three properties are needed to define vowels: tongue height, tongue backness and lip roundedness. Vowels that are articulated with a stable quality are called monophthongs; a combination of two separate vowels in the same syllable is a diphthong.[116] In the IPA, the vowels are represented on a trapezoid shape representing the human mouth: the vertical axis representing the mouth from floor to roof and the horizontal axis represents the front-back dimension.[117]

Transcription

Phonetic transcription is a system for transcribing phones that occur in a language, whether oral or sign. The most widely known system of phonetic transcription, the International Phonetic Alphabet (IPA), provides a standardized set of symbols for oral phones.[118][119] The standardized nature of the IPA enables its users to transcribe accurately and consistently the phones of different languages, dialects, and idiolects.[118][120][121] The IPA is a useful tool not only for the study of phonetics but also for language teaching, professional acting, and speech pathology.[120]

While no sign language has a standardized writing system, linguists have developed their own notation systems that describe the handshape, location and movement. The Hamburg Notation System (HamNoSys) is similar to the IPA in that it allows for varying levels of detail. Some notation systems such as KOMVA and the Stokoe system were designed for use in dictionaries; they also make use of alphabetic letters in the local language for handshapes whereas HamNoSys represents the handshape directly. SignWriting aims to be an easy-to-learn writing system for sign languages, although it has not been officially adopted by any deaf community yet.[122]

Sign languages

Unlike spoken languages, words in sign languages are perceived with the eyes instead of the ears. Signs are articulated with the hands, upper body and head. The main articulators are the hands and arms. Relative parts of the arm are described with the terms proximal and distal. Proximal refers to a part closer to the torso whereas a distal part is further away from it. For example, a wrist movement is distal compared to an elbow movement. Due to requiring less energy, distal movements are generally easier to produce. Various factors – such as muscle flexibility or being considered taboo – restrict what can be considered a sign.[123] Native signers do not look at their conversation partner's hands. Instead, their gaze is fixated on the face. Because peripheral vision is not as focused as the center of the visual field, signs articulated near the face allow for more subtle differences in finger movement and location to be perceived.[124]

Unlike spoken languages, sign languages have two identical articulators: the hands. Signers may use whichever hand they prefer with no disruption in communication. Due to universal neurological limitations, two-handed signs generally have the same kind of articulation in both hands; this is referred to as the Symmetry Condition.[123] The second universal constraint is the Dominance Condition, which holds that when two handshapes are involved, one hand will remain stationary and have a more limited set of handshapes compared to the dominant, moving hand.[125] Additionally, it is common for one hand in a two-handed sign to be dropped during informal conversations, a process referred to as weak drop.[123] Just like words in spoken languages, coarticulation may cause signs to influence each other's form. Examples include the handshapes of neighboring signs becoming more similar to each other (assimilation) or weak drop (an instance of deletion).[126]

References

Notes

  1. ^ Linguists debate whether these stages can interact or whether they occur serially (compare Dell & Reich (1981) and Motley, Camden & Baars (1982)). For ease of description, the language production process is described as a series of independent stages, though recent evidence shows this is inaccurate.[11] For further descriptions of interactive activation models see Jaeger, Furth & Hilliard (2012).
  2. ^ or after part of an utterance has been planned; see Gleitman et al. (2007) for evidence of production before a message has been completely planned
  3. ^ adapted from Sedivy (2019, p. 411) and Boersma (1998, p. 11)
  4. ^ See Feldman (1966) for the original proposal.
  5. ^ See #The larynx for further information on the anatomy of phonation.
  6. ^ Hawaiian, for example, does not contrast voiced and voiceless plosives.
  7. ^ There are languages, like Japanese, where vowels are produced as voiceless in certain contexts.
  8. ^ See #Articulatory models for further information on acoustic modeling.
  9. ^ As with speech production, the nature of the linguistic signal varies depending on the language modality. The signal can be acoustic for oral speech, visual for signed languages, or tactile for manual-tactile sign languages. For simplicity acoustic speech is described here; for sign language perception specifically, see Sign language#Sign perception.

Citations

  1. ^ O'Grady 2005, p. 15.
  2. ^ a b c Caffrey 2017.
  3. ^ Kiparsky 1993, p. 2918.
  4. ^ Kiparsky 1993, pp. 2922–3.
  5. ^ Deussen, Paul (1980). Sixty Upanishads of the Veda, Volume I. Motilal Banarasidass. p. 222. ISBN 978-8120814684.
  6. ^ Oxford English Dictionary 2018.
  7. ^ a b Roach 2015.
  8. ^ Ladefoged 1960, p. 388.
  9. ^ Ladefoged 1960.
  10. ^ Dell & O'Seaghdha 1992.
  11. ^ Sedivy 2019, p. 439.
  12. ^ Boersma 1998.
  13. ^ a b Ladefoged 2001, p. 5.
  14. ^ Ladefoged & Maddieson 1996, p. 9.
  15. ^ Ladefoged & Maddieson 1996, p. 16.
  16. ^ Maddieson 1993.
  17. ^ Fujimura 1961.
  18. ^ Ladefoged & Maddieson 1996, pp. 16–17.
  19. ^ a b c International Phonetic Association 2015.
  20. ^ Ladefoged & Maddieson 1996, p. 18.
  21. ^ Ladefoged & Maddieson 1996, pp. 17–18.
  22. ^ a b Ladefoged & Maddieson 1996, p. 17.
  23. ^ Doke 1926.
  24. ^ Guthrie 1948, p. 61.
  25. ^ Ladefoged & Maddieson 1996, pp. 19–31.
  26. ^ a b Ladefoged & Maddieson 1996, p. 28.
  27. ^ Ladefoged & Maddieson 1996, pp. 19–25.
  28. ^ Ladefoged & Maddieson 1996, pp. 20, 40–1.
  29. ^ Scatton 1984, p. 60.
  30. ^ Ladefoged & Maddieson 1996, p. 23.
  31. ^ Ladefoged & Maddieson 1996, pp. 23–5.
  32. ^ Ladefoged & Maddieson 1996, pp. 25, 27–8.
  33. ^ a b Ladefoged & Maddieson 1996, p. 27.
  34. ^ Ladefoged & Maddieson 1996, pp. 27–8.
  35. ^ Ladefoged & Maddieson 1996, p. 32.
  36. ^ Ladefoged & Maddieson 1996, p. 35.
  37. ^ Ladefoged & Maddieson 1996, pp. 33–34.
  38. ^ Keating & Lahiri 1993, p. 89.
  39. ^ Maddieson 2013.
  40. ^ Ladefoged & Maddieson 1996, p. 11.
  41. ^ Lodge 2009, p. 33.
  42. ^ a b Ladefoged & Maddieson 1996, p. 37.
  43. ^ a b Ladefoged & Maddieson 1996, p. 38.
  44. ^ Ladefoged & Maddieson 1996, p. 74.
  45. ^ Ladefoged & Maddieson 1996, p. 75.
  46. ^ Ladefoged 2001, p. 123.
  47. ^ Seikel, Drumright & King 2016, p. 222.
  48. ^ a b Ohala 1997, p. 1.
  49. ^ Chomsky & Halle 1968, pp. 300–301.
  50. ^ Altmann 2002.
  51. ^ a b Löfqvist 2010, p. 359.
  52. ^ Munhall, Ostry & Flanagan 1991, p. 299, et seq.
  53. ^ Löfqvist 2010, p. 360.
  54. ^ Bizzi et al. 1992.
  55. ^ Löfqvist 2010, p. 361.
  56. ^ Saltzman & Munhall 1989.
  57. ^ Mattingly 1990.
  58. ^ Löfqvist 2010, pp. 362–4.
  59. ^ Löfqvist 2010, p. 364.
  60. ^ a b Gordon & Ladefoged 2001.
  61. ^ Gobl & Ní Chasaide 2010, p. 399.
  62. ^ Gobl & Ní Chasaide 2010, p. 400-401.
  63. ^ Gobl & Ní Chasaide 2010, p. 401.
  64. ^ a b Dawson & Phelan 2016.
  65. ^ Gobl & Ní Chasaide 2010, pp. 388, et seq.
  66. ^ Ladefoged & Maddieson 1996, p. 282.
  67. ^ Lodge 2009, p. 39.
  68. ^ Chomsky & Halle 1968.
  69. ^ Ladefoged & Maddieson 1996, p. 289.
  70. ^ Ladefoged & Maddieson 1996, p. 290.
  71. ^ Ladefoged & Maddieson 1996, pp. 292–295.
  72. ^ Lodge 2009, p. 40.
  73. ^ Ladefoged & Maddieson 1996, p. 298.
  74. ^ a b c Ladefoged & Johnson 2011, p. 14.
  75. ^ Ladefoged & Johnson 2011, p. 67.
  76. ^ Ladefoged & Maddieson 1996, p. 145.
  77. ^ a b c Ladefoged & Johnson 2011, p. 15.
  78. ^ Ladefoged & Maddieson 1996, p. 102.
  79. ^ Ladefoged & Maddieson 1996, p. 182.
  80. ^ a b Ladefoged & Johnson 2011, p. 175.
  81. ^ Ladefoged & Maddieson 1996, p. 217.
  82. ^ Ladefoged & Maddieson 1996, p. 218.
  83. ^ Ladefoged & Maddieson 1996, pp. 230–231.
  84. ^ Ladefoged & Johnson 2011, p. 137.
  85. ^ Ladefoged & Maddieson 1996, p. 78.
  86. ^ Ladefoged & Maddieson 1996, pp. 246–247.
  87. ^ a b Ladefoged 2001, p. 1.
  88. ^ Eklund 2008, p. 237.
  89. ^ Eklund 2008.
  90. ^ Seikel, Drumright & King 2016, p. 176.
  91. ^ Seikel, Drumright & King 2016, p. 171.
  92. ^ Seikel, Drumright & King 2016, pp. 168–77.
  93. ^ Johnson 2008, pp. 83–5.
  94. ^ Johnson 2008, pp. 104–5.
  95. ^ Johnson 2008, p. 157.
  96. ^ Sedivy 2019, pp. 259–60.
  97. ^ Sedivy 2019, p. 269.
  98. ^ Sedivy 2019, p. 273.
  99. ^ Sedivy 2019, p. 259.
  100. ^ Sedivy 2019, p. 260.
  101. ^ Sedivy 2019, pp. 274–85.
  102. ^ Johnson 2003, pp. 46–7.
  103. ^ Johnson 2003, p. 47.
  104. ^ Schacter, Gilbert & Wegner 2011, pp. 158–9.
  105. ^ Yost 2003, p. 130.
  106. ^ Cutler 2005.
  107. ^ Sedivy 2019, p. 289.
  108. ^ a b Galantucci, Fowler & Turvey 2006.
  109. ^ Sedivy 2019, pp. 292–3.
  110. ^ Skipper, Devlin & Lametti 2017.
  111. ^ a b Goldinger 1996.
  112. ^ Johnson 2003, p. 1.
  113. ^ Johnson 2003, pp. 46–49.
  114. ^ Johnson 2003, p. 53.
  115. ^ Ladefoged & Maddieson 1996, p. 281.
  116. ^ Gussenhoven & Jacobs 2017, pp. 26–27.
  117. ^ Lodge 2009, p. 38.
  118. ^ a b O'Grady 2005, p. 17.
  119. ^ International Phonetic Association 1999.
  120. ^ a b Ladefoged 2005.
  121. ^ Ladefoged & Maddieson 1996.
  122. ^ Baker et al. 2016, pp. 242–244.
  123. ^ a b c Baker et al. 2016, pp. 229–235.
  124. ^ Baker et al. 2016, p. 236.
  125. ^ Baker et al. 2016, p. 286.
  126. ^ Baker et al. 2016, p. 239.

Works cited

  • Abercrombie, D. (1967). Elements of General Phonetics. Edinburgh: Chicago, Aldine Pub. Co.
  • Altmann, Gerry (2002). Psycholinguistics: Critical Concepts in Psychology. London: Routledge. ISBN 978-0415229906. OCLC 48014482.
  • Baker, Anne; van den Bogaerde, Beppie; Pfau, Roland; Schermer, Trude (2016). The Linguistics of Sign Languages. Amsterdam/Philadelphia: John Benjamins Publishing Company. ISBN 978-90-272-1230-6.
  • Baumbach, E. J. M (1987). Analytical Tsonga Grammar. Pretoria: University of South Africa.
  • Bizzi, E.; Hogan, N.; Mussa-Ivaldi, F.; Giszter, S. (1992). "Does the nervous system use equilibrium-point control to guide single and multiple joint movements?". Behavioral and Brain Sciences. 15 (4): 603–13. doi:10.1017/S0140525X00072538. PMID 23302290.
  • Bock, Kathryn; Levelt, Willem (2002). Altmann, Gerry (ed.). Psycholinguistics: Critical Concepts in Psychology. Vol. 5. New York: Routledge. pp. 405–407. ISBN 978-0-415-26701-4.
  • Boersma, Paul (1998). Functional phonology: Formalizing the interactions between articulatory and perceptual drives. The Hague: Holland Academic Graphics. ISBN 9055690546. OCLC 40563066.
  • Caffrey, Cait (2017). "Phonetics". Salem Press Encyclopedia. Salem Press.
  • Catford, J. C. (2001). A Practical Introduction to Phonetics (2nd ed.). Oxford University Press. ISBN 978-0-19-924635-9.
  • Chomsky, Noam; Halle, Morris (1968). The Sound Pattern of English. Harper and Row.
  • Cutler, Anne (2005). "Lexical Stress" (PDF). In Pisoni, David B.; Remez, Robert (eds.). The Handbook of Speech Perception. Blackwell. pp. 264–289. doi:10.1002/9780470757024.ch11. ISBN 978-0-631-22927-8. OCLC 749782145. Retrieved 2019-12-29.
  • Dawson, Hope; Phelan, Michael, eds. (2016). Language Files: Materials for an Introduction to Linguistics (12th ed.). The Ohio State University Press. ISBN 978-0-8142-5270-3.
  • Dell, Gary; O'Seaghdha, Padraig (1992). "Stages of lexical access in language production". Cognition. 42 (1–3): 287–314. doi:10.1016/0010-0277(92)90046-k. PMID 1582160. S2CID 37962027.
  • Dell, Gary; Reich, Peter (1981). "Stages in sentence production: An analysis of speech error data". Journal of Memory and Language. 20 (6): 611–629. doi:10.1016/S0022-5371(81)90202-4.
  • Doke, Clement M. (1926). The Phonetics of the Zulu Language. Bantu Studies. Johannesburg: Witwatersrand University Press.
  • Eklund, Robert (2008). "Pulmonic ingressive phonation: Diachronic and synchronic characteristics, distribution and function in animal and human sound production and in human speech". Journal of the International Phonetic Association. 38 (3): 235–324. doi:10.1017/S0025100308003563. S2CID 146616135.
  • Feldman, Anatol G. (1966). "Functional tuning of the nervous system with control of movement or maintenance of a steady posture, III: Mechanographic analysis of the execution by man of the simplest motor task". Biophysics. 11: 565–578.
  • Fujimura, Osamu (1961). "Bilabial stop and nasal consonants: A motion picture study and its acoustical implications". Journal of Speech and Hearing Research. 4 (3): 233–47. doi:10.1044/jshr.0403.233. PMID 13702471.
  • Galantucci, Bruno; Fowler, Carol; Turvey, Michael (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/BF03193857. PMC 2746041. PMID 17048719.
  • Gleitman, Lila; January, David; Nappa, Rebecca; Trueswell, John (2007). "On the give and take between event apprehension and utterance formulation". Journal of Memory and Language. 57 (4): 544–569. doi:10.1016/j.jml.2007.01.007. PMC 2151743. PMID 18978929.
  • Gobl, Christer; Ní Chasaide, Ailbhe (2010). "Voice source variation and its communicative functions". The Handbook of Phonetic Sciences (2nd ed.). pp. 378–424.
  • Goldinger, Stephen (1996). "Words and voices: episodic traces in spoken word identification and recognition memory". Journal of Experimental Psychology: Learning, Memory, and Cognition. 22 (5): 1166–83. doi:10.1037/0278-7393.22.5.1166. PMID 8926483.
  • Gordon, Matthew; Ladefoged, Peter (2001). "Phonation types: a cross-linguistic overview". Journal of Phonetics. 29 (4): 383–406. doi:10.1006/jpho.2001.0147.
  • Guthrie, Malcolm (1948). The classification of the Bantu languages. London: Oxford University Press.
  • Gussenhoven, Carlos; Jacobs, Haike (2017). Understanding Phonology (4th ed.). London and New York: Routledge. ISBN 9781138961418. OCLC 958066102.
  • Hall, Tracy Alan (2001). "Introduction: Phonological representations and phonetic implementation of distinctive features". In Hall, Tracy Alan (ed.). Distinctive Feature Theory. de Gruyter. pp. 1–40.
  • Halle, Morris (1983). "On Distinctive Features and their articulatory implementation". Natural Language and Linguistic Theory. 1 (1): 91–105. doi:10.1007/BF00210377. S2CID 170466631.
  • Hardcastle, William; Laver, John; Gibbon, Fiona, eds. (2010). The Handbook of Phonetic Sciences (2nd ed.). Wiley-Blackwell. ISBN 978-1-405-14590-9.
  • International Phonetic Association (1999). Handbook of the International Phonetic Association. Cambridge University Press.
  • International Phonetic Association (2015). International Phonetic Alphabet. International Phonetic Association.
  • Jaeger, Florian; Furth, Katrina; Hilliard, Caitlin (2012). "Phonological overlap affects lexical selection during sentence production". Journal of Experimental Psychology: Learning, Memory, and Cognition. 38 (5): 1439–1449. doi:10.1037/a0027862. PMID 22468803.
  • Jakobson, Roman; Fant, Gunnar; Halle, Morris (1976). Preliminaries to Speech Analysis: The Distinctive Features and their Correlates. MIT Press. ISBN 978-0-262-60001-9.
  • Johnson, Keith (2003). Acoustic and Auditory Phonetics (2nd ed.). Blackwell Pub. ISBN 1405101229. OCLC 50198698.
  • Johnson, Keith (2011). Acoustic and Auditory Phonetics (3rd ed.). Wiley-Blackwell. ISBN 978-1-444-34308-3.
  • Jones, Daniel (1948). "The London school of phonetics". Zeitschrift für Phonetik. 11 (3/4): 127–135. (Reprinted in Jones, W. E.; Laver, J., eds. (1973). Phonetics in Linguistics. Longman. pp. 180–186.)
  • Keating, Patricia; Lahiri, Aditi (1993). "Fronted Velars, Palatalized Velars, and Palatals". Phonetica. 50 (2): 73–101. doi:10.1159/000261928. PMID 8316582. S2CID 3272781.
  • Kingston, John (2007). "The Phonetics-Phonology Interface". In DeLacy, Paul (ed.). The Cambridge Handbook of Phonology. Cambridge University Press. ISBN 978-0-521-84879-4.
  • Kiparsky, Paul (1993). "Pāṇinian linguistics". In Asher, R.E. (ed.). Encyclopedia of Languages and Linguistics. Oxford: Pergamon.
  • Ladefoged, Peter (1960). "The Value of Phonetic Statements". Language. 36 (3): 387–96. doi:10.2307/410966. JSTOR 410966.
  • Ladefoged, Peter (2001). A Course in Phonetics (4th ed.). Boston: Thomson/Wadsworth. ISBN 978-1-413-00688-9.
  • Ladefoged, Peter (2005). A Course in Phonetics (5th ed.). Boston: Thomson/Wadsworth. ISBN 978-1-413-00688-9.
  • Ladefoged, Peter; Johnson, Keith (2011). A Course in Phonetics (6th ed.). Wadsworth. ISBN 978-1-42823126-9.
  • Ladefoged, Peter; Maddieson, Ian (1996). The Sounds of the World's Languages. Oxford: Blackwell. ISBN 978-0-631-19815-4.
  • Levelt, Willem (1999). "A theory of lexical access in speech production". Behavioral and Brain Sciences. 22 (1): 3–6. doi:10.1017/s0140525x99001776. hdl:11858/00-001M-0000-0013-3E7A-A. PMID 11301520. S2CID 152230066.
  • Lodge, Ken (2009). A Critical Introduction to Phonetics. New York: Continuum International Publishing Group. ISBN 978-0-8264-8873-2.
  • Löfqvist, Anders (2010). "Theories and Models of Speech Production". Handbook of Phonetic Sciences (2nd ed.). pp. 353–78.
  • Maddieson, Ian (1993). "Investigating Ewe articulations with electromagnetic articulography". Forschungsberichte des Instituts für Phonetik und Sprachliche Kommunikation der Universität München. 31: 181–214.
  • Maddieson, Ian (2013). "Uvular Consonants". In Dryer, Matthew S.; Haspelmath, Martin (eds.). The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology.
  • Mattingly, Ignatius (1990). "The global character of phonetic gestures" (PDF). Journal of Phonetics. 18 (3): 445–52. doi:10.1016/S0095-4470(19)30372-9.
  • Motley, Michael; Camden, Carl; Baars, Bernard (1982). "Covert formulation and editing of anomalies in speech production: Evidence from experimentally elicited slips of the tongue". Journal of Verbal Learning and Verbal Behavior. 21 (5): 578–594. doi:10.1016/S0022-5371(82)90791-5.
  • Munhall, K.; Ostry, D; Flanagan, J. (1991). "Coordinate spaces in speech planning". Journal of Phonetics. 19 (3–4): 293–307. doi:10.1016/S0095-4470(19)30346-8.
  • O'Connor, J.D. (1973). Phonetics. Pelican. pp. 16–17. ISBN 978-0140215601.
  • O'Grady, William (2005). Contemporary Linguistics: An Introduction (5th ed.). Bedford/St. Martin's. ISBN 978-0-312-41936-3.
  • Ohala, John (1997). "Aerodynamics of phonology". Proceedings of the Seoul International Conference on Linguistics. 92.
  • "Phonetics, n.". Oxford English Dictionary Online. Oxford University Press. 2018.
  • Roach, Peter (2015). "Practical Phonetic Training". Peter Roach. Retrieved 10 May 2019.
  • Saltzman, Elliot; Munhall, Kevin (1989). "Dynamical Approach to Gestural Patterning in Speech Production" (PDF). Ecological Psychology. 1 (4): 333–82. doi:10.1207/s15326969eco0104_2.
  • Scatton, Ernest (1984). A Reference Grammar of Modern Bulgarian. Slavica. ISBN 978-0893571238.
  • Schacter, Daniel; Gilbert, Daniel; Wegner, Daniel (2011). "Sensation and Perception". In Charles Linsmeiser (ed.). Psychology. Worth Publishers. ISBN 978-1-4292-3719-2.
  • Schiller, Niels; Bles, Mart; Jansma, Bernadette (2003). "Tracking the time course of phonological encoding in speech production: an event-related brain potential study". Cognitive Brain Research. 17 (3): 819–831. doi:10.1016/s0926-6410(03)00204-0. hdl:11858/00-001M-0000-0013-17B1-A. PMID 14561465.
  • Sedivy, Julie (2019). Language in Mind: An Introduction to Psycholinguistics (2nd ed.). ISBN 978-1605357058.
  • Seikel, J. Anthony; Drumright, David; King, Douglas (2016). Anatomy and Physiology for Speech, Language, and Hearing (5th ed.). Cengage. ISBN 978-1-285-19824-8.
  • Skipper, Jeremy; Devlin, Joseph; Lametti, Daniel (2017). "The hearing ear is always found close to the speaking tongue: Review of the role of the motor system in speech perception". Brain and Language. 164: 77–105. doi:10.1016/j.bandl.2016.10.004. PMID 27821280.
  • Stearns, Peter; Adas, Michael; Schwartz, Stuart; Gilbert, Marc Jason (2001). World Civilizations (3rd ed.). New York: Longman. ISBN 978-0-321-04479-2.
  • Trask, R.L. (1996). A Dictionary of Phonetics and Phonology. Abingdon: Routledge. ISBN 978-0-415-11261-1.
  • Yost, William (2003). "Audition". In Alice F. Healy; Robert W. Proctor (eds.). Handbook of Psychology: Experimental psychology. John Wiley and Sons. p. 130. ISBN 978-0-471-39262-0.

External links

  • Media related to Phonetics at Wikimedia Commons
  • Collection of phonetics resources by the University of North Carolina
  • "A Little Encyclopedia of Phonetics" by Peter Roach.
  • Pink Trombone, an interactive articulation simulator by Neil Thapen.

phonetics, study, phonemes, phonology, method, teaching, reading, writing, phonics, other, uses, disambiguation, branch, linguistics, that, studies, humans, produce, perceive, sounds, case, sign, languages, equivalent, aspects, sign, linguists, specialize, stu. For the study of phonemes see Phonology For the method of teaching reading and writing see Phonics For other uses see Phonetics disambiguation Phonetics is a branch of linguistics that studies how humans produce and perceive sounds or in the case of sign languages the equivalent aspects of sign 1 Linguists who specialize in studying the physical properties of speech are phoneticians The field of phonetics is traditionally divided into three sub disciplines based on the research questions involved such as how humans plan and execute movements to produce speech articulatory phonetics how various movements affect the properties of the resulting sound acoustic phonetics or how humans convert sound waves to linguistic information auditory phonetics Traditionally the minimal linguistic unit of phonetics is the phone a speech sound in a language which differs from the phonological unit of phoneme the phoneme is an abstract categorization of phones Phonetics deals with two aspects of human speech production the ways humans make sounds and perception the way speech is understood The communicative modality of a language describes the method by which a language produces and perceives languages Languages with oral aural modalities such as English produce speech orally using the mouth and perceive speech aurally using the ears Sign languages such as Australian Sign Language Auslan and American Sign Language ASL have a manual visual modality producing speech manually using the hands and perceiving speech visually using the eyes ASL and some other sign languages have in addition a manual manual dialect for use in tactile signing by deafblind speakers where signs are produced with the hands and perceived with the hands as well Language production consists of several interdependent processes which transform a non linguistic message into a spoken or signed linguistic signal After identifying a message to be linguistically encoded a speaker must select the individual words known as lexical items to represent that message in a process called lexical selection During phonological encoding the mental representation of the words are assigned their phonological content as a sequence of phonemes to be produced The phonemes are specified for articulatory features which denote particular goals such as closed lips or the tongue in a particular location These phonemes are then coordinated into a sequence of muscle commands that can be sent to the muscles and when these commands are executed properly the intended sounds are produced These movements disrupt and modify an airstream which results in a sound wave The modification is done by the articulators with different places and manners of articulation producing different acoustic results For example the words tack and sack both begin with alveolar sounds in English but differ in how far the tongue is from the alveolar ridge This difference has large effects on the air stream and thus the sound that is produced Similarly the direction and source of the airstream can affect the sound The most common airstream mechanism is pulmonic using the lungs but the glottis and tongue can also be used to produce airstreams Language perception is the process by which a linguistic signal is decoded and understood by a listener 
In order to perceive speech the continuous acoustic signal must be converted into discrete linguistic units such as phonemes morphemes and words In order to correctly identify and categorize sounds listeners prioritize certain aspects of the signal that can reliably distinguish between linguistic categories While certain cues are prioritized over others many aspects of the signal can contribute to perception For example though oral languages prioritize acoustic information the McGurk effect shows that visual information is used to distinguish ambiguous information when the acoustic cues are unreliable Modern phonetics has three branches Articulatory phonetics which addresses the way sounds are made with the articulators Acoustic phonetics which addresses the acoustic results of different articulations and Auditory phonetics which addresses the way listeners perceive and understand linguistic signals Contents 1 History 1 1 Antiquity 1 2 Modern 2 Production 2 1 Place of articulation 2 1 1 Labial 2 1 2 Coronal 2 1 3 Dorsal 2 1 4 Pharyngeal and laryngeal 2 2 The larynx 2 3 Lexical access 2 4 Articulatory models 3 Acoustics 3 1 Voicing and phonation types 3 2 Vowels 3 3 Manner of articulation 3 4 Pulmonary and subglottal system 3 5 Source filter theory 4 Perception 4 1 Audition 4 2 Prosody 4 3 Theories of speech perception 5 Subdisciplines 5 1 Acoustic phonetics 5 2 Articulatory phonetics 5 3 Auditory phonetics 6 Describing sounds 6 1 Transcription 7 Sign languages 8 See also 9 References 9 1 Notes 9 2 Citations 9 3 Works cited 10 External linksHistory EditAntiquity Edit The first known phonetic studies were carried out as early as the 6th century BCE by Sanskrit grammarians 2 The Hindu scholar Paṇini is among the most well known of these early investigators whose four part grammar written around 350 BCE is influential in modern linguistics and still represents the most complete generative grammar of any language yet written 3 His grammar formed the basis of modern linguistics and described several important phonetic principles including voicing This early account described resonance as being produced either by tone when vocal folds are closed or noise when vocal folds are open The phonetic principles in the grammar are considered primitives in that they are the basis for his theoretical analysis rather than the objects of theoretical analysis themselves and the principles can be inferred from his system of phonology 4 The Sanskrit study of phonetics is called Shiksha The Taittiriya Upanishad dated to 1 millennium BC defines Shiksha as follows Om We will explain the Shiksha Sounds and accentuation Quantity of vowels and the expression of consonants Balancing Saman and connection of sounds So much about the study of Shiksha 1 Taittiriya Upanishad 1 2 Shikshavalli Translated by Paul Deussen 5 Modern Edit Advancements in phonetics after Paṇini and his contemporaries were limited until the modern era save some limited investigations by Greek and Roman grammarians In the millennia between Indic grammarians and modern phonetics the focus shifted from the difference between spoken and written language which was the driving force behind Paṇini s account and began to focus on the physical properties of speech alone Sustained interest in phonetics began again around 1800 CE with the term phonetics being first used in the present sense in 1841 6 2 With new developments in medicine and the development of audio and visual recording devices phonetic insights were able to use and review new and more detailed 
data This early period of modern phonetics included the development of an influential phonetic alphabet based on articulatory positions by Alexander Melville Bell Known as visible speech it gained prominence as a tool in the oral education of deaf children 2 Before the widespread availability of audio recording equipment phoneticians relied heavily on a tradition of practical phonetics to ensure that transcriptions and findings were able to be consistent across phoneticians This training involved both ear training the recognition of speech sounds as well as production training the ability to produce sounds Phoneticians were expected to learn to recognize by ear the various sounds on the International Phonetic Alphabet and the IPA still tests and certifies speakers on their ability to accurately produce the phonetic patterns of English though they have discontinued this practice for other languages 7 As a revision of his visible speech method Melville Bell developed a description of vowels by height and backness resulting in 9 cardinal vowels 8 As part of their training in practical phonetics phoneticians were expected to learn to produce these cardinal vowels in order to anchor their perception and transcription of these phones during fieldwork 7 This approach was critiqued by Peter Ladefoged in the 1960s based on experimental evidence where he found that cardinal vowels were auditory rather than articulatory targets challenging the claim that they represented articulatory anchors by which phoneticians could judge other articulations 9 Production EditMain article Language production Language production consists of several interdependent processes which transform a nonlinguistic message into a spoken or signed linguistic signal Linguists debate whether the process of language production occurs in a series of stages serial processing or whether production processes occur in parallel After identifying a message to be linguistically encoded a speaker must select the individual words known as lexical items to represent that message in a process called lexical selection The words are selected based on their meaning which in linguistics is called semantic information Lexical selection activates the word s lemma which contains both semantic and grammatical information about the word 10 a After an utterance has been planned b it then goes through phonological encoding In this stage of language production the mental representation of the words are assigned their phonological content as a sequence of phonemes to be produced The phonemes are specified for articulatory features which denote particular goals such as closed lips or the tongue in a particular location These phonemes are then coordinated into a sequence of muscle commands that can be sent to the muscles and when these commands are executed properly the intended sounds are produced 12 Thus the process of production from message to sound can be summarized as the following sequence c Message planning Lemma selection Retrieval and assignment of phonological word forms Articulatory specification Muscle commands Articulation Speech soundsPlace of articulation Edit Main article Place of articulation Sounds which are made by a full or partial constriction of the vocal tract are called consonants Consonants are pronounced in the vocal tract usually in the mouth and the location of this constriction affects the resulting sound Because of the close connection between the position of the tongue and the resulting sound the place of articulation is an 
important concept in many subdisciplines of phonetics Sounds are partly categorized by the location of a constriction as well as the part of the body doing the constricting For example in English the words fought and thought are a minimal pair differing only in the organ making the construction rather than the location of the construction The f in fought is a labiodental articulation made with the bottom lip against the teeth The th in thought is a linguodental articulation made with the tongue against the teeth Constrictions made by the lips are called labials while those made with the tongue are called lingual Constrictions made with the tongue can be made in several parts of the vocal tract broadly classified into coronal dorsal and radical places of articulation Coronal articulations are made with the front of the tongue dorsal articulations are made with the back of the tongue and radical articulations are made in the pharynx 13 These divisions are not sufficient for distinguishing and describing all speech sounds 13 For example in English the sounds s and ʃ are both coronal but they are produced in different places of the mouth To account for this more detailed places of articulation are needed based upon the area of the mouth in which the constriction occurs 14 Labial Edit Articulations involving the lips can be made in three different ways with both lips bilabial with one lip and the teeth labiodental and with the tongue and the upper lip linguolabial 15 Depending on the definition used some or all of these kinds of articulations may be categorized into the class of labial articulations Bilabial consonants are made with both lips In producing these sounds the lower lip moves farthest to meet the upper lip which also moves down slightly 16 though in some cases the force from air moving through the aperture opening between the lips may cause the lips to separate faster than they can come together 17 Unlike most other articulations both articulators are made from soft tissue and so bilabial stops are more likely to be produced with incomplete closures than articulations involving hard surfaces like the teeth or palate Bilabial stops are also unusual in that an articulator in the upper section of the vocal tract actively moves downwards as the upper lip shows some active downward movement 18 Linguolabial consonants are made with the blade of the tongue approaching or contacting the upper lip Like in bilabial articulations the upper lip moves slightly towards the more active articulator Articulations in this group do not have their own symbols in the International Phonetic Alphabet rather they are formed by combining an apical symbol with a diacritic implicitly placing them in the coronal category 19 20 They exist in a number of languages indigenous to Vanuatu such as Tangoa Labiodental consonants are made by the lower lip rising to the upper teeth Labiodental consonants are most often fricatives while labiodental nasals are also typologically common 21 There is debate as to whether true labiodental plosives occur in any natural language 22 though a number of languages are reported to have labiodental plosives including Zulu 23 Tonga 24 and Shubi 22 Coronal Edit Coronal consonants are made with the tip or blade of the tongue and because of the agility of the front of the tongue represent a variety not only in place but in the posture of the tongue The coronal places of articulation represent the areas of the mouth where the tongue contacts or makes a constriction and include dental 
alveolar and post alveolar locations Tongue postures using the tip of the tongue can be apical if using the top of the tongue tip laminal if made with the blade of the tongue or sub apical if the tongue tip is curled back and the bottom of the tongue is used Coronals are unique as a group in that every manner of articulation is attested 19 25 Australian languages are well known for the large number of coronal contrasts exhibited within and across languages in the region 26 Dental consonants are made with the tip or blade of the tongue and the upper teeth They are divided into two groups based upon the part of the tongue used to produce them apical dental consonants are produced with the tongue tip touching the teeth interdental consonants are produced with the blade of the tongue as the tip of the tongue sticks out in front of the teeth No language is known to use both contrastively though they may exist allophonically Alveolar consonants are made with the tip or blade of the tongue at the alveolar ridge just behind the teeth and can similarly be apical or laminal 27 Crosslinguistically dental consonants and alveolar consonants are frequently contrasted leading to a number of generalizations of crosslinguistic patterns The different places of articulation tend to also be contrasted in the part of the tongue used to produce them most languages with dental stops have laminal dentals while languages with apical stops usually have apical stops Languages rarely have two consonants in the same place with a contrast in laminality though Taa ǃXoo is a counterexample to this pattern 28 If a language has only one of a dental stop or an alveolar stop it will usually be laminal if it is a dental stop and the stop will usually be apical if it is an alveolar stop though for example Temne and Bulgarian 29 do not follow this pattern 30 If a language has both an apical and laminal stop then the laminal stop is more likely to be affricated like in Isoko though Dahalo show the opposite pattern with alveolar stops being more affricated 31 Retroflex consonants have several different definitions depending on whether the position of the tongue or the position on the roof of the mouth is given prominence In general they represent a group of articulations in which the tip of the tongue is curled upwards to some degree In this way retroflex articulations can occur in several different locations on the roof of the mouth including alveolar post alveolar and palatal regions If the underside of the tongue tip makes contact with the roof of the mouth it is sub apical though apical post alveolar sounds are also described as retroflex 32 Typical examples of sub apical retroflex stops are commonly found in Dravidian languages and in some languages indigenous to the southwest United States the contrastive difference between dental and alveolar stops is a slight retroflexion of the alveolar stop 33 Acoustically retroflexion tends to affect the higher formants 33 Articulations taking place just behind the alveolar ridge known as post alveolar consonants have been referred to using a number of different terms Apical post alveolar consonants are often called retroflex while laminal articulations are sometimes called palato alveolar 34 in the Australianist literature these laminal stops are often described as palatal though they are produced further forward than the palate region typically described as palatal 26 Because of individual anatomical variation the precise articulation of palato alveolar stops and coronals in general 
can vary widely within a speech community 35 Dorsal Edit Dorsal consonants are those consonants made using the tongue body rather than the tip or blade and are typically produced at the palate velum or uvula Palatal consonants are made using the tongue body against the hard palate on the roof of the mouth They are frequently contrasted with velar or uvular consonants though it is rare for a language to contrast all three simultaneously with Jaqaru as a possible example of a three way contrast 36 Velar consonants are made using the tongue body against the velum They are incredibly common cross linguistically almost all languages have a velar stop Because both velars and vowels are made using the tongue body they are highly affected by coarticulation with vowels and can be produced as far forward as the hard palate or as far back as the uvula These variations are typically divided into front central and back velars in parallel with the vowel space 37 They can be hard to distinguish phonetically from palatal consonants though are produced slightly behind the area of prototypical palatal consonants 38 Uvular consonants are made by the tongue body contacting or approaching the uvula They are rare occurring in an estimated 19 percent of languages and large regions of the Americas and Africa have no languages with uvular consonants In languages with uvular consonants stops are most frequent followed by continuants including nasals 39 Pharyngeal and laryngeal Edit Consonants made by constrictions of the throat are pharyngeals and those made by a constriction in the larynx are laryngeal Laryngeals are made using the vocal folds as the larynx is too far down the throat to reach with the tongue Pharyngeals however are close enough to the mouth that parts of the tongue can reach them Radical consonants either use the root of the tongue or the epiglottis during production and are produced very far back in the vocal tract 40 Pharyngeal consonants are made by retracting the root of the tongue far enough to almost touch the wall of the pharynx Due to production difficulties only fricatives and approximants can produced this way 41 42 Epiglottal consonants are made with the epiglottis and the back wall of the pharynx Epiglottal stops have been recorded in Dahalo 42 Voiced epiglottal consonants are not deemed possible due to the cavity between the glottis and epiglottis being too small to permit voicing 43 Glottal consonants are those produced using the vocal folds in the larynx Because the vocal folds are the source of phonation and below the oro nasal vocal tract a number of glottal consonants are impossible such as a voiced glottal stop Three glottal consonants are possible a voiceless glottal stop and two glottal fricatives and all are attested in natural languages 19 Glottal stops produced by closing the vocal folds are notably common in the world s languages 43 While many languages use them to demarcate phrase boundaries some languages like Arabic and Huatla Mazatec have them as contrastive phonemes Additionally glottal stops can be realized as laryngealization of the following vowel in this language 44 Glottal stops especially between vowels do usually not form a complete closure True glottal stops normally occur only when they re geminated 45 The larynx Edit Further information Larynx A top down view of the larynx The larynx commonly known as the voice box is a cartilaginous structure in the trachea responsible for phonation The vocal folds chords are held together so that they vibrate or held apart 
so that they do not The positions of the vocal folds are achieved by movement of the arytenoid cartilages 46 The intrinsic laryngeal muscles are responsible for moving the arytenoid cartilages as well as modulating the tension of the vocal folds 47 If the vocal folds are not close or tense enough they will either vibrate sporadically or not at all If they vibrate sporadically it will result in either creaky or breathy voice depending on the degree if don t vibrate at all the result will be voicelessness In addition to correctly positioning the vocal folds there must also be air flowing across them or they will not vibrate The difference in pressure across the glottis required for voicing is estimated at 1 2 cm H2O 98 0665 196 133 pascals 48 The pressure differential can fall below levels required for phonation either because of an increase in pressure above the glottis superglottal pressure or a decrease in pressure below the glottis subglottal pressure The subglottal pressure is maintained by the respiratory muscles Supraglottal pressure with no constrictions or articulations is equal to about atmospheric pressure However because articulations especially consonants represent constrictions of the airflow the pressure in the cavity behind those constrictions can increase resulting in a higher supraglottal pressure 49 Lexical access Edit According to the lexical access model two different stages of cognition are employed thus this concept is known as the two stage theory of lexical access The first stage lexical selection provides information about lexical items required to construct the functional level representation These items are retrieved according to their specific semantic and syntactic properties but phonological forms are not yet made available at this stage The second stage retrieval of wordforms provides information required for building the positional level representation 50 Articulatory models Edit When producing speech the articulators move through and contact particular locations in space resulting in changes to the acoustic signal Some models of speech production take this as the basis for modeling articulation in a coordinate system that may be internal to the body intrinsic or external extrinsic Intrinsic coordinate systems model the movement of articulators as positions and angles of joints in the body Intrinsic coordinate models of the jaw often use two to three degrees of freedom representing translation and rotation These face issues with modeling the tongue which unlike joints of the jaw and arms is a muscular hydrostat like an elephant trunk which lacks joints 51 Because of the different physiological structures movement paths of the jaw are relatively straight lines during speech and mastication while movements of the tongue follow curves 52 Straight line movements have been used to argue articulations as planned in extrinsic rather than intrinsic space though extrinsic coordinate systems also include acoustic coordinate spaces not just physical coordinate spaces 51 Models that assume movements are planned in extrinsic space run into an inverse problem of explaining the muscle and joint locations which produce the observed path or acoustic signal The arm for example has seven degrees of freedom and 22 muscles so multiple different joint and muscle configurations can lead to the same final position For models of planning in extrinsic acoustic space the same one to many mapping problem applies as well with no unique mapping from physical or acoustic targets to the 
muscle movements required to achieve them Concerns about the inverse problem may be exaggerated however as speech is a highly learned skill using neurological structures which evolved for the purpose 53 The equilibrium point model proposes a resolution to the inverse problem by arguing that movement targets be represented as the position of the muscle pairs acting on a joint d Importantly muscles are modeled as springs and the target is the equilibrium point for the modeled spring mass system By using springs the equilibrium point model can easily account for compensation and response when movements are disrupted They are considered a coordinate model because they assume that these muscle positions are represented as points in space equilibrium points where the spring like action of the muscles converges 54 55 Gestural approaches to speech production propose that articulations are represented as movement patterns rather than particular coordinates to hit The minimal unit is a gesture that represents a group of functionally equivalent articulatory movement patterns that are actively controlled with reference to a given speech relevant goal e g a bilabial closure 56 These groups represent coordinative structures or synergies which view movements not as individual muscle movements but as task dependent groupings of muscles which work together as a single unit 57 58 This reduces the degrees of freedom in articulation planning a problem especially in intrinsic coordinate models which allows for any movement that achieves the speech goal rather than encoding the particular movements in the abstract representation Coarticulation is well described by gestural models as the articulations at faster speech rates can be explained as composites of the independent gestures at slower speech rates 59 Acoustics Edit A waveform top spectrogram middle and transcription bottom of a woman saying Wikipedia displayed using the Praat software for linguistic analysis Listen source source track track track track The accompanying audio Speech sounds are created by the modification of an airstream which results in a sound wave The modification is done by the articulators with different places and manners of articulation producing different acoustic results Because the posture of the vocal tract not just the position of the tongue can affect the resulting sound the manner of articulation is important for describing the speech sound The words tack and sack both begin with alveolar sounds in English but differ in how far the tongue is from the alveolar ridge This difference has large affects on the air stream and thus the sound that is produced Similarly the direction and source of the airstream can affect the sound The most common airstream mechanism is pulmonic using the lungs but the glottis and tongue can also be used to produce airstreams Voicing and phonation types Edit A major distinction between speech sounds is whether they are voiced Sounds are voiced when the vocal folds begin to vibrate in the process of phonation Many sounds can be produced with or without phonation though physical constraints may make phonation difficult or impossible for some articulations When articulations are voiced the main source of noise is the periodic vibration of the vocal folds Articulations like voiceless plosives have no acoustic source and are noticeable by their silence but other voiceless sounds like fricatives create their own acoustic source regardless of phonation Phonation is controlled by the muscles of the larynx and 
languages make use of more acoustic detail than binary voicing During phonation the vocal folds vibrate at a certain rate This vibration results in a periodic acoustic waveform comprising a fundamental frequency and its harmonics The fundamental frequency of the acoustic wave can be controlled by adjusting the muscles of the larynx and listeners perceive this fundamental frequency as pitch Languages use pitch manipulation to convey lexical information in tonal languages and many languages use pitch to mark prosodic or pragmatic information For the vocal folds to vibrate they must be in the proper position and there must be air flowing through the glottis 48 Phonation types are modeled on a continuum of glottal states from completely open voiceless to completely closed glottal stop The optimal position for vibration and the phonation type most used in speech modal voice exists in the middle of these two extremes If the glottis is slightly wider breathy voice occurs while bringing the vocal folds closer together results in creaky voice 60 The normal phonation pattern used in typical speech is modal voice where the vocal folds are held close together with moderate tension The vocal folds vibrate as a single unit periodically and efficiently with a full glottal closure and no aspiration 61 If they are pulled farther apart they do not vibrate and so produce voiceless phones If they are held firmly together they produce a glottal stop 60 If the vocal folds are held slightly further apart than in modal voicing they produce phonation types like breathy voice or murmur and whispery voice The tension across the vocal ligaments vocal cords is less than in modal voicing allowing for air to flow more freely Both breathy voice and whispery voice exist on a continuum loosely characterized as going from the more periodic waveform of breathy voice to the more noisy waveform of whispery voice Acoustically both tend to dampen the first formant with whispery voice showing more extreme deviations 62 Holding the vocal folds more tightly together results in a creaky voice The tension across the vocal folds is less than in modal voice but they are held tightly together resulting in only the ligaments of the vocal folds vibrating e The pulses are highly irregular with low pitch and frequency amplitude 63 Some languages do not maintain a voicing distinction for some consonants f but all languages use voicing to some degree For example no language is known to have a phonemic voicing contrast for vowels with all known vowels canonically voiced g Other positions of the glottis such as breathy and creaky voice are used in a number of languages like Jalapa Mazatec to contrast phonemes while in other languages like English they exist allophonically There are several ways to determine if a segment is voiced or not the simplest being to feel the larynx during speech and note when vibrations are felt More precise measurements can be obtained through acoustic analysis of a spectrogram or spectral slice In a spectrographic analysis voiced segments show a voicing bar a region of high acoustic energy in the low frequencies of voiced segments 64 In examining a spectral splice the acoustic spectrum at a given point in time a model of the vowel pronounced reverses the filtering of the mouth producing the spectrum of the glottis A computational model of the unfiltered glottal signal is then fitted to the inverse filtered acoustic signal to determine the characteristics of the glottis 65 Visual analysis is also available using 
specialized medical equipment such as ultrasound and endoscopy 64 h Vowels Edit IPA VowelsFront Central BackClose i y ɨ ʉ ɯ uNear close ɪ ʏ ʊClose mid e o ɘ ɵ ɤ oMid e o e ɤ o Open mid ɛ œ ɜ ɞ ʌ ɔNear open ae ɐOpen a ɶ a ɑ ɒIPA help audio full chart template Legend unrounded roundedVowels are broadly categorized by the area of the mouth in which they are produced but because they are produced without a constriction in the vocal tract their precise description relies on measuring acoustic correlates of tongue position The location of the tongue during vowel production changes the frequencies at which the cavity resonates and it is these resonances known as formants which are measured and used to characterize vowels Vowel height traditionally refers to the highest point of the tongue during articulation 66 The height parameter is divided into four primary levels high close close mid open mid and low open Vowels whose height are in the middle are referred to as mid Slightly opened close vowels and slightly closed open vowels are referred to as near close and near open respectively The lowest vowels are not just articulated with a lowered tongue but also by lowering the jaw 67 While the IPA implies that there are seven levels of vowel height it is unlikely that a given language can minimally contrast all seven levels Chomsky and Halle suggest that there are only three levels 68 although four levels of vowel height seem to be needed to describe Danish and it s possible that some languages might even need five 69 Vowel backness is dividing into three levels front central and back Languages usually do not minimally contrast more than two levels of vowel backness Some languages claimed to have a three way backness distinction include Nimboran and Norwegian 70 In most languages the lips during vowel production can be classified as either rounded or unrounded spread although other types of lip positions such as compression and protrusion have been described Lip position is correlated with height and backness front and low vowels tend to be unrounded whereas back and high vowels are usually rounded 71 Paired vowels on the IPA chart have the spread vowel on the left and the rounded vowel on the right 72 Together with the universal vowel features described above some languages have additional features such as nasality length and different types of phonation such as voiceless or creaky Sometimes more specialized tongue gestures such as rhoticity advanced tongue root pharyngealization stridency and frication are required to describe a certain vowel 73 Manner of articulation Edit Main article Manner of articulation Knowing the place of articulation is not enough to fully describe a consonant the way in which the stricture happens is equally important Manners of articulation describe how exactly the active articulator modifies narrows or closes off the vocal tract 74 Stops also referred to as plosives are consonants where the airstream is completely obstructed Pressure builds up in the mouth during the stricture which is then released as a small burst of sound when the articulators move apart The velum is raised so that air cannot flow through the nasal cavity If the velum is lowered and allows for air to flow through the nose the result in a nasal stop However phoneticians almost always refer to nasal stops as just nasals 74 Affricates are a sequence of stops followed by a fricative in the same place 75 Fricatives are consonants where the airstream is made turbulent by partially but not completely 
obstructing part of the vocal tract 74 Sibilants are a special type of fricative where the turbulent airstream is directed towards the teeth 76 creating a high pitched hissing sound 77 Nasals sometimes referred to as nasal stops are consonants in which there s a closure in the oral cavity and the velum is lowered allowing air to flow through the nose 78 In an approximant the articulators come close together but not to such an extent that allows a turbulent airstream 77 Laterals are consonants in which the airstream is obstructed along the center of the vocal tract allowing the airstream to flow freely on one or both sides 77 Laterals have also been defined as consonants in which the tongue is contracted in such a way that the airstream is greater around the sides than over the center of the tongue 79 The first definition does not allow for air to flow over the tongue Trills are consonants in which the tongue or lips are set in motion by the airstream 80 The stricture is formed in such a way that the airstream causes a repeating pattern of opening and closing of the soft articulator s 81 Apical trills typically consist of two or three periods of vibration 82 Taps and flaps are single rapid usually apical gestures where the tongue is thrown against the roof of the mouth comparable to a very rapid stop 80 These terms are sometimes used interchangeably but some phoneticians make a distinction 83 In a tap the tongue contacts the roof in a single motion whereas in a flap the tongue moves tangentially to the roof of the mouth striking it in passing During a glottalic airstream mechanism the glottis is closed trapping a body of air This allows for the remaining air in the vocal tract to be moved separately An upward movement of the closed glottis will move this air out resulting in it an ejective consonant Alternatively the glottis can lower sucking more air into the mouth which results in an implosive consonant 84 Clicks are stops in which tongue movement causes air to be sucked in the mouth this is referred to as a velaric airstream 85 During the click the air becomes rarefied between two articulatory closures producing a loud click sound when the anterior closure is released The release of the anterior closure is referred to as the click influx The release of the posterior closure which can be velar or uvular is the click efflux Clicks are used in several African language families such as the Khoisan and Bantu languages 86 Pulmonary and subglottal system Edit Further information Breathing The lungs drive nearly all speech production and their importance in phonetics is due to their creation of pressure for pulmonic sounds The most common kinds of sound across languages are pulmonic egress where air is exhaled from the lungs 87 The opposite is possible though no language is known to have pulmonic ingressive sounds as phonemes 88 Many languages such as Swedish use them for paralinguistic articulations such as affirmations in a number of genetically and geographically diverse languages 89 Both egressive and ingressive sounds rely on holding the vocal folds in a particular posture and using the lungs to draw air across the vocal folds so that they either vibrate voiced or do not vibrate voiceless 87 Pulmonic articulations are restricted by the volume of air able to be exhaled in a given respiratory cycle known as the vital capacity The lungs are used to maintain two kinds of pressure simultaneously in order to produce and modify phonation To produce phonation at all the lungs must maintain a 
pressure of 3 5 cm H2O higher than the pressure above the glottis However small and fast adjustments are made to the subglottal pressure to modify speech for suprasegmental features like stress A number of thoracic muscles are used to make these adjustments Because the lungs and thorax stretch during inhalation the elastic forces of the lungs alone can produce pressure differentials sufficient for phonation at lung volumes above 50 percent of vital capacity 90 Above 50 percent of vital capacity the respiratory muscles are used to check the elastic forces of the thorax to maintain a stable pressure differential Below that volume they are used to increase the subglottal pressure by actively exhaling air During speech the respiratory cycle is modified to accommodate both linguistic and biological needs Exhalation usually about 60 percent of the respiratory cycle at rest is increased to about 90 percent of the respiratory cycle Because metabolic needs are relatively stable the total volume of air moved in most cases of speech remains about the same as quiet tidal breathing 91 Increases in speech intensity of 18 dB a loud conversation has relatively little impact on the volume of air moved Because their respiratory systems are not as developed as adults children tend to use a larger proportion of their vital capacity compared to adults with more deep inhales 92 Source filter theory Edit Main article Source filter model This section needs expansion You can help by adding to it February 2020 The source filter model of speech is a theory of speech production which explains the link between vocal tract posture and the acoustic consequences Under this model the vocal tract can be modeled as a noise source coupled onto an acoustic filter 93 The noise source in many cases is the larynx during the process of voicing though other noise sources can be modeled in the same way The shape of the supraglottal vocal tract acts as the filter and different configurations of the articulators result in different acoustic patterns These changes are predictable The vocal tract can be modeled as a sequence of tubes closed at one end with varying diameters and by using equations for acoustic resonance the acoustic effect of an articulatory posture can be derived 94 The process of inverse filtering uses this principle to analyze the source spectrum produced by the vocal folds during voicing By taking the inverse of a predicted filter the acoustic effect of the supraglottal vocal tract can be undone giving the acoustic spectrum produced by the vocal folds 95 This allows quantitative study of the various phonation types Perception EditMain article Speech perception Language perception is the process by which a linguistic signal is decoded and understood by a listener i In order to perceive speech the continuous acoustic signal must be converted into discrete linguistic units such as phonemes morphemes and words 96 In order to correctly identify and categorize sounds listeners prioritize certain aspects of the signal that can reliably distinguish between linguistic categories 97 While certain cues are prioritized over others many aspects of the signal can contribute to perception For example though oral languages prioritize acoustic information the McGurk effect shows that visual information is used to distinguish ambiguous information when the acoustic cues are unreliable 98 While listeners can use a variety of information to segment the speech signal the relationship between acoustic signal and category perception is 
not a perfect mapping Because of coarticulation noisy environments and individual differences there is a high degree of acoustic variability within categories 99 Known as the problem of perceptual invariance listeners are able to reliably perceive categories despite the variability in acoustic instantiation 100 In order to do this listeners rapidly accommodate to new speakers and will shift their boundaries between categories to match the acoustic distinctions their conversational partner is making 101 Audition Edit Main article Hearing for further information see Neuronal encoding of sound source source source source source source source source source source source source track track track track track track How sounds make their way from the source to the brain Audition the process of hearing sounds is the first stage of perceiving speech Articulators cause systematic changes in air pressure which travel as sound waves to the listener s ear The sound waves then hit the listener s ear drum causing it to vibrate The vibration of the ear drum is transmitted by the ossicles three small bones of the middle ear to the cochlea 102 The cochlea is a spiral shaped fluid filled tube divided lengthwise by the organ of Corti which contains the basilar membrane The basilar membrane increases in thickness as it travels through the cochlea causing different frequencies to resonate at different locations This tonotopic design allows for the ear to analyze sound in a manner similar to a Fourier transform 103 The differential vibration of the basilar causes the hair cells within the organ of Corti to move This causes depolarization of the hair cells and ultimately a conversion of the acoustic signal into a neuronal signal 104 While the hair cells do not produce action potentials themselves they release neurotransmitter at synapses with the fibers of the auditory nerve which does produce action potentials In this way the patterns of oscillations on the basilar membrane are converted to spatiotemporal patterns of firings which transmit information about the sound to the brainstem 105 Prosody Edit Main article Prosody linguistics Besides consonants and vowels phonetics also describes the properties of speech that are not localized to segments but greater units of speech such as syllables and phrases Prosody includes auditory characteristics such as pitch speech rate duration and loudness Languages use these properties to different degrees to implement stress pitch accents and intonation for example stress in English and Spanish is correlated with changes in pitch and duration whereas stress in Welsh is more consistently correlated with pitch than duration and stress in Thai is only correlated with duration 106 Theories of speech perception Edit Early theories of speech perception such as motor theory attempted to solve the problem of perceptual invariance by arguing that speech perception and production were closely linked In its strongest form motor theory argues that speech perception requires the listener to access the articulatory representation of sounds 107 in order to properly categorize a sound a listener reverse engineers the articulation which would produce that sound and by identifying these gestures is able to retrieve the intended linguistic category 108 While findings such as the McGurk effect and case studies from patients with neurological injuries have provided support for motor theory further experiments have not supported the strong form of motor theory though there is some support for 
Prosody

Main article: Prosody (linguistics)

Besides consonants and vowels, phonetics also describes properties of speech that are not localized to segments but to greater units of speech, such as syllables and phrases. Prosody includes auditory characteristics such as pitch, speech rate, duration, and loudness. Languages use these properties to different degrees to implement stress, pitch accents, and intonation. For example, stress in English and Spanish is correlated with changes in pitch and duration, whereas stress in Welsh is more consistently correlated with pitch than duration, and stress in Thai is only correlated with duration.[106]

Theories of speech perception

Early theories of speech perception, such as motor theory, attempted to solve the problem of perceptual invariance by arguing that speech perception and production were closely linked. In its strongest form, motor theory argues that speech perception requires the listener to access the articulatory representation of sounds;[107] in order to properly categorize a sound, a listener reverse-engineers the articulation which would produce that sound and, by identifying these gestures, is able to retrieve the intended linguistic category.[108] While findings such as the McGurk effect and case studies of patients with neurological injuries have provided support for motor theory, further experiments have not supported its strong form, though there is some support for weaker forms of motor theory, which claim a non-deterministic relationship between production and perception.[108][109][110]

Successor theories of speech perception place the focus on acoustic cues to sound categories and can be grouped into two broad categories: abstractionist theories and episodic theories.[111] In abstractionist theories, speech perception involves the identification of an idealized lexical object based on a signal reduced to its necessary components and normalization of the signal to counteract speaker variability. Episodic theories, such as the exemplar model, argue that speech perception involves accessing detailed memories (i.e., episodic memories) of previously heard tokens. The problem of perceptual invariance is explained by episodic theories as an issue of familiarity: normalization is a byproduct of exposure to more variable distributions rather than a discrete process, as abstractionist theories claim.[111]
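The contrast with abstractionist accounts can be made concrete with a toy exemplar-style classifier: categories are nothing more than collections of remembered tokens in an acoustic space, and a new token is labeled by its summed similarity to the stored exemplars. The feature values (rough F1/F2 pairs in Hz), the similarity function, and its scale are invented for illustration; this is a sketch of the general idea, not any specific published model.

```python
# Toy sketch of an exemplar-style (episodic) account of categorization.
import math
from collections import defaultdict

# Stored exemplars: (F1, F2) tokens previously heard for two vowel categories.
exemplars = {
    "i": [(280, 2250), (310, 2100), (300, 2300), (270, 2200)],
    "a": [(750, 1200), (800, 1100), (770, 1300), (720, 1250)],
}

def similarity(x, y, scale=200.0):
    """Exponentially decaying similarity in the acoustic space."""
    return math.exp(-math.dist(x, y) / scale)

def categorize(token):
    """Sum similarity to every stored exemplar of each category."""
    scores = defaultdict(float)
    for category, tokens in exemplars.items():
        for ex in tokens:
            scores[category] += similarity(token, ex)
    return max(scores, key=scores.get), dict(scores)

label, scores = categorize((320, 2150))   # a new, never-heard token
print(label, scores)
# Adding more variable exemplars (e.g. from new speakers) changes the scores
# directly, which is how episodic theories recast "normalization" as exposure.
```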
Subdisciplines

Acoustic phonetics

Main article: Acoustic phonetics

Acoustic phonetics deals with the acoustic properties of speech sounds. The sensation of sound is caused by pressure fluctuations which cause the eardrum to move. The ear transforms this movement into neural signals that the brain registers as sound. Acoustic waveforms are records that measure these pressure fluctuations.[112]

Articulatory phonetics

Main article: Articulatory phonetics

Articulatory phonetics deals with the ways in which speech sounds are made.

Auditory phonetics

Main article: Auditory phonetics

Auditory phonetics studies how humans perceive speech sounds. Because the anatomical features of the auditory system distort the speech signal, humans do not experience speech sounds as perfect acoustic records. For example, the auditory impression of volume, measured in decibels (dB), does not linearly match the difference in sound pressure.[113] The mismatch between acoustic analyses and what the listener hears is especially noticeable in speech sounds that have a lot of high-frequency energy, such as certain fricatives. To reconcile this mismatch, functional models of the auditory system have been developed.[114]

Describing sounds

Human languages use many different sounds, and in order to compare them linguists must be able to describe sounds in a way that is language-independent. Speech sounds can be described in a number of ways. Most commonly, speech sounds are referred to by the mouth movements needed to produce them. Consonants and vowels are two gross categories that phoneticians define by the movements in a speech sound. More fine-grained descriptors are parameters such as place of articulation. Place of articulation, manner of articulation, and voicing are used to describe consonants and are the main divisions of the International Phonetic Alphabet consonant chart. Vowels are described by their height, backness, and rounding. Sign languages are described using a similar but distinct set of parameters to describe signs: location, movement, handshape, palm orientation, and non-manual features. In addition to articulatory descriptions, sounds used in oral languages can be described using their acoustics. Because the acoustics are a consequence of the articulation, both methods of description are sufficient to distinguish sounds, with the choice between systems dependent on the phonetic feature being investigated.

Consonants are speech sounds that are articulated with a complete or partial closure of the vocal tract. They are generally produced by the modification of an airstream exhaled from the lungs. The respiratory organs used to create and modify airflow are divided into three regions: the vocal tract (supralaryngeal), the larynx, and the subglottal system. The airstream can be either egressive (out of the vocal tract) or ingressive (into the vocal tract). In pulmonic sounds, the airstream is produced by the lungs in the subglottal system and passes through the larynx and vocal tract. Glottalic sounds use an airstream created by movements of the larynx without airflow from the lungs. Click consonants are articulated through the rarefaction of air using the tongue, followed by releasing the forward closure of the tongue.

Vowels are syllabic speech sounds that are pronounced without any obstruction in the vocal tract.[115] Unlike consonants, which usually have definite places of articulation, vowels are defined in relation to a set of reference vowels called cardinal vowels. Three properties are needed to define vowels: tongue height, tongue backness, and lip roundedness. Vowels that are articulated with a stable quality are called monophthongs; a combination of two separate vowels in the same syllable is a diphthong.[116] In the IPA, the vowels are represented on a trapezoid shape representing the human mouth: the vertical axis represents the mouth from floor to roof, and the horizontal axis represents the front-back dimension.[117]
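The articulatory vowel dimensions above have well-known acoustic correlates: the first formant (F1) tends to fall as vowel height increases, and the second formant (F2) tends to fall as the vowel becomes more back. A toy sketch of this mapping is given below; the threshold values and example formant figures are rough, illustrative assumptions (loosely typical of an adult male voice), not figures from the article.

```python
# Toy sketch: mapping formant values onto coarse height/backness labels.
# Thresholds are illustrative assumptions only.

def describe_vowel(f1_hz: float, f2_hz: float) -> str:
    if f1_hz < 400:
        height = "high"
    elif f1_hz < 600:
        height = "mid"
    else:
        height = "low"

    if f2_hz > 1800:
        backness = "front"
    elif f2_hz > 1200:
        backness = "central"
    else:
        backness = "back"

    return f"{height} {backness} vowel"

print(describe_vowel(280, 2250))  # [i]-like token -> high front vowel
print(describe_vowel(710, 1100))  # [a]-like (back) token -> low back vowel
print(describe_vowel(300, 870))   # [u]-like token -> high back vowel
```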
Transcription

Main article: Phonetic transcription

Phonetic transcription is a system for transcribing phones that occur in a language, whether oral or sign. The most widely known system of phonetic transcription, the International Phonetic Alphabet (IPA), provides a standardized set of symbols for oral phones.[118][119] The standardized nature of the IPA enables its users to transcribe accurately and consistently the phones of different languages, dialects, and idiolects.[118][120][121] The IPA is a useful tool not only for the study of phonetics but also for language teaching, professional acting, and speech pathology.[120]

While no sign language has a standardized writing system, linguists have developed their own notation systems that describe the handshape, location, and movement. The Hamburg Notation System (HamNoSys) is similar to the IPA in that it allows for varying levels of detail. Some notation systems, such as KOMVA and the Stokoe system, were designed for use in dictionaries; they also make use of alphabetic letters in the local language for handshapes, whereas HamNoSys represents the handshape directly. SignWriting aims to be an easy-to-learn writing system for sign languages, although it has not been officially adopted by any deaf community yet.[122]

Sign languages

Unlike spoken languages, words in sign languages are perceived with the eyes instead of the ears. Signs are articulated with the hands, upper body, and head. The main articulators are the hands and arms. Relative parts of the arm are described with the terms proximal and distal. Proximal refers to a part closer to the torso, whereas a distal part is further away from it. For example, a wrist movement is distal compared to an elbow movement. Because they require less energy, distal movements are generally easier to produce. Various factors, such as muscle flexibility or being considered taboo, restrict what can be considered a sign.[123] Native signers do not look at their conversation partner's hands. Instead, their gaze is fixated on the face. Because peripheral vision is not as focused as the center of the visual field, signs articulated near the face allow for more subtle differences in finger movement and location to be perceived.[124]

Unlike spoken languages, sign languages have two identical articulators: the hands. Signers may use whichever hand they prefer with no disruption in communication. Due to universal neurological limitations, two-handed signs generally have the same kind of articulation in both hands; this is referred to as the Symmetry Condition.[123] The second universal constraint is the Dominance Condition, which holds that when two handshapes are involved, one hand will remain stationary and have a more limited set of handshapes compared to the dominant, moving hand.[125] Additionally, it is common for one hand in a two-handed sign to be dropped during informal conversations, a process referred to as weak drop.[123] Just like words in spoken languages, coarticulation may cause signs to influence each other's form. Examples include the handshapes of neighboring signs becoming more similar to each other (assimilation) or weak drop (an instance of deletion).[126]

See also

Motor theory of speech perception
Exemplar theory
Articulatory phonology

References

Notes

^ Linguists debate whether these stages can interact or whether they occur serially (compare Dell & Reich 1981 and Motley, Camden & Baars 1982). For ease of description, the language production process is described as a series of independent stages, though recent evidence shows this is inaccurate.[11] For further descriptions of interactive activation models, see Jaeger, Furth & Hilliard (2012).
^ or after part of an utterance has been planned; see Gleitman et al. (2007) for evidence of production before a message has been completely planned
^ adapted from Sedivy (2019, p. 411) and Boersma (1998, p. 11)
^ See Feldman (1966) for the original proposal.
^ See "The larynx" for further information on the anatomy of phonation.
^ Hawaiian, for example, does not contrast voiced and voiceless plosives.
^ There are languages, like Japanese, where vowels are produced as voiceless in certain contexts.
^ See "Articulatory models" for further information on acoustic modeling.
^ As with speech production, the nature of the linguistic signal varies depending on the language modality. The signal can be acoustic for oral speech, visual for signed languages, or tactile for manual-tactile sign languages. For simplicity, acoustic speech is described here; for sign language perception specifically, see Sign language § Sign perception.

Citations

^ O'Grady 2005, p. 15.
^ a b c Caffrey 2017.
^ Kiparsky 1993, p. 2918.
^ Kiparsky 1993, pp. 2922–3.
^ Deussen, Paul (1980). Sixty Upanishads of the Veda, Volume I. Motilal Banarasidass. p. 222. ISBN 978-8120814684.
^ Oxford English Dictionary 2018.
^ a b Roach 2015.
^ Ladefoged 1960, p. 388.
^ Ladefoged 1960.
^ Dell & O'Seaghdha 1992.
^ Sedivy 2019, p. 439.
^ Boersma 1998.
^ a b Ladefoged 2001, p. 5.
^ Ladefoged & Maddieson 1996, p. 9.
^ Ladefoged & Maddieson 1996, p. 16.
^ Maddieson 1993.
^ Fujimura 1961.
^ Ladefoged & Maddieson 1996, pp. 16–17.
^ a b c International Phonetic Association 2015.
^ Ladefoged & Maddieson 1996, p. 18.
^ Ladefoged & Maddieson 1996, pp. 17–18.
^ a b Ladefoged & Maddieson 1996, p. 17.
^ Doke 1926.
^ Guthrie 1948, p. 61.
^ Ladefoged & Maddieson 1996, pp. 19–31.
^ a b Ladefoged & Maddieson 1996, p. 28.
^ Ladefoged & Maddieson 1996, pp. 19–25.
^ Ladefoged & Maddieson 1996, pp. 20, 40–1.
^ Scatton 1984, p. 60.
^ Ladefoged & Maddieson 1996, p. 23.
^ Ladefoged & Maddieson 1996, pp. 23–5.
^ Ladefoged & Maddieson 1996, pp. 25, 27–8.
^ a b Ladefoged & Maddieson 1996, p. 27.
^ Ladefoged & Maddieson 1996, pp. 27–8.
^ Ladefoged & Maddieson 1996, p. 32.
^ Ladefoged & Maddieson 1996, p. 35.
^ Ladefoged & Maddieson 1996, pp. 33–34.
^ Keating & Lahiri 1993, p. 89.
^ Maddieson 2013.
^ Ladefoged & Maddieson 1996, p. 11.
^ Lodge 2009, p. 33.
^ a b Ladefoged & Maddieson 1996, p. 37.
^ a b Ladefoged & Maddieson 1996, p. 38.
^ Ladefoged & Maddieson 1996, p. 74.
^ Ladefoged & Maddieson 1996, p. 75.
^ Ladefoged 2001, p. 123.
^ Seikel, Drumright & King 2016, p. 222.
^ a b Ohala 1997, p. 1.
^ Chomsky & Halle 1968, pp. 300–301.
^ Altmann 2002.
^ a b Löfqvist 2010, p. 359.
^ Munhall, Ostry & Flanagan 1991, p. 299 et seq.
^ Löfqvist 2010, p. 360.
^ Bizzi et al. 1992.
^ Löfqvist 2010, p. 361.
^ Saltzman & Munhall 1989.
^ Mattingly 1990.
^ Löfqvist 2010, pp. 362–4.
^ Löfqvist 2010, p. 364.
^ a b Gordon & Ladefoged 2001.
^ Gobl & Ní Chasaide 2010, p. 399.
^ Gobl & Ní Chasaide 2010, pp. 400–401.
^ Gobl & Ní Chasaide 2010, p. 401.
^ a b Dawson & Phelan 2016.
^ Gobl & Ní Chasaide 2010, pp. 388 et seq.
^ Ladefoged & Maddieson 1996, p. 282.
^ Lodge 2009, p. 39.
^ Chomsky & Halle 1968.
^ Ladefoged & Maddieson 1996, p. 289.
^ Ladefoged & Maddieson 1996, p. 290.
^ Ladefoged & Maddieson 1996, pp. 292–295.
^ Lodge 2009, p. 40.
^ Ladefoged & Maddieson 1996, p. 298.
^ a b c Ladefoged & Johnson 2011, p. 14.
^ Ladefoged & Johnson 2011, p. 67.
^ Ladefoged & Maddieson 1996, p. 145.
^ a b c Ladefoged & Johnson 2011, p. 15.
^ Ladefoged & Maddieson 1996, p. 102.
^ Ladefoged & Maddieson 1996, p. 182.
^ a b Ladefoged & Johnson 2011, p. 175.
^ Ladefoged & Maddieson 1996, p. 217.
^ Ladefoged & Maddieson 1996, p. 218.
^ Ladefoged & Maddieson 1996, pp. 230–231.
^ Ladefoged & Johnson 2011, p. 137.
^ Ladefoged & Maddieson 1996, p. 78.
^ Ladefoged & Maddieson 1996, pp. 246–247.
^ a b Ladefoged 2001, p. 1.
^ Eklund 2008, p. 237.
^ Eklund 2008.
^ Seikel, Drumright & King 2016, p. 176.
^ Seikel, Drumright & King 2016, p. 171.
^ Seikel, Drumright & King 2016, pp. 168–77.
^ Johnson 2008, pp. 83–5.
^ Johnson 2008, pp. 104–5.
^ Johnson 2008, p. 157.
^ Sedivy 2019, pp. 259–60.
^ Sedivy 2019, p. 269.
^ Sedivy 2019, p. 273.
^ Sedivy 2019, p. 259.
^ Sedivy 2019, p. 260.
^ Sedivy 2019, pp. 274–85.
^ Johnson 2003, pp. 46–7.
^ Johnson 2003, p. 47.
^ Schacter, Gilbert & Wegner 2011, pp. 158–9.
^ Yost 2003, p. 130.
^ Cutler 2005.
^ Sedivy 2019, p. 289.
^ a b Galantucci, Fowler & Turvey 2006.
^ Sedivy 2019, pp. 292–3.
^ Skipper, Devlin & Lametti 2017.
^ a b Goldinger 1996.
^ Johnson 2003, p. 1.
^ Johnson 2003, pp. 46–49.
^ Johnson 2003, p. 53.
^ Ladefoged & Maddieson 1996, p. 281.
^ Gussenhoven & Jacobs 2017, pp. 26–27.
^ Lodge 2009, p. 38.
^ a b O'Grady 2005, p. 17.
^ International Phonetic Association 1999.
^ a b Ladefoged 2005.
^ Ladefoged & Maddieson 1996.
^ Baker et al. 2016, pp. 242–244.
^ a b c Baker et al. 2016, pp. 229–235.
^ Baker et al. 2016, p. 236.
^ Baker et al. 2016, p. 286.
^ Baker et al. 2016, p. 239.

Works cited

Abercrombie, D. (1967). Elements of General Phonetics. Edinburgh: Chicago, Aldine Pub. Co.
Altmann, Gerry (2002). Psycholinguistics: critical concepts in psychology. London: Routledge. ISBN 978-0415229906. OCLC 48014482.
Baker, Anne; van den Bogaerde, Beppie; Pfau, Roland; Schermer, Trude (2016). The Linguistics of Sign Languages. Amsterdam/Philadelphia: John Benjamins Publishing Company. ISBN 978-90-272-1230-6.
Baumbach, E. J. M. (1987). Analytical Tsonga Grammar. Pretoria: University of South Africa.
Bizzi, E.; Hogan, N.; Mussa-Ivaldi, F.; Giszter, S. (1992). "Does the nervous system use equilibrium point control to guide single and multiple joint movements?". Behavioral and Brain Sciences. 15 (4): 603–13. doi:10.1017/S0140525X00072538. PMID 23302290.
Bock, Kathryn; Levelt, Willem (2002). Altmann, Gerry (ed.). Psycholinguistics: Critical Concepts in Psychology. Vol. 5. New York: Routledge. pp. 405–407. ISBN 978-0-415-26701-4.
Boersma, Paul (1998). Functional phonology: Formalizing the interactions between articulatory and perceptual drives. The Hague: Holland Academic Graphics. ISBN 9055690546. OCLC 40563066.
Caffrey, Cait (2017). "Phonetics". Salem Press Encyclopedia. Salem Press.
Catford, J. C. (2001). A Practical Introduction to Phonetics (2nd ed.). Oxford University Press. ISBN 978-0-19-924635-9.
Chomsky, Noam; Halle, Morris (1968). Sound Pattern of English. Harper and Row.
Cutler, Anne (2005). "Lexical Stress" (PDF). In Pisoni, David B.; Remez, Robert (eds.). The Handbook of Speech Perception. Blackwell. pp. 264–289. doi:10.1002/9780470757024.ch11. ISBN 978-0-631-22927-8. OCLC 749782145. Retrieved 2019-12-29.
Dawson, Hope; Phelan, Michael, eds. (2016). Language Files: Materials for an Introduction to Linguistics (12th ed.). The Ohio State University Press. ISBN 978-0-8142-5270-3.
Dell, Gary; O'Seaghdha, Padraig (1992). "Stages of lexical access in language production". Cognition. 42 (1–3): 287–314. doi:10.1016/0010-0277(92)90046-k. PMID 1582160. S2CID 37962027.
Dell, Gary; Reich, Peter (1981). "Stages in sentence production: An analysis of speech error data". Journal of Memory and Language. 20 (6): 611–629. doi:10.1016/S0022-5371(81)90202-4.
Doke, Clement M. (1926). The Phonetics of the Zulu Language. Bantu Studies. Johannesburg: Witwatersrand University Press.
Eklund, Robert (2008). "Pulmonic ingressive phonation: Diachronic and synchronic characteristics, distribution and function in animal and human sound production and in human speech". Journal of the International Phonetic Association. 38 (3): 235–324. doi:10.1017/S0025100308003563. S2CID 146616135.
Feldman, Anatol G. (1966). "Functional tuning of the nervous system with control of movement or maintenance of a steady posture, III: Mechanographic analysis of the execution by man of the simplest motor task". Biophysics. 11: 565–578.
Fujimura, Osamu (1961). "Bilabial stop and nasal consonants: A motion picture study and its acoustical implications". Journal of Speech and Hearing Research. 4 (3): 233–47. doi:10.1044/jshr.0403.233. PMID 13702471.
Galantucci, Bruno; Fowler, Carol; Turvey, Michael (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/BF03193857. PMC 2746041. PMID 17048719.
Gleitman, Lila; January, David; Nappa, Rebecca; Trueswell, John (2007). "On the give and take between event apprehension and utterance formulation". Journal of Memory and Language. 57 (4): 544–569. doi:10.1016/j.jml.2007.01.007. PMC 2151743. PMID 18978929.
Gobl, Christer; Ní Chasaide, Ailbhe (2010). "Voice source variation and its communicative functions". The Handbook of Phonetic Sciences (2nd ed.). pp. 378–424.
Goldinger, Stephen (1996). "Words and voices: episodic traces in spoken word identification and recognition memory". Journal of Experimental Psychology: Learning, Memory, and Cognition. 22 (5): 1166–83. doi:10.1037/0278-7393.22.5.1166. PMID 8926483.
Gordon, Matthew; Ladefoged, Peter (2001). "Phonation types: a cross-linguistic overview". Journal of Phonetics. 29 (4): 383–406. doi:10.1006/jpho.2001.0147.
Guthrie, Malcolm (1948). The classification of the Bantu languages. London: Oxford University Press.
Gussenhoven, Carlos; Jacobs, Haike (2017). Understanding phonology (Fourth ed.). London and New York: Routledge. ISBN 9781138961418. OCLC 958066102.
Hall, Tracy Alan (2001). "Introduction: Phonological representations and phonetic implementation of distinctive features". In Hall, Tracy Alan (ed.). Distinctive Feature Theory. de Gruyter. pp. 1–40.
Halle, Morris (1983). "On Distinctive Features and their articulatory implementation". Natural Language and Linguistic Theory. 1 (1): 91–105. doi:10.1007/BF00210377. S2CID 170466631.
Hardcastle, William; Laver, John; Gibbon, Fiona, eds. (2010). The Handbook of Phonetic Sciences (2nd ed.). Wiley-Blackwell. ISBN 978-1-405-14590-9.
International Phonetic Association (1999). Handbook of the International Phonetic Association. Cambridge University Press.
International Phonetic Association (2015). International Phonetic Alphabet. International Phonetic Association.
Jaeger, Florian; Furth, Katrina; Hilliard, Caitlin (2012). "Phonological overlap affects lexical selection during sentence production". Journal of Experimental Psychology: Learning, Memory, and Cognition. 38 (5): 1439–1449. doi:10.1037/a0027862. PMID 22468803.
Jakobson, Roman; Fant, Gunnar; Halle, Morris (1976). Preliminaries to Speech Analysis: The Distinctive Features and their Correlates. MIT Press. ISBN 978-0-262-60001-9.
Johnson, Keith (2003). Acoustic and auditory phonetics (2nd ed.). Blackwell Pub. ISBN 1405101229. OCLC 50198698.
Johnson, Keith (2011). Acoustic and Auditory Phonetics (3rd ed.). Wiley-Blackwell. ISBN 978-1-444-34308-3.
Jones, Daniel (1948). "The London school of phonetics". Zeitschrift für Phonetik. 11 (3/4): 127–135. Reprinted in Jones, W. E.; Laver, J., eds. (1973). Phonetics in Linguistics. Longman. pp. 180–186.
Keating, Patricia; Lahiri, Aditi (1993). "Fronted Velars, Palatalized Velars, and Palatals". Phonetica. 50 (2): 73–101. doi:10.1159/000261928. PMID 8316582. S2CID 3272781.
Kingston, John (2007). "The Phonetics-Phonology Interface". In DeLacy, Paul (ed.). The Cambridge Handbook of Phonology. Cambridge University Press. ISBN 978-0-521-84879-4.
Kiparsky, Paul (1993). "Pāṇinian linguistics". In Asher, R. E. (ed.). Encyclopedia of Languages and Linguistics. Oxford: Pergamon.
Ladefoged, Peter (1960). "The Value of Phonetic Statements". Language. 36 (3): 387–96. doi:10.2307/410966. JSTOR 410966.
Ladefoged, Peter (2001). A Course in Phonetics (4th ed.). Boston: Thomson/Wadsworth. ISBN 978-1-413-00688-9.
Ladefoged, Peter (2005). A Course in Phonetics (5th ed.). Boston: Thomson/Wadsworth. ISBN 978-1-413-00688-9.
Ladefoged, Peter; Johnson, Keith (2011). A Course in Phonetics (6th ed.). Wadsworth. ISBN 978-1-42823126-9.
Ladefoged, Peter; Maddieson, Ian (1996). The Sounds of the World's Languages. Oxford: Blackwell. ISBN 978-0-631-19815-4.
Levelt, Willem (1999). "A theory of lexical access in speech production". Behavioral and Brain Sciences. 22 (1): 3–6. doi:10.1017/s0140525x99001776. hdl:11858/00-001M-0000-0013-3E7A-A. PMID 11301520. S2CID 152230066.
Lodge, Ken (2009). A Critical Introduction to Phonetics. New York: Continuum International Publishing Group. ISBN 978-0-8264-8873-2.
Löfqvist, Anders (2010). "Theories and Models of Speech Production". Handbook of Phonetic Sciences (2nd ed.). pp. 353–78.
Maddieson, Ian (1993). "Investigating Ewe articulations with electromagnetic articulography". Forschungsberichte des Instituts für Phonetik und Sprachliche Kommunikation der Universität München. 31: 181–214.
Maddieson, Ian (2013). "Uvular Consonants". In Dryer, Matthew S.; Haspelmath, Martin (eds.). The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology.
Mattingly, Ignatius (1990). "The global character of phonetic gestures" (PDF). Journal of Phonetics. 18 (3): 445–52. doi:10.1016/S0095-4470(19)30372-9.
Motley, Michael; Camden, Carl; Baars, Bernard (1982). "Covert formulation and editing of anomalies in speech production: Evidence from experimentally elicited slips of the tongue". Journal of Verbal Learning and Verbal Behavior. 21 (5): 578–594. doi:10.1016/S0022-5371(82)90791-5.
Munhall, K.; Ostry, D.; Flanagan, J. (1991). "Coordinate spaces in speech planning". Journal of Phonetics. 19 (3–4): 293–307. doi:10.1016/S0095-4470(19)30346-8.
O'Connor, J. D. (1973). Phonetics. Pelican. pp. 16–17. ISBN 978-0140215601.
O'Grady, William (2005). Contemporary Linguistics: An Introduction (5th ed.). Bedford/St. Martin's. ISBN 978-0-312-41936-3.
Ohala, John (1997). "Aerodynamics of phonology". Proceedings of the Seoul International Conference on Linguistics. 92.
"Phonetics, n." Oxford English Dictionary Online. Oxford University Press. 2018.
Roach, Peter (2015). "Practical Phonetic Training". Peter Roach. Retrieved 10 May 2019.
Saltzman, Elliot; Munhall, Kevin (1989). "Dynamical Approach to Gestural Patterning in Speech Production" (PDF). Ecological Psychology. 1 (4): 333–82. doi:10.1207/s15326969eco0104_2.
Scatton, Ernest (1984). A reference grammar of modern Bulgarian. Slavica. ISBN 978-0893571238.
Schacter, Daniel; Gilbert, Daniel; Wegner, Daniel (2011). "Sensation and Perception". In Charles Linsmeiser (ed.). Psychology. Worth Publishers. ISBN 978-1-4292-3719-2.
Schiller, Niels; Bles, Mart; Jansma, Bernadette (2003). "Tracking the time course of phonological encoding in speech production: an event-related brain potential study". Cognitive Brain Research. 17 (3): 819–831. doi:10.1016/s0926-6410(03)00204-0. hdl:11858/00-001M-0000-0013-17B1-A. PMID 14561465.
Sedivy, Julie (2019). Language in Mind: An Introduction to Psycholinguistics (2nd ed.). ISBN 978-1605357058.
Seikel, J. Anthony; Drumright, David; King, Douglas (2016). Anatomy and Physiology for Speech, Language, and Hearing (5th ed.). Cengage. ISBN 978-1-285-19824-8.
Skipper, Jeremy; Devlin, Joseph; Lametti, Daniel (2017). "The hearing ear is always found close to the speaking tongue: Review of the role of the motor system in speech perception". Brain and Language. 164: 77–105. doi:10.1016/j.bandl.2016.10.004. PMID 27821280.
Stearns, Peter; Adas, Michael; Schwartz, Stuart; Gilbert, Marc Jason (2001). World Civilizations (3rd ed.). New York: Longman. ISBN 978-0-321-04479-2.
Trask, R. L. (1996). A Dictionary of Phonetics and Phonology. Abingdon: Routledge. ISBN 978-0-415-11261-1.
Yost, William (2003). "Audition". In Alice F. Healy; Robert W. Proctor (eds.). Handbook of Psychology: Experimental psychology. John Wiley and Sons. p. 130. ISBN 978-0-471-39262-0.

External links

Wikisource has the text of The New Student's Reference Work article "Phonetics".
Media related to Phonetics at Wikimedia Commons.
Collection of phonetics resources by the University of North Carolina.
A Little Encyclopedia of Phonetics by Peter Roach.
Pink Trombone, an interactive articulation simulator by Neil Thapen.