North American English regional phonology is the study of variations in the pronunciation of spoken North American English (English of the United States and Canada)—what are commonly known simply as "regional accents". Though studies of regional dialects can be based on multiple characteristics, often including characteristics that are phonemic (sound-based, focusing on major word-differentiating patterns and structures in speech), phonetic (sound-based, focusing on any more exact and specific details of speech), lexical (vocabulary-based), and syntactic (grammar-based), this article focuses only on the former two items. North American English includes American English, which has several highly developed and distinct regional varieties, along with the closely related Canadian English, which is more homogeneous geographically. American English (especially Western dialects) and Canadian English have more in common with each other than with varieties of English outside North America.

The most recent work documenting and studying the phonology of North American English dialects as a whole is the 2006 Atlas of North American English (ANAE) by William Labov, Sharon Ash, and Charles Boberg, on which much of the description below is based, following on a tradition of sociolinguistics dating to the 1960s; earlier large-scale American dialectology focused more on lexicology than on phonology.

Regional dialects in North America are historically the most strongly differentiated along the Eastern seaboard, due to distinctive speech patterns of urban centers of the American East Coast like Boston, New York City, and certain Southern cities, all of these accents historically noted by their London-like r-dropping (called non-rhoticity), a feature gradually receding among younger generations, especially in the South. The Connecticut River is now regarded as the southern and western boundary of the traditional New England accents, today still centered on Boston and much of Eastern New England. The Potomac River generally divides a group of Northeastern coastal dialects from an area of older Southeastern coastal dialects. All older Southern dialects, however, have mostly now receded in favor of a strongly rhotic, more unified accent group spread throughout the entire Southern United States since the late 1800s and into the early 1900s. In-between the two aforementioned rivers, some other variations exist, most famous among them being New York City English.

Outside of the Eastern seaboard, all other North American English (both in the U.S. and Canada) has been firmly rhotic (pronouncing all r sounds), since the very first arrival of English-speaking settlers. Rhoticity is a feature shared today with the English of Ireland, for example, rather than most of the English of England, which has become non-rhotic since the late 1700s. The sound of Western U.S. English, overall, is much more homogeneous than Eastern U.S. English. The interior and western half of the country was settled by people who were no longer closely connected to England, living farther from the British-influenced Atlantic Coast.

Another prominent differentiating feature in regional North American English is fronting of the /oʊ/ in words like goat, home, and toe and /u/ in words like goose, two, and glue. This fronting characterizes Midland, Mid-Atlantic, and Southern U.S. accents; these accents also front and raise the /aʊ/ vowel (of words like house, now, and loud), making yowl sound something like yeah-wool or even yale. Northern U.S. English, however, tends to keep all these vowels more backed. Southern and some Midland U.S. accents are often most quickly recognized by the weakening or deleting of the "glide" sound of the /aɪ/ vowel in words like thyme, mile, and fine, making the word spy sound something like spa.

One phenomenon apparently unique to North American American accents is the irregular behavior of words that in the British English standard, Received Pronunciation, have /ɒrV/ (where V stands for any vowel). Words of this class include, among others: origin, Florida, horrible, quarrel, warren, borrow, tomorrow, sorry, and sorrow. In General American there is a split: the majority of these words have /ɔr/ (the sound of the word oar), but the last four words of the list above have /ɑr/ (the sound of the words are). In Canada, all of these words are pronounced as /oʊr/ (same as General American /ɔr/ but analyzed differently). In the accents of Greater New York City, Philadelphia, and the Carolinas (and older Southern), most or all of these words are pronounced /ɑr/ (Shitara 1993).

Almost entirely based upon the findings and categorizations of the 2006 The Atlas of North American English (or ANAE), the following is one well-supported way to hierarchically classify North American English accents at the level of broad geographic regions, sub-regions, etc. Most of the rest of this article is organized according to this ANAE classification. Below, the accent represented by each branch, in addition to each of its own features, also contains all the features of the branch it extends from.

Combining information from the phonetic research through interviews of Labov et al. in the ANAE (2006) and the phonological research through surveys of Vaux (2004), Hedges (2017) performed a latent class analysis (cluster analysis) to generate six clusters, each with American English features that naturally occurred together and each expected to match up with one of these six broad U.S. accent regions: the North, the South, the West, New England, the Midland, and the Mid-Atlantic (including New York City). The results showed that the accent regions/clusters were largely consistent with those outlined in the ANAE.

Below is a chart of particular pronunciations of particular words that have more than an 86% likelihood of occurring in a particular cluster. The defining pronunciations are: pajamas with either the phoneme /æ/ or the phoneme /ɑ/; coupon with either /ju/ or /u/; Monday with either /eɪ/ or /i/; Florida with either /ɔ/ or other possibilities (such as /ɑ/); caramel with either two or three syllables; handkerchief with either /ɪ/ or /i/; lawyer as either /ˈlɔɪər/ or /ˈlɔjər/; poem with either one or two syllables; route with either /u/ or /aʊ/; mayonnaise with either two or three syllables; and been with either /ɪ/ or other possibilities (such as /ɛ/). Parentheses in the chart indicate that the likelihood of the pronunciation inside them occurs ovewhelmingly in a particular region (well over 50% likelihood) but still does not meet the >86% threshold set by Hedges (2017) for what necessarily defines one of the six regional accents. Blank boxes in the chart indicate regions where neither pronunciation variant particularly dominates over the other; in some of these instances, the data simply may be inconclusive or unclear.[30]

Presumed accent region (cluster)

pajamas

coupon

Monday

Florida

caramel

handkerchief

lawyer

poem

route

mayonnaise

been

North

/æ/

/ju/

/eɪ/

/ɔ/

2 syll.

(/ɪ/)

(/ɔɪ/)

South

/ɑ/

(/eɪ/)

(/ɔ/)

3 syll.

/ɪ/

/ɔj/

2 syll.

(/ɪ/)

West

/ɑ/★

/eɪ/

/ɔ/

/ɪ/

/ɔɪ/

(2 syll.)

(/ɪ/)

New England

/ju/

/eɪ/

3 syll.

/ɔɪ/

(2 syll.)

/u/

3 syll.

Midland

/æ/

/u/

/eɪ/

/ɔ/

2 syll.

/ɔɪ/★

(2 syll.)

Mid-Atlantic& NYC

/ɑ/

/u/

/eɪ/

3 syll.

/ɪ/

/ɔɪ/

(2 syll.)

/u/

(3 syll.)

/ɪ/

★ Hedges (2017) acknowledges that the two pronunciations marked by this star are discrepancies of her latent class analysis, since they conflict with Vaux (2004)'s surveys. Conversely, the surveys show that /æ/ is the much more common vowel for pajamas in the West, and /ɔɪ/ and /ɔj/ are in fact both common variants for lawyer in the Midland.

General American is an umbrella accent of American English perceived by many Americans to be "neutral" and free of regional characteristics. A General American accent is not a specific well-defined standard English in the way that Received Pronunciation (RP) has historically been the standard prestigious variant of the English language in England; rather, accents with a variety of features can all be perceived by Americans as "General American" so long as they lack certain sociolinguistically salient features: namely, that is, lacking regional features (such as R-dropping, which usually identifies an American speaker as being from the East Coast or South), ethnic features (such as the "clear L" sound, which often identifies speakers as being Hispanic), or socioeconomic features (such as th-stopping, which often identifies speakers of a lower-class background).

The English dialect region encompassing the Western United States and Canada is the largest one in North America and also the one with the fewest distinctive phonological features. This can be attributed to the fact that the West is the region most recently settled by English speakers, and so there has not been sufficient time for the region either to develop highly distinctive innovations or to split into strongly distinct dialectological subregions.[citation needed] The main phonological features of the Western U.S. and Canada are a completed cot-caught merger, a backed GOAT vowel (like the Northern U.S.), and a fronted GOOSE vowel (like the Southern U.S.).

The accents of Atlantic Canada are more marked than the accents of the whole rest of English-speaking Canada. English of this region broadly includes /ɑr/ (START) fronting and full Canadian raising, but no Canadian Shift (the vowel shift documented in Standard Canadian English).

All of Canada, except the Atlantic Provinces and French-speaking Québec, speaks Standard Canadian English: the relatively uniform variety of North American English native to inland and western Canada, linguistically related to the Pacific Northwest, a region extending from British Columbia south into the Northwestern United States (particularly Washington and Oregon). The vowel [ɛ] is raised and diphthongized to [ɛɪ] or [eɪ] and [æ] as [eɪ] all before /ɡ/ and /ŋ/, merging words like leg and lag[leɪɡ]; tang is pronounced [teɪŋ]. The cot–caught merger to [ɒ] creates a hole in the short vowel sub-system[31] and triggers a sound change known as the Canadian Shift, mainly found in Ontario, English-speaking Montreal, and further west, and led by Ontarians and women; it involves the front lax vowels /æ/, /ɛ/, /ɪ/. The /æ/ of TRAP is retracted to [a] (except before nasals, where it is raised and diphthongized to [eə]), then /ɛ/ (DRESS) and /ɪ/ (KIT) are lowered in the direction of [æ] and [ɛ] and/or retracted; the exact trajectory of the shift is still disputed.[32] Increasing numbers of Canadians and Northwestern Americans have a feature called "Canadian raising", in which the nucleus of the diphthongs/aɪ/ and /aʊ/ are more "raised" before voiceless consonants. Thus for Canadians and Northwestern Americans, word pairs like pouter/powder ([ˈpɐʊɾɚ] versus [ˈpaʊɾɚ]) and rider/writer are pronounced differently.

California, the most populated U.S. state, has been documented as having some notable new subsets of Western U.S. English. Some youthful urban Californians possess a vowel shift partly identical to the Canadian shift in its backing or lowering of each front vowel one space in the mouth. Before /ŋ/, /ɪ/ is raised to [i], so king has the same vowel as keen rather than kin.[33] Before /ŋ//æ/ may be identified with the phoneme /eɪ/, so rang is pronounced with the same vowel as ray. Elsewhere /æ/ is lowered in the direction of [a]. /ʊ/ is moving towards [ʌ], so put sounds more like putt. /ʌ/ towards [ɛ], so putt can sound slightly similar to pet. The vowel /oʊ/ (GOAT) may be more fronted, i.e. [ʉ] and [ɵʉ]. The pin–pen merger is complete in Bakersfield and rural areas of the Central Valley, and speakers in Sacramento either perceive or produce an approximation of this merger.[34]

As in Eastern New England, the accents of New York City, Long Island, and adjoining New Jersey cities are traditionally non-rhotic, while other greater New York area varieties falling under the same sweeping dialect are usually rhotic or variably rhotic. Metropolitan New York shows the back GOAT and GOOSE vowels of the North, but a fronted MOUTH vowel. The vowels of cot[kät] and caught[kɔət] are distinct; in fact the New York dialect has perhaps the highest realizations of /ɔ/ in North American English, even approaching [oə] or [ʊə]. Furthermore, the father vowel is traditionally kept distinct from either vowel, resulting in a three "lot-palm-father distinction".[35]

The r-colored vowel of cart is back and often rounded [kɒt], and not fronted as it famously is in Boston. New York City and its surrounding areas are also known for a complicated[citation needed]short-a split into lax [æ] versus tense [eə], so that words, for example, like cast, calf, and cab have a different, higher, tenser vowel sound than cat, catch, and cap. The New York accent is well attested in American movies and television shows, often exaggerated, and especially ones about American mobsters from the area. Though it is sometimes known as a "Bronx" or "Brooklyn accent", no research has confirmed differences of accent between the city's boroughs.

One vast super-dialectal area commonly identified by linguists is "the North", usually meaning New England, inland areas of the Mid-Atlantic states, and the North-Central States. There is no cot–caught merger in the North around the Great Lakes and southern New England, although the merger is in progress in the North-bordering Midland and is completed in northern New England, including as far down the Atlantic coast as Boston. The western portions of the North may also show a transitioning or completing cot-caught merger. The diphthong /aʊ/ is [aʊ~äʊ], and /oʊ/ remains a back vowel, as does and /u/ after non-coronal consonants (unlike the rest of the country). Indeed, in part of the North (much of Wisconsin and Minnesota), /u/ remains back in all environments. Where the Southeast has /ɔ/ the single word on, the North has /ɑ/. The Canadian raising of /aɪ/ (to [ʌɪ]) before voiceless consonants occurs is common in the North, and is becoming more common elsewhere in North America.

The traditional and linguistically conservative North (as defined by the Atlas of North American English) includes /ɑr/ being often raised or fronted, or both, as well as a firm resistance to the cot-caught merger (though possibly weakening in dialects reversing the fronting of /ɑ/[5]). Maintaining these two features, but also developing several new ones, a younger accent of the North is now predominating at its center, around the Great Lakes and away from the Atlantic coast: the Inland North.

This map shows the approximate extent of the Northern Cities Vowel Shift, and thus the approximate area where the Inland North dialect predominates. Note that the region surrounding Erie, Pennsylvania is excluded.

The Inland North is a dialect region once considered the home of "standard Midwestern" speech that was the basis for General American in the mid-20th century. However, the Inland North dialect has been modified in the mid-1900s by the Northern Cities Vowel Shift (NCS), which is now the region's main outstanding feature, though it has been observed to be reversing at least in some areas, in particular with regards to /æ/ raising before non-nasal consonants and /ɑ/ fronting[5][6]. The Inland North is centered on the area on the U.S. side of the Great Lakes, most prominently including central and western New York State (including Syracuse, Binghamton, Rochester, and Buffalo), much of Michigan's Lower Peninsula (Detroit, Grand Rapids), Toledo, Cleveland, Chicago, Gary, and southeastern Wisconsin (Milwaukee, Racine, Kenosha), but broken up by the city of Erie, whose accent today is non-Inland Northern and even Midland-like. The NCS itself is not uniform throughout the Inland North; it is most advanced in Western New York and Michigan, and less developed elsewhere. The NCS is a chain shift involving movements of six vowel phonemes: the raising, tensing, and diphthongization of /æ/ towards [ɪə] in all environments (cat being pronounced more like "kyat"), then the fronting of /ɑ/ to [a] (cot sounding like cat), then the lowering of /ɔ/ towards [ɑ] (caught sounding like cot, but without the two merging due to the previous step), then the backing and sometimes lowering of /ɛ/, toward either [ə] or [æ], then the backing and rounding of /ʌ/ towards [ɔ], so that (cut sounding like caught), then lastly the lowering and backing of /ɪ/ (but without any pin–pen merger).

New England does not form a single unified dialect region, but rather houses as few as four native varieties of English, with some linguists identifying even more. Only Southwestern New England (Connecticut and western Massachusetts) neatly fits under the aforementioned definition of "the North". Otherwise, speakers, namely of Eastern New England, show very unusual other qualities. All of New England has a nasal short-a system, meaning that the short-a vowel most strongly raises before nasal consonants, as in much of the rest of the country.

The local and historical dialect of the coastal portions of New England, sometimes called Eastern New England English, now only encompasses Northeastern New England: Maine, New Hampshire (some of whose urban speakers are retreating from this local accent), and eastern Massachusetts (including Greater Boston). The accents spoken here share the Canadian raising of /aɪ/ as well as often /aʊ/, but they also possess the cot-caught merger, which is not associated with rest of "the North". Most famously, Northern New England accents (with the exception of Northwestern New England, much of southern New Hampshire, and Martha's Vineyard) are often non-rhotic. Some Northeastern New England accents are unique in North America for having resisted what is known as father–bother merger: in other words, the stressed vowel phonemes of father and bother remain distinct as /a/ and /ɒ/, so that the two words do not rhyme as they do in most American accents. Many Eastern New England speakers also once had a class of words with "broad a"—that is, /a/ as in father in words that in most accents contain /æ/, such as bath, half, and can't, similar to their pronunciation in London and southern England. The distinction between the vowels of horse and hoarse is maintained in traditional non-rhotic New England accents as [hɒs] for horse (with the same vowel as cot and caught) vs. [hoəs] for hoarse, though the horse–hoarse merger is certainly on the rise in the region today. The /æ/ phoneme has highly distinct allophones before nasal consonants. /ɑr/ fronting is usual.

Rhode Island, dialectally identified as "Southeastern New England", is sometimes grouped with the Eastern New England dialect region, both by the dialectologists of the mid–20th century and in certain situations by the Atlas of North American English; it shares Eastern New England's traditional non-rhoticity (or "R Dropping"). A key linguistic difference between Rhode Island and the rest of the Eastern New England, however, is that Rhode Island is subject to the father–bother merger and yet neither the cot–caught merger nor /ɑr/ fronting. Indeed, Rhode Island shares with New York and Philadelphia an unusually high and back allophone of /ɔ/ (as in caught), even compared to other communities that do not have the cot–caught merger. In the Atlas of North American English, the city of Providence (the only Rhode Island community sampled by the Atlas) is also distinguished by having the backest realizations of /u/, /oʊ/, and /aʊ/ in North America. Therefore, Rhode Island English aligns in some features more with Boston English and other features more with New York City English.

Recognized by research since the 1940s is the linguistic boundary between Eastern and Western New England, the latter settled from the Connecticut and New Haven colonies, rather than the Massachusetts Bay and Plymouth colonies. Western New Englanders settled most upstate New York the Inland North. Dialectological research has revealed some phonological nuances separating a Northwestern and Southwestern New England accent. Vermont, sometimes dialectally identified as "Northwestern New England", has the full cot-caught merger and /ɑr/ fronting of Boston or Maine English, and yet none of the other marked features of Eastern New England, nor much evidence of the NCS, which is more robustly documented, though still variable, in Southwestern New England. Rhoticity predominates in all of Western New England, as does the father–bother merger of the rest of the nation. Southwestern New England merely forms a "less strong" extension of the Inland North dialect region, and it centers on Connecticut and western Massachusetts. It shows the same general phonological system as the Inland North, including variable elements of Northern Cities Vowel Shift (NCS)—for instance, an /æ/ that is somewhat higher and tenser than average, an /ɑ/ that is fronter than /ʌ/, and so on. The caught–cot merger is approximated in western Massachusetts but usually still resisted in Connecticut. The "tail" of Connecticut may have some character diffused from New York City English.

The North Central or Upper Midwest dialect region of the United States extends from the Upper Peninsula of Michigan westward across northern Minnesota and North Dakota into the middle of Montana. Although the Atlas of North American English does not include the North Central region as part of the North proper, it shares all of the features listed above as properties of the North as a whole. The North Central is a linguistically conservative region; it participates in few of the major ongoing sound changes of North American English. Its /oʊ/ (GOAT and /eɪ/ (FACE) vowels are frequently even monophthongs: [o] and [e], respectively. The movie Fargo, which takes place in the North Central region, famously features strong versions of this accent.[36] Unlike most of the rest of the North, the cot–caught merger is prevalent in the North Central region.

Blue represents major cities of the Southern accent; darker blue represents cities with the strongest features of this accent.[37] Purple represents definitively non-Southern accents (mostly Midland accents), which together with the Southern accent fall under a "Southeastern super-region" (defined in this section).[37] Red represents cities outside of that super-region.

The 2006 Atlas of North American English identifies a "Southeastern super-region", in which all accents of the Southern States, as well as accents all along their regional margins, constitute a vast area of recent linguistic unity in certain respects:[38] namely, the movement of four vowel sounds (those in the words GOOSE, STRUT, GOAT, and MOUTH) towards the center or front of the mouth, all of which is notably different from the accents of the Northern United States.

Fronting of /aʊ/ and /oʊ/: The gliding vowels/aʊ/ (as in cow or ouch) and /oʊ/ (as in goat or bone) both start considerably forward in the mouth, approximately [ɛɔ~æɒ] and [ɜu], respectively. /oʊ/ may even end in a very forward position[40]—something like [ɜy~œʏ]. However, this fronting does not occur in younger speakers before /l/ (as in goal or colt) or before a syllable break between two vowels (as in going or poet), in which /oʊ/ remains back in the mouth as [ɔu~ɒu].[41]

Lacking or transitioning cot–caught merger: The historical distinction between the two vowels sounds /ɔ/ and /ɒ/, in words like caught and cot or stalk and stock is mainly preserved.[38] In much of the South during the 1900s, there was a trend to lower the vowel found in words like stalk and caught, often with an upglide, so that the most common result today is the gliding vowel [ɑɒ]. However, the cot–caught merger is becoming increasingly common throughout the United States, thus affecting Southeastern (even some Southern) dialects, towards a merged vowel [ɑ].[42] In the South, this merger, or a transition towards this merger, is especially documented in central, northern, and (particularly) western Texas.[43]

The merger of pin and pen in Southern American English. In the purple areas, the merger is complete for most speakers. Note the exclusion of the New Orleans area, Southern Florida, and of the Lowcountry of South Carolina and Georgia. The purple area in California consists of the Bakersfield and Kern County area, where migrants from the south-central states settled during the Dust Bowl. There is also debate whether or not Austin, Texas is an exclusion. Based on Labov, Ash & Boberg (2006:68).

Pin–pen merger in transition: The vowels [ɛ] and [ɪ] often merge when before nasal consonants, so that pen and pin, for instance, or hem and him, are pronounced the same, as pin or him, respectively.[38] The merger is towards the sound [ɪ]. This merger is now firmly completed throughout the Southern dialect region; however, it is not found in some vestigial varieties of the older South, and other geographically Southern U.S. varieties that have eluded the Southern Vowel Shift, such as the Yat dialect of New Orleans or the anomalous dialect of Savannah, Georgia. The pin–pen merger has also spread beyond the South in recent decades and is now found in isolated parts of the West and the southern Midwest as well.

Rhoticity: Dropping of postvocalicr (and, in some dialects, intervocalicr) was historically widespread in the South, particularly in former plantation area.[44] This phenomenon, non-rhoticity, was considered prestigious across the nation before World War II, after which the social perception reversed. Rhoticity (sometimes called r-fulness), in which all r sounds are pronounced, historically found only in the Midland, Appalachia, and some other Southeastern regions, has now become dominant throughout the entire Southeastern super-region, as in most American English, and even more so among younger and female white Southerners; the only major exception is among African American Southerners, whose modern vernacular dialect continues to be mostly non-rhotic.[45] The sound quality of the Southeastern r is the distinctive "bunch-tongued r", produced by strongly constricting the root and/or midsection of the tongue.[46]

A band of the United States from Pennsylvania west to the Great Plains is what twentieth-century linguists identified as the "Midland" dialect region, though this dialect's same features are now reported in certain other pockets of the country too (for example, some major cities in Texas, all in Central and South Florida, and particular cities that are otherwise Southern). In older and traditional dialectological research, focused on lexicology (vocabulary) rather than phonology (accent), the Midland was divided into two discrete geographical subdivisions: the "North Midland" that begins north of the Ohio River valley area and, south of that, the "South Midland" dialect area. The North Midland region stretches from east-to-west across central and southern Ohio, central Indiana, central Illinois, Iowa, and northern Missouri, as well as Nebraska and Kansas where it begins to blend into the West. The South Midland dialect region follows the Ohio River in a generally southwesterly direction, moving across from Kentucky, southern Indiana, and southern Illinois to southern Missouri, Arkansas, southeastern Kansas, and Oklahoma, west of the Mississippi River. The distinction between a "North" versus "South Midland" was discarded in the 2006 Atlas of North American English, in which the former "North Midland" is now simply called "the Midland" (and argued to have a "stronger claim" to a General American accent than any other region) and the "South Midland" is considered merely as the upper portion of "the South"; this ANAE reevaluation is primarily on the basis of phonology. The Midland is characterized by having a distinctly fronter realization of the /oʊ/ phoneme (as in boat) than many other American accents, particularly those of the North; the phoneme is frequently realized with a central nucleus, approximating [əʊ]. Likewise, /aʊ/ has a fronter nucleus than /aɪ/, approaching [æʊ]. Another feature distinguishing the Midland from the North is that the word on contains the phoneme /ɔ/ (as in caught) rather than /ɑ/ (as in cot). For this reason, one of the names for the North-Midland boundary is the "on line". However, since the twentieth century, this area is currently undergoing a vowel merger of the "short o" /ɑ/ (as in cot) and 'aw' /ɔ/ (as in caught) phonemes, known as the cot-caught merger. Many speakers show transitional forms of the merger. The /æ/ phoneme (as in cat) shows most commonly a so-called "continuous" distribution: /æ/ is raised and tensed toward [eə] before nasal consonants, as in much of the country.

Atlanta, Georgia has been characterized by a massive movement of non-Southerners into the area during the 1990s, leading the city to becoming hugely inconsistent in terms of dialect.[47] Currently, /aɪ/ is variably monophthongized (as in the Southern U.S.); no complete cot–caught merger is reported; and the pin–pen merger is variable.

Charleston, South Carolina is an area where, today, most speakers have clearly conformed to a Midland regional accent, rather than any Southern accent. Charleston was once home to its own very locally-unique accent that encompassed elements of older British English while resisting Southern regional accent trends, perhaps with additional linguistic influence from FrenchHuguenots, Sephardi Jews, and, due to Charleston's high concentration of African-Americans that spoke the Gullah language, Gullah African Americans. The most distinguishing feature of this now-dying accent is the way speakers pronounce the name of the city, to which a standard listener would hear "Chahlston", with a silent "r". Unlike Southern regional accents, Charlestonian speakers have never exhibited inglide long mid vowels, such as those found in typical Southern /aɪ/ and /aʊ/.

Central and South Florida show no evidence of any type of /aɪ/ glide deletion, Central Florida shows a pin–pen merger, and South Florida does not. Otherwise, Central and South Florida easily fit under the definition of the Midland dialect, including the cot-caught merger being transitional. In South Florida, particularly in and around Miami-Dade, Broward, and Monroe counties, a unique dialect, commonly called the "Miami accent", is widely spoken. The dialect first developed among second- or third-generation Hispanics, including Cuban-Americans, whose first language was English.[48] Unlike the older Florida Cracker dialect, "Miami accent" is rhotic. It also incorporates a rhythm and pronunciation heavily influenced by Spanish (wherein rhythm is syllable-timed).[49]

The cities of the Mid-Atlantic States around the Delaware Valley (South Jersey, southeastern Pennsylvania, northern Delaware, and eastern Maryland) are typically classified together, their speakers most popularly labelled as having a Philadelphia accent or a Baltimore accent. While Labov et al. state that the dialect could potentially be included in the Midland super-region, the dialect is not included in Midland proper as a result of distinct phonological features defining the dialect.[50] The Mid-Atlantic split of /æ/ into two separate phonemes, similar to but not exactly the same as New York City English, is one major defining feature of the dialect region, as is a resistance to the Mary–marry–merry merger and cot-caught merger (a raising and diphthongizing of the "caught" vowel), and a maintained distinction between historical short o and long o before intervocalic /r/, so that, for example, orange, Florida, and horrible have a different stressed vowel than story and chorus; all of these features are shared between Mid-Atlantic American and New York City English. Other features include that water is sometimes pronounced [ˈwʊɾər], that is, with the vowel of wood; the single word on is pronounced /ɔn/ not /ɑn/, so that, as in the South and Midland (and unlike New York and the North) it rhymes with dawn rather than don; the /oʊ/ of goat and boat is fronted, so it is pronounced [əʊ], as in the advanced accents of the Midland and South. Canadian raising occurs for /aɪ/ (price) but not for /aʊ/ (mouth).

According to linguist Barbara Johnstone, migration patterns and geography affected the Philadelphia dialect's development, which was especially influenced by immigrants from Northern England, Scotland, and Northern Ireland.[51]

The Southern United States is often dialectally identified as "The South," as in ANAE. There is still great variation between sub-regions in the South (see here for more information) and between older and younger generations. Southern American English as Americans popularly imagine began to take its current shape only after the beginning of the twentieth century. Some generalizations include: the conditional merger of [ɛ] and [ɪ] before nasal consonants, the pin–pen merger; the diphthong /aɪ/ becomes monophthongized to [a]; lax and tense vowels often merge before /l/. The South Midland dialect (now considered the upper portion of the Southern U.S. dialect and often not distinguished phonologically) follows the Ohio River in a generally southwesterly direction, moves across Arkansas and some of Oklahoma west of the Mississippi, and peters out in West Texas; it also includes some of North Florida, namely around Jacksonville. It most noticeably has the loss of the diphthong [aɪ], which becomes [a]. It also shows fronting of initial vowel of /aʊ/ to [æʊ] (often lengthened and prolonged) yielding [æːʊ]; nasalization of vowels, esp. diphthongs, before [n]; raising of /æ/ to [e]; can't → cain't, etc.; fully rhoticity, unlike classical coastal varieties of older Southern American English, now mostly declined. In the Southern Vowel Shift of the early 1900s up to the present, [ɪ] moves to become a high front vowel, and [ɛ] to become a mid front unrounded vowel. In a parallel shift, the /i/ and /eɪ/ relax and become less front; the back vowels /u/ in boon and /oʊ/ in code shift considerably forward to [ʉ] and [ɞ], respectively; and, the open back unrounded vowel /ɑr/ in card shifts upward towards [ɔ] as in board, which in turn moves up towards the old location of /u/ in boon. This particular shift probably does not occur for speakers with the cot–caught merger. The lowering movement of the Southern Vowel Shift is also accompanied by a raising and "drawling" movement of vowels. The term Southern drawl has been used to refer to the diphthongization/triphthongization of the traditional short front vowels, as in the words pat, pet, and pit. these develop a glide up from their original starting position to [j], and then in some cases back down to schwa; thus: /æ/ → [æjə], /ɛ/ → [ɛjə], and /ɪ/ → [ɪjə].

The ANAE identifies two important, especially advanced subsets of the South in terms of their leading the Southern Vowel Shift (detailed above): the "Inland South" located in the southern half of Appalachia and the "Texas South," which only covers the north-central region of Texas (Dallas), Odessa, and Lubbock, but not Abilene, El Paso, or southern Texas (which have more Midland-like accents). One Texan distinction from the rest of the South is that all Texan accents have been reported as showing a pure, non-gliding /ɔ/ vowel,[43] and the identified "Texas South" accent, specifically, is at a transitional stage of the cot-caught merger; the "Inland South" accent of Appalachia, however, firmly resists the merger. Pronunciations of the Southern dialect in Texas may also show notable influence derived from an early Spanish-speaking population or from German immigrants.

The following Southeastern super-regional locations do not cleanly fit into any of the aforementioned subsets of the Southeast, and may even be marginal-at-best members of the super-region itself:

Chesapeake and the Outer Banks (North Carolina) islands are enclaves of a traditional "Hoi Toider" dialect, in which /aɪ/ is typically backed and rounded. Many other features of phonological (and lexical) note exist here too; for example, Ocracoke, North Carolina shows no cot–caught merger and its monophthongs are diphthongized (up-gliding) before /ʃ/ and /tʃ/ and Smith Island, Maryland shows an /i/ that is diphthongized (like the South) and no happy tensing.

New Orleans, Louisiana has been home to a type of accent with parallels to the New York City accent reported for over a century. This variety of New Orleans English has been locally nicknamed "Yat" since at least the 1980s, from a traditional greeting "Where y'at" ("Where are you at?", meaning "How are you?"). The Yat/NYC parallels include the split of the historic short-a class into tense [eə] and lax [æ] versions, as well as pronunciation of cot and caught as [kät] and [kɔət]. The stereotypical New York curl–coil merger of "toity-toid street" (33rd Street) used to be a common New Orleans feature as well, though it has mostly receded today. One of the most detailed phonetic depictions of an extreme "yat" accent of the early 20th century is found in the speech of the character Krazy Kat in the comic strip of the same name by George Herriman. Such extreme accents still be found in parts of Mid-City and the 9th ward, Jefferson Parish, as well as in St. Bernard Parish, just east of New Orleans. The novel A Confederacy of Dunces by John Kennedy Toole often employs the Yat accent.

Oklahoma City, Oklahoma, according to the ANAE's research, is not quite a member of the Midland dialect region.[52] Rather, its features seem to be a blend of the Western and Midland dialects. The overview of ANAE's studied features for Oklahoma City speakers include a conservative /aɪ/, conservative /oʊ/, transitional cot-caught merger, and variable pin–pen merger.

Savannah, Georgia once had a local accent that is now "giving way to regional patterns" of the Midland.[52] According to the ANAE, there is much transition in Savannah, and the following features are reported as inconsistent or highly variable in the city: the Southern phenomenon of /aɪ/ being monophthongized, non-rhoticity, /oʊ/ fronting, the cot–caught merger, the pin–pen merger, and conservative /aʊ/ (which is otherwise rarely if ever reported in either the South or the Midland).

St. Louis, Missouri is historically one among several (North) Midland cities, but it is largely considered by ANAE to classify under blends of Inland North accents, with the Northern Cities Vowel Shift (NCS), and Midland accents. The "St. Louis Corridor" demonstrates this variability in speakers following a line formed by U.S. Route 66 in Illinois (now Interstate 55), going from Chicago southwest to St. Louis. This corridor of speakers cuts right through the center of what is otherwise the firmly-documented Midland region. Older St. Louisans demonstrate a card-cord merger, so that "I-44" is pronounced like "I farty-four".[53] St. Louis resists the cot–caught merger and middle-aged speakers show the most advanced stages of the NCS,[43] while maintaining many of the other Midland features.

The dialect of the western half of Pennsylvania is like the Midland proper in many features, including the fronting of /oʊ/ and /aʊ/. The chief distinguishing feature of Western Pennsylvania as a whole is that the cot–caught merger is noticeably complete here, whereas it is still in progress in most of the Midland. The merger has also spread from Western Pennsylvania into adjacent West Virginia, historically in the South Midland dialect region. The city of Pittsburgh shows an especially advanced subset of Western Pennsylvania English, additionally characterized by a sound change that is unique in North America: the monophthongization of /aʊ/ to [a]. This is the source of the stereotypical Pittsburgh pronunciation of downtown as "dahntahn". Pittsburgh also features an unusually low allophone of /ʌ/ (as in cut); it approaches [ɑ] (/ɑ/ itself having moved out of the way and become a rounded vowel in its merger with /ɔ/).

^Dialects are considered "rhotic" if they pronounce the r sound in all historical environments, without ever "dropping" this sound. The father–bother merger is the pronunciation of /ɒ/ (as in cot, lot, bother, etc.) the same as /ɑ/ (as in spa, haha, Ma), causing words like con and Kahn and like sob and Saab to sound identical, with the vowel usually realized in the back or middle of the mouth as [ɑ~ä]. Finally, most of the U.S. participates in a continuous nasal system of the "short a" vowel (in cat, trap, bath, etc.), causing /æ/ to be pronounced with the tongue raised and with a glide quality (typically sounding like [ɛə]) particularly when before a nasal consonant; thus, mad is [mæd], but man is more like [mɛən].

^The only notable exceptions of the South being a subset of the "Southeastern super-region" are two Southern metropolitan areas, described as such because they participate in Stage 1 of the Southern Vowel Shift, but lack the other defining Southeastern features: Savannah, Georgia and Amarillo, Texas.