This guide covers the full scope of language diversity in America, from the 150+ indigenous languages still spoken today to the immigrant languages reshaping city neighborhoods, the sign languages serving millions of Deaf Americans, and the endangered tongues that may disappear within a generation.
Whether you work in localization, education, healthcare, government services, or AI data operations, understanding this diversity is not optional. It is operational reality.
How Many Languages Are Spoken in the United States?
According to the US Census Bureau’s American Community Survey (ACS), approximately 67.8 million US residents (about 21.7% of the population aged 5 and older) speak a language other than English at home. The Endangered Languages Project and UNESCO list over 350 distinct languages currently used within US borders.
These break down into several categories:
- English is the dominant language, spoken by approximately 78% of the population as their primary home language.
- Spanish is the most widely spoken non-English language, with over 41 million speakers, roughly 13.5% of the population.
- Indigenous and Native American languages account for 150+ living languages, though many have fewer than 1,000 remaining speakers.
- Immigrant languages include Chinese (3.5 million speakers), Tagalog (1.8 million), Vietnamese (1.5 million), Arabic (1.3 million), French (1.2 million), and Korean (1.1 million).
- Sign languages include American Sign Language (ASL), used by an estimated 500,000 to 2 million people, plus regional and community sign varieties.
- Creoles and pidgins such as Louisiana Creole, Hawaiian Pidgin (Hawaiian Creole English), and Gullah Geechee.
Top 15 Non-English Languages in the US by Number of Speakers
| Rank | Language | Approximate Speakers | Primary Regions |
|---|---|---|---|
| 1 | Spanish | 41.8 million | California, Texas, Florida, New York, Illinois |
| 2 | Chinese (incl. Mandarin, Cantonese) | 3.5 million | California, New York, Hawaii |
| 3 | Tagalog (incl. Filipino) | 1.8 million | California, Hawaii, Nevada |
| 4 | Vietnamese | 1.5 million | California, Texas, Washington |
| 5 | Arabic | 1.3 million | Michigan, California, New York, New Jersey |
| 6 | French (incl. Cajun) | 1.2 million | Louisiana, Maine, New Hampshire |
| 7 | Korean | 1.1 million | California, New York, New Jersey |
| 8 | Russian | 960,000 | New York, California, Washington |
| 9 | Hindi | 890,000 | New Jersey, California, Texas, New York |
| 10 | Portuguese | 830,000 | Massachusetts, New Jersey, Florida, California |
| 11 | Haitian Creole | 800,000 | Florida, New York, Massachusetts |
| 12 | German | 780,000 | Pennsylvania, Ohio, Wisconsin, Texas |
| 13 | Urdu | 570,000 | New York, Texas, New Jersey, Virginia |
| 14 | Japanese | 520,000 | Hawaii, California, Washington |
| 15 | Gujarati | 490,000 | New Jersey, New York, California, Texas |
Source: US Census Bureau American Community Survey estimates. Figures are approximate and vary by survey year.
Indigenous and Native American Languages
Before European colonization, an estimated 300 to 600 distinct languages were spoken across what is now the United States. Today, roughly 150 to 175 indigenous languages survive, but the majority are critically endangered.
Major Language Families
North American indigenous languages belong to dozens of language families with no genetic relationship to each other or to European languages. The major families include:
- Algonquian: Ojibwe, Cree, Blackfoot, Cheyenne, Arapaho. Historically one of the most widespread families across the Great Lakes, Northeast, and Plains regions.
- Na-Dene (Athabaskan): Navajo (Dine Bizaad), Apache, Tlingit, Gwich’in. Navajo is the most widely spoken Native American language in the US, with approximately 170,000 speakers.
- Uto-Aztecan: Hopi, Comanche, Shoshoni, O’odham (Pima-Papago). Spans from the Great Basin to Central America.
- Siouan-Catawban: Lakota, Dakota, Crow, Osage, Hidatsa. Historically dominant across the Plains.
- Iroquoian: Cherokee, Mohawk, Seneca, Oneida. Cherokee has its own writing system (the Sequoyah syllabary) and roughly 22,000 speakers.
- Muskogean: Choctaw, Chickasaw, Muscogee (Creek), Seminole. Centered in the southeastern US.
- Eskimo-Aleut: Yup’ik, Inupiaq, Aleut. Spoken in Alaska, where 20+ indigenous languages remain active.
Navajo: A Revitalization Success Story
Navajo (Dine Bizaad) stands out as the most spoken Native American language in the US, with approximately 170,000 speakers. The Navajo Nation operates Dine-language media, immersion schools, and has integrated the language into governance and daily life. Despite this relative strength, UNESCO still classifies Navajo as “vulnerable” because most fluent speakers are over 40 years old, and intergenerational transmission has declined.
The language gained global recognition through the Navajo Code Talkers of World War II, whose use of Navajo for military communications created an unbreakable code.
Language Loss and Revitalization Efforts
The US government’s historical boarding school policies (1870s-1960s) deliberately suppressed indigenous languages. Children were punished for speaking their native tongues, breaking intergenerational transmission across hundreds of communities.
The Native American Languages Act of 1990 and the Esther Martinez Native American Languages Preservation Act of 2006 established federal support for language revitalization. Current efforts include:
- Language immersion programs (e.g., Hawaiian language nests, Mohawk immersion in Kahnawake)
- Digital documentation projects using audio and video recording
- University programs in indigenous linguistics (University of Hawaii, University of Arizona, MIT)
- Technology projects creating keyboards, apps, and social media tools in Native languages
Hawaiian: Digital-Age Revival
Hawaiian (Olelo Hawaii) nearly died out in the mid-20th century, declining to fewer than 50 child speakers by 1980. The Punana Leo immersion preschools, established in 1984, reversed this decline. Today, approximately 24,000 people speak Hawaiian, immersion programs run from preschool through university, and Hawaiian holds co-official status alongside English in the state of Hawaii.
Immigrant Languages Reshaping American Cities
Immigration continuously reshapes America’s linguistic map. Some trends from the past two decades:
Spanish: Far More Than One Language
With 41.8 million speakers, Spanish is the second-most-spoken language in the US. But “Spanish” encompasses enormous dialectal variation: Mexican Spanish in the Southwest and Midwest, Caribbean Spanish (Puerto Rican, Dominican, Cuban) in New York and Florida, Central American varieties in Washington DC and Los Angeles, and South American dialects scattered across major metros.
These dialectal differences affect vocabulary, pronunciation, and cultural reference points. A healthcare interpreter working with a Guatemalan patient and a Cuban patient needs different regional knowledge, even though both speak “Spanish.”
Asian Languages: Fast-Growing Communities
Chinese, Tagalog, Vietnamese, Korean, Japanese, Hindi, and Urdu collectively account for over 10 million speakers in the US. Several patterns stand out:
- Chinese is not one language but a family: Mandarin, Cantonese, Fuzhounese, and other varieties coexist in cities like New York, where Fuzhounese speakers now outnumber Cantonese speakers in some Chinatown neighborhoods.
- Hindi and Urdu are linguistically similar but culturally distinct. Combined, they represent approximately 1.5 million US speakers.
- Tagalog is the third-most-spoken non-English language, concentrated in California and Hawaii’s large Filipino-American communities.
African Languages: The Newest Growth Sector
Languages like Somali, Amharic, Yoruba, Igbo, Swahili, and Hausa are growing rapidly in the US. Minneapolis-St. Paul has the largest Somali-speaking community outside East Africa. The Washington DC metropolitan area hosts significant Amharic-speaking Ethiopian and Eritrean communities. West African languages like Yoruba and Igbo are increasingly heard in Houston, Atlanta, and New York.
For organizations providing public services, healthcare interpretation, or educational support, these communities represent urgent and growing demand.
Arabic: Regional Diversity
Arabic is spoken by approximately 1.3 million Americans, but “Arabic” covers distinct regional varieties: Levantine (Syrian, Lebanese, Palestinian, Jordanian), Egyptian, Iraqi, Gulf, and North African (Moroccan, Algerian, Tunisian). Michigan’s Dearborn is home to one of the largest Arab-American communities in the country, with Lebanese, Yemeni, and Iraqi dialects all represented.
Sign Languages in the United States
American Sign Language (ASL) is used by an estimated 500,000 to 2 million people in the US, including Deaf, hard-of-hearing, and hearing individuals. ASL is a complete, natural language with its own grammar, syntax, and regional dialects. It is not a signed version of English.
Other sign languages used in the US include:
- Black ASL: A distinct variety of ASL that developed in segregated Deaf schools in the American South. It has its own vocabulary, grammatical features, and cultural context.
- Protactile Sign Language: Used by DeafBlind communities, incorporating touch-based communication.
- Plains Indian Sign Language (PISL): A Native American sign language historically used as a lingua franca across the Great Plains. Now critically endangered, with revitalization efforts underway.
- Signed Exact English (SEE) and other manual codes: These are not natural languages but manual representations of English used in some educational settings.
ASL interpreting services are required under the Americans with Disabilities Act (ADA) in healthcare, legal, educational, and government settings. Video Remote Interpreting (VRI) has expanded access significantly, especially in rural areas where in-person ASL interpreters are scarce.
Endangered Languages in the US
The US is one of the most active sites of language death in the world. UNESCO classifies languages on a scale from “safe” to “extinct,” and scores of American languages fall into the most critical categories.
How Many Languages Are at Risk?
Of the roughly 150-175 indigenous languages still spoken in the US, the Endangered Languages Project estimates that over 100 are critically endangered, meaning they are spoken only by elderly community members and are no longer being transmitted to children.
| UNESCO Category | Definition | US Examples |
|---|---|---|
| Vulnerable | Most children speak it, but restricted to certain domains | Navajo, Yup’ik |
| Definitely Endangered | Children no longer learn it as a mother tongue | Cherokee, Choctaw |
| Severely Endangered | Spoken by grandparents; parents may understand but do not speak to children | Crow, Osage, Mohawk |
| Critically Endangered | Spoken by few elderly speakers | Arapaho, Comanche, many Californian languages |
| Extinct | No living speakers | Eyak (last speaker died 2008), many others |
Why Language Diversity Matters
Each language encodes unique knowledge about the natural world, social organization, and human cognition. When a language dies, that knowledge is permanently lost. Linguists estimate that about half of the world’s 7,000+ languages will disappear by the end of this century, and the US is a major site of this loss.
For AI and NLP teams, the practical dimension is equally important: low-resource languages are precisely the ones where AI models perform worst, because training data does not exist. Building NLP tools for these languages requires the kind of native-speaker data collection and annotation that no amount of web scraping can replace.
Language Diversity by Region
Language diversity in the US is not evenly distributed. Some states and metropolitan areas are far more linguistically complex than others:
- California: The most linguistically diverse state, with over 200 languages spoken. Los Angeles County alone has significant populations speaking Spanish, Chinese, Korean, Armenian, Tagalog, Persian, Vietnamese, and Russian.
- New York: Queens, New York, has been called the most linguistically diverse place on Earth, with an estimated 138+ languages spoken in a single borough.
- Texas: Large Spanish-speaking populations plus growing Vietnamese, Chinese, Hindi, Urdu, and Arabic communities. Native American languages like Comanche and Alabama-Coushatta persist in small communities.
- Alaska: At least 20 indigenous languages across Eskimo-Aleut, Na-Dene, and Tsimshianic families. Four of these languages have co-official status alongside English.
- Hawaii: Hawaiian and English are co-official. Significant populations also speak Ilocano, Tagalog, Japanese, and Samoan.
What This Means for Organizations
America’s language diversity creates real operational requirements across multiple sectors:
- Healthcare: Federal law (Section 1557 of the ACA) requires healthcare providers receiving federal funding to provide language access services. With 350+ languages spoken in the US, hospitals in diverse metro areas need interpretation capacity in dozens of languages simultaneously.
- Legal services: Court interpreting is required under the Court Interpreters Act. Immigration courts alone handle cases in hundreds of languages, including many rare ones where qualified interpreters are scarce.
- Government services: Executive Order 13166 requires federal agencies to provide meaningful access to people with limited English proficiency (LEP). State and local governments in diverse areas face the same obligation.
- Education: Over 5 million English Language Learners (ELLs) are enrolled in US public schools, speaking more than 400 languages. Bilingual education programs, parent communication, and assessment translation all require multilingual capacity.
- AI and technology: Products launching in the US market need to handle multilingual user bases from day one. Voice assistants, chatbots, content moderation tools, and search algorithms all underperform when they assume English-only users.

