Skip to main content
&Sageio
All posts

Blog

Swahili meeting translation: noun-class concord and Sheng code-switching

Swahili sorts nouns into many classes, and the class prefix ripples across the verb, adjective, and demonstrative. Why that breaks transcription, plus Sheng — and how to translate a Swahili meeting correctly.

By Ming · · 6 min read

Swahili is the lingua franca of East Africa — Kenya, Tanzania, Uganda, parts of the DRC — and it's structured in ways that quietly defeat tools built for European languages. It sorts every noun into one of many classes, and the class isn't decorative: its prefix echoes onto the verb, the adjective, and the demonstrative so the whole sentence agrees. It also packs a subject marker, a tense, an object marker, and the verb root into a single word. Add Sheng — the Swahili-English code-switching that runs Nairobi business and youth culture — and the coastal-versus-inland variation along the way, and "supports Swahili" on a feature list tells you very little. Here's what actually decides whether a Swahili meeting comes back usable.

Noun classes make agreement ripple across the sentence

Swahili sorts nouns into many classes — people, objects, abstractions, plants, and more — and each class carries a prefix that doesn't just sit on the noun: it echoes onto everything that agrees with it. "One nice book" is kitabu kimoja kizuri kile — the ki- of class 7 reappears on the number, the adjective, and the demonstrative. Make it plural and the whole line shifts to class 8: vitabu viwili vizuri vile, every ki- becoming vi-. People sit in a different class again (the m-/wa- class: mtu, "person," watu, "people"), with its own concord. If a recognizer mishears the class on the noun, the agreement it triggers downstream collapses — the verb, the adjective, and the demonstrative now disagree, and the sentence reads as broken rather than merely misspelled.

One word, several grammatical pieces

Swahili is agglutinative: a verb is built by stacking a subject marker, a tense marker, an optional object marker, and the root into one written word. Ninakupenda is ni-na-ku-penda — "I-(present)-you-love," four pieces, one word. Change the subject and tense and object and you change the prefixes, not the word boundaries: atakusaidia, "he/she will help you," is a-ta-ku-saidia. A recognizer that expects meaning to come in separate words mis-segments this — it may split one verb into fragments or fuse a marker onto the wrong neighbour, and the translation inherits the error. Getting Swahili right means modelling the morphology inside the word, not just the gaps between words.

Sheng is the Nairobi business register

In Nairobi, a lot of real professional talk isn't textbook Swahili — it's Sheng: a Swahili frame with English (and other) words dropped in, often carrying Swahili markers. "Tuta-deploy hii feature kwa sprint ijayo" is one normal sentence — English content words, Swahili grammar and tense. A tool that detects "Swahili" may leave the English untranslated; one that detects "English" leaves the Swahili. On top of that, the polished coastal Swahili of Mombasa and Zanzibar differs from the inland varieties spoken up-country, in vocabulary and rhythm both. A reader in another language needs a complete sentence rebuilt for them — not a half-translated line with the other half left as it was spoken.

Why "supports Swahili" isn't enough

A tool can list Swahili, transcribe a clean dictionary sentence, and still fall apart on the noun-class concord, the packed verb, and the Sheng your team actually speaks. The feature list won't tell you which. One real call will: does a native speaker read the captions and transcript and recognize how the room actually talked — the class agreement holding across the line, the verbs segmented right, the Sheng kept whole? For why this pattern repeats across under-served languages, see real-time translation for remote teams.

How to do it with Sageio

  1. Add bot@sageio.net to your Google Meet calendar invite. It joins on its own — no extension, nothing to install.
  2. Each participant picks their caption language. The Nairobi team reads clean Swahili, a colleague elsewhere reads clean English — both from the same spoken Swahili, at the same time. (Sageio translates into 20+ languages.)
  3. Everyone speaks naturally — full concord, packed verbs, Sheng and all. Translated captions appear in about two seconds.
  4. Afterward, a searchable transcript and an AI summary arrive within about five minutes, shared at the host's discretion.

(Today this runs on Google Meet; Zoom and Microsoft Teams support is coming soon.)

How to test any tool in five minutes

Say a sentence where the noun class has to ripple — vitabu viwili vizuri vile ("those two nice books") — and check whether the captions hold the vi- agreement across all four words instead of letting one slip back to ki-. Then say a normal Sheng line ("tuta-finish hii task kabla ya next week" — "we'll finish this task before next week") and see whether it keeps the English words whole while rendering the Swahili correctly. If the agreement breaks or the English comes back garbled, the tool wasn't built for spoken Swahili.

Is it private?

For anything that joins your meetings: Sageio doesn't use your meeting content to train AI models, and its AI vendors are contractually restricted from doing the same. Audio is processed in memory and discarded — only the text transcript and summary are kept, encrypted, in the region you choose (US, EU, or APAC). Enterprise customers can self-host the entire stack.

Frequently asked questions

Why do Swahili noun classes matter for transcription? Swahili sorts nouns into many classes, and the class prefix echoes onto the verb, adjective, and demonstrative so the whole sentence agrees — kitabu kimoja kizuri kile ("one nice book") versus vitabu viwili vizuri vile ("two nice books"). If a tool mishears the class on the noun, the agreement it triggers downstream collapses and the line reads as broken.

What makes Swahili verbs hard to recognize? Swahili is agglutinative: a verb packs a subject marker, tense, an object marker, and the root into one word — ninakupenda is ni-na-ku-penda, "I love you." A recognizer that expects separate words mis-segments it, and the translation inherits the error.

What is Sheng? The Swahili-English code-switching of urban Kenya — a Swahili grammatical frame with English words dropped in, often carrying Swahili markers, like "tuta-deploy hii feature." It's a normal business register in Nairobi. Tools that assume one language per sentence translate only half; correct handling rebuilds a full sentence in each target language.

Does coastal versus inland Swahili affect the result? It can. The polished coastal Swahili of Mombasa and Zanzibar differs from inland up-country varieties in vocabulary and rhythm. The honest check is whether a native speaker recognizes how your team actually speaks, not just textbook Swahili.

What does it cost to try? Every plan starts with a free 60-minute trial, no credit card required. After that, Professional is $49/month and Teams is $99 per seat/month (annual billing includes 2 months free); Enterprise is custom-priced.


If your team works in Swahili, the honest test is whether a native speaker reads the live captions and transcript and hears the actual meeting — the class agreement holding across each line, the verbs segmented right, the Sheng kept whole. Add the bot to your next call and let them judge.