You answer the phone in English: “Hello?” The Japanese phone rings, gets picked up, and instead of konnichiwa or some other greeting that exists in normal conversation, the voice on the other end says: “Moshi moshi.“
It’s a phrase that exists almost nowhere else. Outside the phone, you’ll hear it perhaps when someone is trying to get the attention of a person who’s zoned out, or to verify that someone is still awake. It is, essentially, a word reserved for the moment when you can’t see who you’re talking to. The reason it exists has a strange history. Some of it is utilitarian. Part of it is folklore about ghosts.
What the word literally is
もしもし (moshi moshi) is a doubled form of an old verb stem. The verb is 申す (mousu), an archaic and humble form of “to say” — used historically when speaking to a superior or in formal contexts. Moshi moshi is roughly “I say, I say” or “I’m speaking, I’m speaking” — a polite, somewhat self-effacing announcement that someone on this end of the line is about to talk.
The doubling is the most distinctive feature. Most Japanese conversational openers are not doubled. Konnichiwa, ohayou gozaimasu, sumimasen — all stand alone. The fact that moshi moshi repeats has been the subject of considerable folk explanation, and one of the explanations is genuinely strange.
The ghost theory
One folk tradition holds that moshi moshi is doubled because Japanese ghosts and yokai cannot — or will not — say a word twice. The reasoning: by repeating “moshi,” the speaker is verifying their own humanity to a listener they cannot see. A spirit on the line would manage one “moshi” and then fall silent. Two means safe.
Whether this is the actual historical origin is not certain. Folklorists generally treat it as a post-hoc explanation rather than a clean etymology. But the cultural logic is real: Japanese folk tradition has a long history of supernatural beings who can mimic human speech only imperfectly, and verifying personhood through small linguistic challenges is a recurring motif. The phone, when it became common in the late 19th century, fit the same anxiety pattern — a voice without a body, possibly trustworthy, possibly not.
More documented as historical fact: when telephones first appeared in Japan in the 1890s, the phrase used to open calls drifted through several alternatives — oi oi, yes yes, moushiagemasu (“I humbly say”) — before settling on moshi moshi. The doubling stuck partly because the word “moshi” alone could be misheard, and partly because the redundancy made the message reliably arrive across early-era phone-line crackle.
Where you don’t say it
Native speakers know intuitively that moshi moshi is not for every phone call. There are two contexts where it’s clearly wrong, and using it in either makes you sound unfamiliar with phone etiquette.
Business calls
When a Japanese employee answers a business call, they do not say moshi moshi. They say something like “Hai, [company name] desu” — “Yes, this is [company].” The phrase identifies the institution, the speaker, and the willingness to receive the call, all in formal register. Moshi moshi in this context would read as casual, almost childish — the way “yo, what’s up” would read as out of place when answering a business call in English.
The same applies to business calls you place. The opening should be formal and identifying — your name, your company, brief context — not moshi moshi.
Calls to seniors and elders
Calling your boss, a teacher, a mentor, an older relative? Open with formal greeting and self-identification. Moshi moshi is appropriate downward (calling friends, family of equal or younger generation, casual contexts) but feels incorrect upward in formal Japanese conversation.
Where you do say it
The word’s natural home is informal phone calls between people of similar status: friends, casual family calls, calls where the relationship is already established and warm. It’s also the right choice when answering an unknown number — you don’t yet know who’s on the other end, and moshi moshi is a neutral, ungendered, unmarked opening.
Outside the phone, moshi moshi survives in a few specific situations. Trying to get the attention of someone who’s spaced out (moshi moshi, kikoeteru? — “hello, can you hear me?”). Verifying that someone fell asleep on the train and didn’t miss their stop. Calling out softly to someone who hasn’t noticed you. In all these uses, the through-line is the same: you’re trying to make verbal contact across a small gap, and the doubled word is doing the same job it does on a phone line — confirming presence on both ends.
The phone-as-social-distance signal
One thing the existence of moshi moshi reveals is that Japanese culture treats the phone as a categorically different communication context from face-to-face. A face-to-face greeting tracks the time of day (ohayou, konnichiwa, konbanwa) and the formality. A phone greeting cuts across all of that and hands you a single neutral word that says “we are now in phone-mode.”
This is consistent with broader Japanese conversational architecture, where the channel matters as much as the content. The honorifics shift, the politeness register shifts, the openings shift. Moshi moshi is the small linguistic flag that says: this is a voice without a face; calibrate accordingly.
The decline
Mobile phones changed the landscape. When you can see the caller’s name on the screen, the function of moshi moshi as an identity-verifying opening loses some of its work. Younger Japanese speakers, especially when calling friends, often skip the moshi moshi entirely and just open with “nani?” (“what?”) or “ohayou” or whatever conversational opening fits the relationship — they already know who’s calling.
The word survives most strongly in three contexts: when the caller is unknown, when the relationship is mid-formal (not a business call, not a close friend), and when there’s a quality issue (“moshi moshi? kikoeru?” — “hello? can you hear me?”). These are precisely the contexts where the original function of the word — making verbal contact in the absence of visual confirmation — is still doing real work.
Using it yourself
For a non-native making informal calls in Japan, moshi moshi is a fine default opening for a friend or someone whose number you have but who hasn’t called you before. The intonation rises slightly on the second “moshi” — a small upward inflection that signals “are you there?” rather than a flat declarative.
For business calls, skip it. For calls to seniors, skip it. For non-phone uses (waking someone up gently, getting attention across a room), it’s still in current use and signals warmth.
The principle underneath
The interesting thing about moshi moshi is what its survival signals about Japanese conversational habits. The word is doing a job — confirming presence across a gap — that English speakers handle through tone, context, and continuation (“hello? Are you there? Hello?”). Japanese has a single small phrase reserved for that exact need, and the phrase is built from a humble form of the verb “to say,” doubled for redundancy, with a folk story about ghosts attached.
Whether or not the ghosts are real, the cultural logic is real: when you cannot see the speaker, you have to verify them. The word itself was a tool for that. Phones are everywhere, the spirits have largely retreated, and the word is still on the line. That’s the kind of small linguistic artifact a culture keeps because it once did important work and now does subtler work — and continues to be useful in both directions.