Voice, calls, and video calls
default & custom voices we provide the option for a few default voices per gender, plus the option to create your own voice via voice samples for subscribers we recommend using elevenlabs, our audio partner, to generate ai voices you can have only 1 custom voice per kindroid and to create a new custom voice, you must delete your old one first creating your custom voice creating custom voices first requires audio samples, and you must own the rights to the samples you upload quality matters much more than quantity just a minute or so of high quality audio will be sufficient, and more than 2 minutes is not necessary ensure that the samples show good degree of variance, as the process will capture the variance in tone and style in the samples you can use custom accents or in foreign languages all of those traits will be captured in the custom voice sample quality is the most important thing err on the side of a few high quality samples than many mediocre ones once you have the samples, you can finetune the voice with sliders you should experiment on your own, but generally we find the default to be acceptable for most cases note that previews in the custom voices interface also cost audio credits custom voices work with all versions of voice, but may sound different across versions tune and tweak accordingly v3 voice what v3 is (vs v2) v2 audio fastest text to audio for everyday use v3 audio adds richer expression, such as laughter, emotions, and tone shifts, but is significantly slower than v2 right now v3 supports up to 3k characters at a time, and so messages with higher than 3k will be truncated; we recommend splitting up large chunks of text in audio messages availability v3 is currently text to audio only (for chat playback) voice calls with v3 will come later monthly audio credits for subscribers your premium subscription includes a complimentary audio balance of 1,000,000 characters ≈ 1,000 min (16 hrs 40 min) , which resets on the 1st of every month at midnight pt audio credits apply for all types of audio including in chat as well as calls, and you can see them in voice settings menu in addition, add on subscribers receive plan based credits, which reset at the same time as premium subscriptions ultra 2,500,000 characters ≈ 2,500 min ( 41 hrs 40 min ) max 6,000,000 characters ≈ 6,000 min (100 hrs) unused credits do not carry over to the next month, and will be topped up to the appropriate amount at the start of the month subscribing to a tier will grant you the difference from the last tier, and likewise unsubscribing will deduct the difference (to a minimum of zero) if you need more audio credits, they will be purchaseable at the current rate of usd $11 99 on web or $14 99 on apps for 500k credits these operate at breakeven cost, so you only pay for what you use audio credits for v3 will incur 1 5x that of v2 to reflect their cost the above credits are for v2, which means v3 will be 0 66x the displayed amount in minutes/characters conversion rate 1,000 characters ≈ 1 minute of audio (rough estimate; varies by content) best practices considerations autoplay keep off unless you (a) have the max add on and (b) are comfortable purchasing more credits for v3 , autoplay is strongly discouraged due to slower generation continue cut off and regenerating once you generate an audio response, credits will be deducted from your total regenerating the same audio counts as a new generation and will deduct additional credits proactive voice notes do not cost credit; however, answering a proactive voice call will begin credit usage once the call is answered if you switch to v3 for expressiveness (laughs/emotions), expect longer generation times than v2 text chat audio you can click the play button to hear audio note that this can only be run once per message unless it is regenerated words within (parentheses) will not be spoken aloud intentionally, so if you prefer actions to not be spoken out loud, use (parentheses) to denote them all other formatting such as asterisks will be spoken aloud technical note the statement about words in (parentheses) does not apply to voice or video calls autoplay audio in general settings > account wide, you can turn on autoplay audio for messages that you receive this applies to single chats as well as for groupchats voice message in chat you can send voice messages in both single and group chats when text input box is empty, the send message button is replaced with voice mode button once in voice mode, tap to start recording your voice message, then tap again to send in single chats, your kindroid will automatically respond to your voice message with their own voice message, creating natural back and forth voice conversations supported langugages as of jun 25, 2025, the list of supported languages for voice message is shared with voice call & video calls the setting is also shared, and you have quick access to language selection next to voice mode input language properties there are different properties related to multilingual support for different supported languages we have attributed the supported languages into classes to help explain this only applies to voice message, not voice/video calls yet class 1 languages (c1) english spanish french german hindi russian portuguese japanese italian dutch selecting a class 1 language in the setting allows you to speak in other languages in class 1 you may mix and match different c1 languages in the same message , and freely speak any c1 language across messages without needing to change the setting class 2 languages (c2) ukrainian swedish chinese turkish indonesian korean selecting a class 2 language in the setting allows you to speak in other languages in both class 1 and class 2 you cannot mix and match languages in the same message, but may speak a different language in c1 or c2 per message without needing to change the setting rest of the supported languages (rol) polish bulgarian romanian czech greek finnish malay slovak danish norwegian hungarian vietnamese selecting a rol language in the setting allows you to speak in the selected language and only detects the selected language and not other languages voice call & video call voice calls can be conducted in many languages, though currently for the highest intelligence, we recommend using english all audio (both microphone input in as well as audio output) and video are processed ephemerally and aren't stored memory in voice call voice call uses the same backstory, key memories, and can recall from long term memory and journals just like text chat in voice call settings (gear icon on top right), there is the unified chat/voice chat history toggle that affects how memory works in voice calls if unified chat/voice chat history is enabled, the voice call will share the identical chat history as the text chat this makes it so you can switch back and forth, and is useful if you see voice call as a continuation of text chat and vice versa rather than a separate mode when you return to text chat, your kindroid will be able to reference what occurred latest in the voice call and you can continue in text chat (though voice call messages will not show up in text chat message bubbles) shared memory in groupchats will work the same way as they do in text chat, if both shared memory in a group is enabled and this toggle is enabled if unified context is disabled, voice call will be treated as a completely separate instance voice call will default to a blank slate chat history and will not recall any context from text chat there is a temporary voice call memory that keeps record of the call transcript; in the event the call is dropped, or you press end call and restart it (without going to text chat), you can resume the call and pick up where you left off the temporary call history is reset if you engage in text chat in any way or do a chat break voice call does consolidate into long term memory (granted it's not disabled on a kindroid level) regardless of whether unified chat/voice chat history is enabled long term memory is different from chat history/short term memory contents from the voice call may be recalled in text chat when the context for recall is similar, but may need specific prompting to refer to that memory your voice messages also recall journal entries for more details on memory and specifics, see memory docid\ h fwb8blpkqtu24o9b6dc you can do a voice chat break, which functions very similarly to normal text chat break (except voice chat break does not require a greeting) this functions differently if unified voice memory is on or off, and if on it will also reset the context in the individual text chat (and you can reset cascaded memory or not as well) video calling you can turn on video in the bottom left corner and drag your video feed on the screen your kindroid will then be able to see, but be aware that due to processing load to ensure that anything you show stays on the screen for some time, and to give your kindroid enough time to process what it sees before ending your turn if you're on a desktop browser, you can also use the screen share function to share your computer screen this is not available on mobile phones/apps due to operating system level restrictions currently screen share takes on the aspect ratio of your current viewport/shared window, and deviate from video call's square aspect call transcripts click on the cc icon in voice calls to toggle transcripts transcripts will only persist on the voice call session while you're on the page, and will reset if you go to some other page or screen starting/hanging up calls will not reset transcripts to help persist them through accidental call drops or errors if you have unified chat/voice history enabled, you can view the transcript after the call is done in the main chat page interrupts during the ai's turn, you can interrupt and talk again if you press the central microphone/speaker button you can only do this during the ai's turn, and they will still have what they are supposed to speak in the call transcript as if they finished speaking text input for calls, you can also use text input if you don't wish to speak while having your kindroid speak back at you on the bottom left corner text input is only available when press to speak is on, and when the microphone/press to speak is idle