soundscape

0.4.0 (indexed)

State-of-the-art audio toolkit: playback, recording, DSP effects, HLS streaming, background media controls, pluggable transcription and Compose UI components driven by a single coherent API.

Android · JVM · Native · Wasm · NadeemIqbal/soundscape

soundscape

A state-of-the-art audio library for Kotlin Multiplatform / Compose Multiplatform. Playback, recording, effects, streaming, background controls, and pluggable transcription, with one coherent API across Android, iOS, macOS native, Desktop JVM, and WebAssembly.

Why soundscape

The KMP/CMP audio space is a patchwork of half-finished solo libraries. Some do playback, some do recording, none cover the full stack with parity across targets. soundscape is the one library that ships a single coherent surface across every CMP target.

Platform support

In v0.3:

✅ Desktop crossfade (dual SourceDataLine with overlap)
✅ Web crossfade (Web Audio gain ramping between two HTMLAudioElements)
✅ soundscape-transcription-whisper (whisper.cpp via whisper-jni, Desktop JVM)
✅ soundscape-desktop-ffmpeg (AAC/ALAC/Opus/etc. on Desktop via JavaCV's GPL FFmpeg bundle; the GPL license applies)
✅ Linux MPRIS backend (UNTESTED - written against the spec + dbus-java docs, please report issues)
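
The crossfade entries above describe ramping gain between two overlapping outputs. As a rough illustration of that idea, here is a minimal sketch of an equal-power gain curve. The curve shape and the names `CrossfadeGains`, `equalPowerGains`, and `mixSample` are assumptions for illustration; the README does not say which curve soundscape uses.

```kotlin
import kotlin.math.PI
import kotlin.math.cos
import kotlin.math.sin

/** Gains for the outgoing and incoming track at one point of a crossfade. */
data class CrossfadeGains(val outgoing: Float, val incoming: Float)

/**
 * Equal-power crossfade curve (an illustrative choice, not confirmed as soundscape's curve).
 * [progress] runs from 0.0 (only the outgoing track) to 1.0 (only the incoming track);
 * values outside that range are clamped.
 */
fun equalPowerGains(progress: Float): CrossfadeGains {
    val t = progress.coerceIn(0f, 1f)
    val angle = t * (PI / 2)
    return CrossfadeGains(outgoing = cos(angle).toFloat(), incoming = sin(angle).toFloat())
}

/** Mixes one sample from each track during the overlap window. */
fun mixSample(outgoing: Float, incoming: Float, progress: Float): Float {
    val gains = equalPowerGains(progress)
    return outgoing * gains.outgoing + incoming * gains.incoming
}
```

With this curve the squared gains always sum to 1, which keeps perceived loudness roughly constant through the overlap. A linear ramp would be simpler but tends to dip in the middle.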

Deferred to a v0.3.x point release (see CHANGELOG.md for the rationale):

Apple + Android crossfade (need AVAudioEngine unification on iOS and dual-ExoPlayer refactor on Android).
iOS URL-stream effects via MTAudioProcessingTap (design doc captured; needs MediaToolbox cinterop work).
Windows SMTC + Mac NowPlaying-via-JVM (need native helpers we can't ship overnight).
AAudio low-latency Android engine.

Installation

// build.gradle.kts
commonMain.dependencies {
    implementation(project.dependencies.platform(libs.soundscape.bom))
    implementation(libs.soundscape.player)
    implementation(libs.soundscape.ui.compose)
}

Quick start

import io.github.nadeemiqbal.soundscape.core.AudioSource
import io.github.nadeemiqbal.soundscape.player.MediaItem
import io.github.nadeemiqbal.soundscape.player.SoundscapePlayer

val player = SoundscapePlayer.create()
player.setQueue(
    items = listOf(
        MediaItem(id = "1", source = AudioSource.Url("https://example.com/track-a.mp3")),
        MediaItem(id = "2", source = AudioSource.Url("https://example.com/track-b.mp3")),
    ),
)
player.play()  // fire-and-forget; observe via player.state / position

Transport methods (setQueue, play, and the other transport calls) are intentionally non-suspend so the WASM backend can call them synchronously from a user-gesture event handler; the browser autoplay policy rejects deferred calls. Wire them directly into Compose lambdas without launching a coroutine.
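
To make the fire-and-forget pattern concrete, here is a minimal, library-independent sketch. `SketchPlayer` and `SketchState` are hypothetical stand-ins, not soundscape types, and a plain listener list stands in for the flows the real player exposes.

```kotlin
/** Hypothetical playback states; not the soundscape API. */
enum class SketchState { Idle, Playing, Paused }

/** Hypothetical stand-in for a player with non-suspend transport calls. */
class SketchPlayer {
    private val listeners = mutableListOf<(SketchState) -> Unit>()

    var state: SketchState = SketchState.Idle
        private set

    /** Registers an observer and immediately replays the current state. */
    fun observe(listener: (SketchState) -> Unit) {
        listeners += listener
        listener(state)
    }

    // Plain (non-suspend) functions: they return immediately, so a click
    // handler can call them in the same turn as the user gesture.
    fun play() = update(SketchState.Playing)
    fun pause() = update(SketchState.Paused)

    private fun update(next: SketchState) {
        if (next == state) return // skip duplicate notifications
        state = next
        listeners.forEach { it(next) }
    }
}
```

A button handler would call `player.play()` directly and let the UI react to the observed state, rather than awaiting a result from the call itself.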

Recording

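val recorder = Recorder.create()
val session = recorder.start(RecordingConfig.preset(RecordingConfig.Quality.Voice, "/path/to/out.wav"))
// collect suspends until the flow completes, so run it in its own coroutine.
// `scope` is assumed to be a CoroutineScope you own (for example, from rememberCoroutineScope()).
val levelsJob = scope.launch { recorder.levels.collect { rms -> println("level=$rms") } }
// ... record for as long as needed ...
val result = recorder.stop()
levelsJob.cancel()
// result.outputPath is a filesystem path on Android/Apple/Desktop, or a blob URL on WASM.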
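
The recorder reports levels as RMS values. For readers unfamiliar with the term, the following sketch shows how a root-mean-square level can be computed from a buffer of 16-bit PCM samples. The function name `rmsLevel` and the 0.0 to 1.0 normalisation are assumptions; the README does not state the scale of the values emitted by `levels`.

```kotlin
import kotlin.math.sqrt

/**
 * Root-mean-square level of 16-bit PCM samples, normalised to 0.0..1.0.
 * The normalisation range is an assumption made for this sketch.
 */
fun rmsLevel(samples: ShortArray): Float {
    if (samples.isEmpty()) return 0f
    var sumOfSquares = 0.0
    for (sample in samples) {
        val x = sample / 32768.0 // scale to -1.0..1.0
        sumOfSquares += x * x
    }
    return sqrt(sumOfSquares / samples.size).toFloat()
}
```

RMS tracks perceived loudness better than peak amplitude, which is why it is a common choice for level meters.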

Effects

val chain = EffectsChain()
    .equalizer(EffectsChain.FlatEq10Band.map { it.copy(gainDb = 3f) })
    .reverb(ReverbPreset.Hall, wetDryMix = 0.4f)
    .compressor(thresholdDb = -12f, ratio = 4f)
    .gain(db = -3f)

player.setEffectsChainHandle(chain)
// Android: native audiofx attached to the ExoPlayer audio session.
// Apple:   AVAudioEngine path activates for AudioSource.File items.
// Web:     Web Audio graph wired between the audio element and destination.
// Desktop: pure-Kotlin DSP inserted in the PCM pipeline.
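
The `gain` and `compressor` parameters above are expressed in decibels. The sketch below shows the standard arithmetic behind those units on normalised float samples. It is not soundscape's DSP code; the function names are hypothetical and the compressor is reduced to a hard-knee static curve with no attack or release.

```kotlin
import kotlin.math.pow

/** Converts a gain in decibels to a linear amplitude factor: 10^(dB / 20). */
fun dbToLinear(db: Float): Float = 10.0.pow(db / 20.0).toFloat()

/** Applies a dB gain to normalised samples (-1.0..1.0), clamping the result to that range. */
fun applyGain(samples: FloatArray, db: Float): FloatArray {
    val factor = dbToLinear(db)
    return FloatArray(samples.size) { i -> (samples[i] * factor).coerceIn(-1f, 1f) }
}

/**
 * Static curve of a hard-knee compressor: above the threshold, the output
 * rises only 1/ratio dB per input dB. Returns the output level in dB.
 */
fun compressLevelDb(inputDb: Float, thresholdDb: Float, ratio: Float): Float =
    if (inputDb <= thresholdDb) inputDb else thresholdDb + (inputDb - thresholdDb) / ratio
```

Under these formulas, a gain of -3 dB scales amplitudes by roughly 0.71, and with a threshold of -12 dB and a ratio of 4, an input at -4 dB comes out at -10 dB.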

Background controls

val bg = BackgroundController.create()
bg.bind(player)
bg.setMetadata(Metadata(title = "Track A", artist = "Artist", durationMs = 240_000))
bg.transportActions.collect { action -> /* react to lockscreen / media-key events */ }

Android setup

// In your Application.onCreate
class MyApp : Application() {
    override fun onCreate() {
        super.onCreate()
        AndroidContextHolder.install(this)
    }
}

Add to your manifest if you use the Recorder:

<uses-permission android:name="android.permission.RECORD_AUDIO" />

Samples

samples/unified/ is a single Compose Multiplatform app that runs on every target with seven tabs covering every module (Player, Recorder, Effects, Streaming, Background, Transcribe, Visualizer).
samples/platforms/ ships separate native-idiomatic shells for Android, Desktop, Web, and iOS that demonstrate the platform-only OS integrations (MediaSession on Android, NowPlaying on iOS/macOS, etc).

Run the desktop sample:

./gradlew :samples:unified:desktopApp:run

Architecture

Each module publishes its own artifact and depends only on :core plus the modules it logically needs. Use the BOM to pin every artifact at one version.

ui-compose -> player, recorder, effects -+
background -> player --------------------+
streaming  -> player --------------------+-> core
transcription --------------------------+
player     -> effects (for backend offload bridges) ----+

Flow-driven throughout. StateFlow for state, Flow for streams (position, levels, transcripts), sealed types for errors. Transport methods (such as setQueue and play) are non-suspend so WASM can call them inside the same JS turn as the user gesture; everything else that does real I/O remains suspend.
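
As an illustration of why sealed types suit error reporting, here is a small sketch. The hierarchy `SketchAudioError` and its cases are hypothetical; soundscape's actual error types are not shown in this README.

```kotlin
/** Hypothetical error hierarchy; the names are illustrative, not soundscape's types. */
sealed interface SketchAudioError {
    data class Network(val url: String, val httpStatus: Int) : SketchAudioError
    data class UnsupportedFormat(val mimeType: String) : SketchAudioError
    object PermissionDenied : SketchAudioError
}

/**
 * Because the hierarchy is sealed, this `when` must cover every case;
 * adding a new error type later becomes a compile error here until it is handled.
 */
fun describe(error: SketchAudioError): String = when (error) {
    is SketchAudioError.Network -> "Network error ${error.httpStatus} for ${error.url}"
    is SketchAudioError.UnsupportedFormat -> "Unsupported format: ${error.mimeType}"
    SketchAudioError.PermissionDenied -> "Microphone permission denied"
}
```

The compiler-enforced exhaustiveness is the main benefit over string codes or open exception hierarchies.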

Roadmap

v0.2 (this release): real effects offload on all four backends, WASM recording via MediaRecorder, honest docs.
v0.3: crossfade across every backend, iOS URL-stream effects via MTAudioProcessingTap, Desktop OS controls (Linux MPRIS / Windows SMTC / macOS NowPlaying-via-JVM), soundscape-transcription-whisper (whisper.cpp via JNI), soundscape-desktop-ffmpeg (AAC/ALAC on Desktop), lower-latency Android engine (AAudio).

Contributing

PRs welcome. See CONTRIBUTING.md.

License

Apache 2.0. See LICENSE.

Platform support matrix

Module	Android	iOS	macOS	Desktop JVM	WASM
`soundscape-core`	✅	✅	✅	✅	✅
`soundscape-player`	✅	✅	✅	✅	✅
`soundscape-recorder`	✅	✅	✅	✅	✅ MediaRecorder
`soundscape-effects` (offload)	✅ audiofx	✅ AVAudioEngine (file sources)	✅ AVAudioEngine	✅ pure-Kotlin DSP	✅ Web Audio
`soundscape-streaming`	✅ HLS+DASH	✅ HLS	🚧 v0.3	✅ HLS	✅ HLS
`soundscape-background`	✅ MediaSession	✅ NowPlaying	✅ NowPlaying	🚧 v0.3 (use macosArm64 instead)	✅ MediaSession API
`soundscape-transcription`	✅ SpeechRecognizer	✅ SFSpeechRecognizer	✅ SFSpeechRecognizer	🚧 v0.3 via `-whisper`	✅ Web Speech (Chromium)
`soundscape-ui-compose`	✅	✅	✅	✅	✅