KenZakreski’s Profile | Apple Developer Forums

KenZakreski

From Grou, Friesland, Netherlands

Roles

Developer

Platforms

iOS
iPadOS
watchOS

Activity

Reply to Resolving co channel interference VOIP

Thank you Kevin.

Machine Learning & AI Core ML

21h

Reply to Co channel interference resolution

Thank you. Apple’s Push to Talk (PTT) framework is designed to provide a highly power-efficient, secure, and system-integrated foundation for walkie-talkie style applications. We want to provide a superior app experience. Can we implement on-device language translation (for example, Dutch or Frisian to English) over these frameworks?

Warmest regards,
Ken

Core OS

LiveCommunicationKit and Push-To-Talk Q&A

21h

Reply to Co channel interference resolution

Thank you for your reply. Let me rephrase the question.

Subject: Inquiry Regarding Architectural Overhead and Buffer Access in the Push to Talk Framework for Real-Time Core ML Blind Source Separation

Dear Apple Engineering Team,

We are currently developing an Apple-native communication platform that uses the Push to Talk framework alongside Core ML to handle real-time, on-device audio processing. We are working to resolve single-channel co-channel interference (overlapping voice streams) directly on the device.

Our current challenge lies in pipeline latency and background lifecycle constraints when intercepting incoming audio buffers. To cleanly separate overlapping voices before they reach the audio output mixer, we need to process the raw PCM data immediately upon arrival.

Could you please provide guidance on the following architectural questions:

Low-Latency Buffer Interception: What is the recommended design pattern within the PTChannelManagerDelegate flow to pass raw incoming audio buffers directly to a Core ML model running on the Apple Neural Engine (ANE) before the system routes them to AVAudioEngine for playback?

Background Thread Management: Given the strict background execution boundaries enforced by the Push to Talk framework, how can we best optimize thread scheduling so that our speech separation model completes its execution without triggering an OS background processing timeout or process termination?

Dynamic UI Manifestation: Once a combined audio stream is separated into two clean, distinct voice vectors on-device, what is the best approach for registering multiple PTParticipant states simultaneously so that the native system UI (such as the Dynamic Island) accurately reflects both speakers?

Thank you for your time, insights, and continued support of developer innovation within the iOS and iPadOS ecosystems.

Best regards,
Ken Zakreski
Founder, Marine Link Pro