AI-powered transcription for audio and video. Convert speech to text with timestamps and speaker detection.