Skip to content
/ wedio Public

Wedio AI Studio - Professional AI Voice & Media Platform | ElevenLabs Alternative

License

Notifications You must be signed in to change notification settings

intellwe/wedio

Repository files navigation

Wedio AI Studio

Repository: https://github.com/intellwe/wedio

A professional AI-powered media processing suite for enterprises and content creators. Transform documents into videos, clone voices, transcribe meetings, and generate professional audio content with cutting-edge AI technology.

Wedio AI Studio

User Authentication & Analytics

Secure User Management

  • Firebase Authentication: Secure email/password based user authentication
  • User Profiles: Personal dashboards with usage statistics
  • Real-time Analytics: Track files processed, audio generated, videos created, and transcriptions
  • Plan Management: Free and Premium tier support
  • Session Management: Secure login/logout with persistent sessions

Personal Dashboard

  • Usage Statistics: Real-time tracking of user activity
  • Progress Monitoring: Visual analytics for content creation
  • Account Management: Profile settings and plan information
  • Protected Routes: Secure access to premium features

Enterprise Features

AI Transcription

  • Multi-Speaker Recognition: Advanced diarization technology automatically identifies and separates speakers
  • Real-Time Processing: Live recording with instant transcription
  • File Format Support: MP3, WAV, M4A, and more audio formats
  • Smart Export: JSON, TXT formats with speaker identification
  • AI Summaries: Generate executive summaries and key insights
  • Professional UI: Color-coded speakers with timestamps
  • User Tracking: Automatic stats tracking for transcription usage

Document to Video Converter

  • PDF Processing: Extract text from any PDF document
  • AI Content Enhancement:
    • Real translation (Hindi, English, and more)
    • Content summarization
    • Professional script formatting
  • Voice Narration: 40+ premium AI voices
  • Video Generation: Create professional videos with audio narration
  • SEO Optimization: Perfect for content marketing, e-learning, and presentations
  • Analytics Integration: Track document processing and video creation

Professional Text-to-Speech

  • Premium Voices: Choose from 40+ natural-sounding AI voices
  • Voice Customization: Adjust stability, clarity, and speech patterns
  • Bulk Processing: Handle large text documents (up to 5000 characters)
  • Multiple Formats: Download as MP3, WAV
  • Enterprise Quality: Studio-grade audio output
  • Usage Monitoring: Track audio generation minutes

Voice Cloning & Transformation

  • AI Voice Cloning: Transform any voice into premium AI voices
  • Real-Time Processing: Upload audio or record directly
  • Voice Settings: Fine-tune stability, style, and similarity
  • Professional Quality: Remove background noise automatically
  • Enterprise Applications: Perfect for audiobooks, podcasts, and presentations
  • Performance Tracking: Monitor voice processing statistics

Setup & Installation

Quick Start

  1. Clone the repository

    git clone https://github.com/intellwe/wedio.git
    cd wedio
  2. Install dependencies

    npm install
  3. Set up Firebase Authentication

    • Go to Firebase Console
    • Create a new project
    • Enable Authentication with Email/Password provider
    • Enable Firestore Database with these rules:
    rules_version = '2';
    service cloud.firestore {
      match /databases/{database}/documents {
        match /users/{userId} {
          allow read, write: if request.auth != null && request.auth.uid == userId;
        }
      }
    }
  4. Configure environment variables

    cp env.example .env

User Analytics Features

Dashboard Analytics

  • Files processed counter
  • Audio generation time tracking (in minutes)
  • Video creation statistics
  • Transcription usage metrics

Privacy & Security

  • All user data stored securely in Firestore
  • Authentication handled by Firebase Auth
  • Real-time database security rules
  • Client-side encryption for sensitive data

Use Cases

Enterprise Applications

  • Training Materials: Convert PDFs to narrated training videos
  • Meeting Transcription: Multi-speaker meeting notes with summaries
  • Content Localization: Translate documents and generate localized audio
  • E-Learning: Create course materials from existing documents

Content Creation

  • YouTube Videos: Generate narrated content from scripts
  • Podcast Production: Voice cloning and audio enhancement
  • Audiobooks: Convert written content to professional narration
  • Marketing Materials: Multi-language promotional content

Professional Services

  • Legal Transcription: Accurate speaker identification for depositions
  • Medical Transcription: Patient consultation recordings
  • Academic Research: Interview transcription with speaker diarization
  • Business Intelligence: Meeting analysis and key insight extraction

Authentication Flow

  1. Guest Access: View dashboard and feature descriptions
  2. Sign Up: Create account with email/password
  3. Authentication: Secure login with Firebase
  4. Protected Features: Access to all AI tools after authentication
  5. Real-time Stats: Live tracking of usage across all features
  6. Session Management: Persistent login with secure logout

License

This project is licensed under the MIT License - see the LICENSE file for details.


Made with ❤️ by IntellWe - Empowering enterprises with AI-powered solutions