Multimodal AI: The Future of Unified Intelligence Across Text, Image, Audio & Video