Artificial Intelligence Engineer

Sawera Khadium

From Vision to Reality: Crafting AI-Powered Solutions to Transform the Future.


About Me

Hi, I’m Sawera Khadium, an Artificial Intelligence Engineer with over 6 years of experience creating cutting-edge AI solutions. My journey in AI began with a passion for turning complex ideas into practical, real-world applications.My expertise includes Natural Language Processing (NLP), Computer Vision, Machine Learning, Data Analytics, Generative AI, and Backend Development. From building personalized chatbots to transforming low-quality videos into stunning visuals, I’m dedicated to pushing the boundaries of what AI can achieve.Driven by curiosity, I’m always exploring new tools and technologies to stay ahead of AI trends. For me, AI isn’t just work—it’s a passion. I thrive on solving problems, experimenting with tools, and creating solutions that inspire and amaze.Let’s create something extraordinary together! 🚀

Services

AI Solutions

  • Custom Chatbots: AI-powered conversational tools tailored to your needs.

  • Recommendation Systems: Enhance user engagement with personalized suggestions.

  • Generative AI Models: Create visuals or videos from text effortlessly.

ML Models

  • Predictive Analytics: Forecast trends with data-driven insights.

  • Image/Video Processing: Boost media quality with advanced AI techniques.

  • Classification & Clustering: Segment data intelligently for better insights and outcomes

Automation

  • Business Process Automation: Simplify operations and save time.

  • AI-Driven APIs: Scalable solutions for AI-powered applications.

  • Data Integration Pipelines: Seamlessly connect and process data from multiple sources.

Fun Portfolio Projects:

Real-ESRGANs Model: Upscaling Video Quality

Choose a GAN's model for example: RealESRGAN for now🎞 Step 1: Splitting Up the Video
First, we break the video into image frames.
💡 Step 2: Process Each image Frame with GANs
Utilize GANs to work some magic on each image to upscale it.
💾 Step 3: Saving the Improved image
After each image is upscaled, save it in original video sequence order.
🔄 Step 4: Putting it All Back Together
Finally, put all the upscaled images back together to create a super-duper enhanced video!

Fine Tune Stable Diffusion

I generated AI Magic Avatars of my own images by Fine tuning stable diffusion image generation model on google colab.
Using stable diffusion model and finetune on my own images and generated Lensa like AI Magic Avatars.

AI Image generation

I specialize in Text-to-Image and Image-to-Image models, leveraging tools like Stable Diffusion, ControlNets, DALL-E, MidJourney, OpenJourney, and LoRa. Through precise prompt engineering and thorough testing,
I’ve pushed the boundaries of image generation to deliver stunning visual results. My expertise includes experimenting with market-leading models and fine-tuning them to achieve highly customized outputs for diverse applications.

I𝐝𝐞𝐧𝐭𝐢𝐭𝐲 𝐟𝐨𝐜𝐮𝐬𝐞𝐝 𝐈𝐦𝐚𝐠𝐞 𝐠𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐨𝐧 𝐰𝐢𝐭𝐡𝐨𝐮𝐭 𝐅𝐢𝐧𝐞 𝐭𝐮𝐧𝐢𝐧𝐠

Recently came across this really good open source PuLID opensource model tested this really powerful AI model for Identity based image generation on my own images which turned out to be really good.
Now About this model
PuLID is a tuning-free ID customization approach. PuLID maintains high ID fidelity while effectively reducing interference with the original model’s behavior.
A single ID image is usually sufficient, you can also supplement with additional auxiliary images as well.
Add your favourite prompt with images and here you have it your own identity based images within seconds.

Runway Gen-3 Alpha!

I experimented with Runway Gen-3 Alpha Turbo to test its image-to-video capabilities and created something fun. Here's what I did:
🖼️ Used Dr. Strange illustrations
📜 Wrote a 3-4 line script with ChatGPT
🎙️ Turned the script into audio using ElevenLabs
💥 Animated images with Runway Gen-3 Alpha
👄 Tried lip-sync on a character (mind-blowing!)
🎶 Added music from Pixabay
🎬 Combined it all in Canva
Check out the final result!