Skip to content
View Houss3m's full-sized avatar
  • Qatar Computing Research Institute
  • Doha

Block or report Houss3m

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Houss3m/README.md

Hi there, I'm Houssam 👋

I am a Speech and AI researcher/engineer working on automatic speech recognition (ASR), Arabic speech technologies, and robust speech modeling beyond adult read speech.

My current work focuses on building and evaluating ASR systems for challenging real-world conditions, including Arabic dialects, code-switching, children’s speech, long-form speech, and streaming ASR. I am especially interested in the gap between research prototypes and reliable production-ready speech systems.


About Me

  • 🎙️ Research Assistant working on speech and AI at QCRI
  • 🔭 Currently working on:
    • Arabic and multilingual ASR
    • Algerian dialect and code-switching ASR
    • Children’s speech recognition
    • Streaming ASR systems
    • Robustness and adaptation under distribution shift
  • 🧠 Interested in both research and engineering: model training, evaluation, deployment, and reproducibility

Research & Engineering Interests

Automatic Speech Recognition

  • End-to-end ASR systems
  • Arabic dialect ASR
  • Code-switching ASR
  • Children’s ASR
  • Streaming and low-latency ASR
  • Long-form ASR
  • ASR evaluation, normalization, and benchmarking

Speech Model Adaptation

  • Adult-to-child ASR adaptation
  • Robustness under domain shift
  • Fine-tuning and LoRA-based adaptation
  • Weight-space model merging
  • Retention-aware adaptation

NLP & Speech-Language Technologies

  • Arabic text normalization
  • Punctuation restoration
  • Speech-to-text post-processing
  • Multilingual and code-switched language modeling

ML Engineering

  • PyTorch and Hugging Face workflows
  • ESPnet, k2/icefall, Sherpa-ONNX, and Whisper-based pipelines
  • Dataset preparation and large-scale evaluation
  • Reproducible experiments and benchmarking
  • Deployment-oriented ASR pipelines

Current Projects

🎧 AlgerianSpeech Platform

AlgerianSpeech is a platform dedicated to advancing speech recognition for Algerian Arabic, especially in realistic multilingual and code-switched settings involving Arabic, French, and English.

The project includes:

  • A speech annotation platform for Algerian dialect and code-switching speech
  • Real-world spontaneous speech collected from online recordings
  • Transcription and annotation workflows for multilingual speech
  • ASR evaluation pipelines using metrics such as WER, CER, and MER
  • Resources for building more robust ASR systems for underrepresented Arabic dialects

This work supports the broader goal of improving speech technology for Arabic dialects and low-resource multilingual communities.


Selected Areas I Work On

  • Arabic ASR benchmarking
  • Streaming ASR for Arabic
  • Code-switching recognition and analysis
  • Children’s speech recognition
  • ASR robustness and domain adaptation
  • Punctuation restoration for Arabic ASR output
  • Dataset curation, normalization, and evaluation design

Tools & Frameworks

  • Languages: Python, Bash, LaTeX
  • ML/DL: PyTorch, Hugging Face Transformers, NumPy, pandas
  • ASR: ESPnet, k2/icefall, Whisper, Sherpa-ONNX, NeMo
  • Evaluation: jiwer, custom WER/CER pipelines, Arabic normalization tools
  • Experimentation: Slurm, Conda, Git, Linux, GPU-based training/inference
  • Deployment/Inference: ONNX, streaming ASR pipelines, model serving workflows

Let's Collaborate

I am open to collaboration on projects related to:

  • Arabic ASR and dialectal speech technologies
  • Code-switching speech recognition
  • Children’s speech recognition
  • ASR benchmarking and evaluation
  • Open-source speech tools and datasets
  • Robust and deployable speech AI systems

Contact Me

GitHub LinkedIn


GitHub Activity

GitHub contribution streak

Popular repositories Loading

  1. 500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code 500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code Public

    Forked from ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

    500 AI Machine learning Deep learning Computer vision NLP Projects with code

    Jupyter Notebook 1

  2. NeuralNetworkTP NeuralNetworkTP Public

    Python 1

  3. Coursera-Deep-Learning Coursera-Deep-Learning Public

    Forked from y33-j3T/Coursera-Deep-Learning

    My notes / works on deep learning from Coursera

    Jupyter Notebook 1

  4. MTI-TP02 MTI-TP02 Public

    XML HTML...

  5. Application-de-chat-multiclient-server Application-de-chat-multiclient-server Public

    réalisation d'une application avec interface graphique en langage Java. Cette application permet la communication entre deux clients mais également entre plusieurs clients. Les communications sont …

    Java 1

  6. TA1 TA1 Public

    Un profile d’un utilisateur dans un réseau social qui contient une fiche avec une photo d’identité, une photo de coverture, et une liste des publications qui peuvent contenir des images.

    PHP