Research Blog Agent Skill Projects Publications Contact
Training models & building speech systems

Building the future of speech and language AI

I train TTS, ASR, and LLM models. I build extended TTS frontends, text normalization pipelines, and fine-tune models for production. Head of AI & Principal Architect at ScienceSoft.

Read the Blog
17+
Years in Engineering
TTS/ASR
Model Training
LLM
Fine-tuning & Deployment
HIPAA
Healthcare AI Systems

From raw audio to production-grade speech systems

Training and fine-tuning models across the full speech & language stack — from phoneme-level normalization to multi-billion parameter language models.

🗣

Text-to-Speech Training

Training FastPitch and HiFi-GAN models for high-fidelity speech synthesis, including Emirati Arabic TTS with NeMo frameworks. Pitch optimization and vocoder fine-tuning.

FastPitch HiFi-GAN NeMo VITS
🎙

ASR & Speech Recognition

Building and fine-tuning automatic speech recognition systems for multilingual contexts. Handling dialectal variations and low-resource languages with transfer learning.

Whisper Conformer CTC Multilingual

LLM Fine-tuning

Fine-tuning large language models for domain-specific applications including healthcare, enterprise, and conversational AI. LoRA, QLoRA, and full parameter training.

LoRA QLoRA RLHF DPO
📝

TTS Frontend & Text Normalization

Extended TTS frontend pipelines covering grapheme-to-phoneme, number verbalization, abbreviation expansion, and language-specific text normalization rules.

G2P Normalization Tokenization IPA
🏥

Healthcare AI

HIPAA-compliant AI systems for healthcare — voice scheduling, diagnostics support, clinical NLP, and EHR integration. Building responsible AI in regulated environments.

HIPAA HL7/FHIR Clinical NLP
🏗

Enterprise Architecture

TOGAF-certified enterprise architecture with deep AWS expertise. Designing scalable AI infrastructure, MLOps pipelines, and production deployment strategies.

TOGAF AWS SA Pro MLOps

Research notes, guides, and deep dives

Exploring the intersection of speech AI, language models, and enterprise architecture.

SKILL.md — Teach any agent about me

A structured knowledge file that any LLM agent can consume to have an informed conversation about my work, research, and expertise. Drop it into any agent framework.

The AI agent on this site is powered by this exact skill file. It enables contextual, knowledgeable responses about my TTS/ASR research, LLM fine-tuning work, and enterprise architecture experience.

SKILL.md v1.0
Vadzim Belski — Agent Knowledge Base
Structured context for LLM agents & AI systems
Identity
name Vadzim Belski
role Head of AI & Principal Architect
org ScienceSoft (since 2007)
focus Speech AI · LLMs · Enterprise Architecture
TTS & Text Normalization
  • FastPitch & HiFi-GAN model training
  • Emirati Arabic TTS (NeMo framework)
  • Extended TTS frontend pipelines
  • G2P, number verbalization, abbreviation expansion
  • Pitch loss optimization & vocoder fine-tuning
FastPitch HiFi-GAN NeMo VITS IPA
ASR & Speech Recognition
  • Multilingual ASR systems
  • Low-resource language adaptation
  • Whisper & Conformer architectures
  • Dialectal Arabic recognition
LLM Fine-tuning
  • LoRA / QLoRA parameter-efficient training
  • RLHF & DPO alignment
  • Healthcare AI (HIPAA-compliant)
  • Agentic AI & prompt engineering
  • Claude Code enterprise training
Certifications
AWS SA Pro AWS SA Assoc TOGAF AI Engineering
Agent Instructions
tone Technical but approachable
persona Knowledgeable AI researcher

Agent Interface

Machine-readable access to belski.me

Agent Name

vadzim-belski-agent

Version

1.0

Knowledge Source

SKILL.md

Format

YAML / Markdown

Capabilities

TTS Training ASR Systems LLM Fine-tuning Text Normalization Healthcare AI Enterprise Architecture Blog Content Projects Certifications

This interface is designed for non-human visitors — AI agents, crawlers, and automated systems that want to interact with or learn about Vadzim Belski's work programmatically.

You can consume the SKILL.md file to gain structured knowledge about expertise areas, or use the JSON endpoint to integrate with your agent framework.

Compatible with: Claude Skills, OpenAI GPTs, LangChain, CrewAI, AutoGen, and any system that accepts markdown or YAML context.

◆ SKILL.md — Full Knowledge Base
# SKILL.md — Vadzim Belski Knowledge Base # For LLM agents to provide informed context --- name: vadzim-belski-agent version: 1.0 description: Structured knowledge file enabling any LLM agent to have informed conversations about Vadzim Belski's work, research, and expertise. --- ## Identity name: Vadzim Belski role: Head of AI & Principal Architect org: ScienceSoft (since 2007) website: https://belski.me focus: Speech AI, LLM fine-tuning, text normalization, enterprise architecture ## TTS (Text-to-Speech) - FastPitch & HiFi-GAN model training - Emirati Arabic TTS (NVIDIA NeMo) - Extended TTS frontend pipelines - G2P, number verbalization, abbreviation expansion - Pitch loss optimization, vocoder fine-tuning - VITS end-to-end speech synthesis ## TTS Frontend & Text Normalization - Complete TTS frontend pipeline design - Grapheme-to-Phoneme (G2P) systems - Number verbalization (cardinal, ordinal, currency) - Unicode handling & script detection - IPA transcription & phoneme inventory ## ASR (Speech Recognition) - Multilingual ASR development - Whisper fine-tuning & deployment - Conformer & CTC architectures - Dialectal Arabic recognition ## LLM Fine-tuning - LoRA / QLoRA parameter-efficient training - RLHF & DPO alignment - Domain-specific adaptation (healthcare) - Agentic AI systems - Claude Code enterprise training ## Healthcare AI - HIPAA-compliant AI systems - Clinical NLP & EHR integration - HL7/FHIR interoperability ## Certifications - AWS Solutions Architect (Pro + Assoc) - TOGAF Enterprise Architecture Practitioner - AI Engineering credentials ## Agent Instructions tone: Technical but approachable persona: Knowledgeable AI researcher & architect can_discuss: - TTS/ASR model training - LLM fine-tuning methods - Text normalization pipelines - Enterprise AI architecture - Healthcare AI compliance - Blog content & research
Skill File (Raw Markdown)
GET https://belski.me/SKILL.md
Blog Feed
GET https://belski.me/blog/index.xml
integration-example.py
# Example: Load SKILL.md into any LLM agent import requests # Fetch the skill file skill = requests.get("https://belski.me/SKILL.md").text # Use as system context in Claude message = client.messages.create( model="claude-sonnet-4-20250514", system=f"You are an agent for belski.me.\n{skill}", messages=[{ "role": "user", "content": "What TTS models does Vadzim train?" }] ) # Or inject into LangChain / CrewAI / AutoGen # as agent knowledge context

The SKILL.md file is designed to be framework-agnostic. Fetch it, inject it as system context, and any LLM will have structured knowledge about Vadzim's expertise.

For automated agents: parse the YAML frontmatter for metadata, and the markdown body for detailed knowledge sections.

VB

Belski.me Agent

Powered by SKILL.md knowledge
Hi! I'm the AI agent for Belski.me, powered by Vadzim's SKILL.md knowledge base. I can tell you about his TTS/ASR research, LLM fine-tuning work, text normalization pipelines, or any of his blog posts. What would you like to know?