Projects

Selected work across NLP infrastructure, search, and Azerbaijani-language modeling. The current focus is the AzBERT pipeline — a 64k tokenizer plus a NeoBERT-style encoder trained from scratch for Azerbaijani — alongside the production legal-search system at NAIC.

2026

2025