Projects
Selected work across NLP infrastructure, search, and Azerbaijani-language modeling. The current focus is the AzBERT pipeline (a 64k tokenizer plus a NeoBERT-style encoder, both trained from scratch for Azerbaijani), alongside the production legal-search system at NAIC.
2026
2025
