Fathur.

AI and Software Engineer

Building agentic AI systems, speech AI, and scalable data-intensive products grounded in distributed systems principles.

Python
C/C++
PyTorch
TensorFlow
React
Next.js
TypeScript
Rust
Go
PostgreSQL
Docker
Node.js
FastAPI
LangChain
Tailwind

I pay attention to how people interact with systems.

Not the features they use. But the moments when they hesitate.

Crowd walking in the city

Movement as a shared language.

Crowds behave like software. Many intentions. One shared space.

People crossing a street in a crowd

I started watching before I started building.

I didn’t start with code.

I started with questions about clarity.

People walking on a subway platform

Patterns repeat when the environment stays the same.

Code became a way to care. For users I may never meet. For teammates I may never see.

Anonymous crowd walking at night

Empathy, without performance.

Clean systems reduce the need for explanation.

Crowd moving through a crossing

Silence is sometimes the most respectful interface.

When something is clear, people feel more confident using it.

Less friction. Less asking. More moving forward.

Blurred people walking on stairs

The interface should not be the obstacle.

I think in sequences.

Inputs. Transitions. Outcomes. And the space between them.

People moving through an urban space

Good work is often just well-managed transitions.

Engineering, to me, is less about solutions and more about transitions.

My work is shaped by observation.

City movement and crowd

I prefer systems that feel invisible.

Something that doesn't ask for attention — but earns it over time.

Crowd crossing in daylight

A calm interface is a form of respect.

This is how I work.

Carefully. With intention. So others can move without thinking about the system beneath them.

Crowd walking through a city at night

No conclusion. Just continuity.

Education

Institut Teknologi Bandung

B.Sc. in Computer Science

2023 - 2027 · Indonesia

Recent Awards

  • [International] 2nd Place - Data Analysis Competition 2025

    HIMASTA ITS · 2025

  • [National] Finalist - BRIN AIDeaNation 2025

    BRIN · 2025

  • [National] Finalist - Data Mining GEMASTIK XVIII

    Kemdiktisaintek · 2025

  • [International] Most Outstanding Student Award

    Matsuo-Iwasawa Lab, University of Tokyo · 2025

  • [National] Bakti BCA Scholarship Awardee

    PT Bank Central Asia Tbk. · 2025

  • [National] Honorable Mention - Data Analytics Dash COMPFEST 16

    COMPFEST 16 · 2024

  • [National] 5th Place - Objective Quest Airnology 3.0

    BEM FTMM Universitas Airlangga · 2024

  • [National] 4th Place - Data Analytics FIND IT! 2024

    DTETI Universitas Gadjah Mada · 2024

  • [National] 3rd Place - DATA SLAYER 1.0

    HMSD IT Telkom Purwokerto · 2024

  • [National] 1st Place - DINAMIK 17 Competitive Programming Contest

    FPMIPA Universitas Pendidikan Indonesia · 2022

What I built

Projects prove I can build. They turn ideas into working systems. Each one solved a problem or answered a question.

Competition & Community HMIF

Web platform for HMIF's Competition & Community division, built with Next.js to help association members track achievements and stay motivated. An initiative to increase member participation in competitions and community events.

Student Initiative · Achievement Tracking · Community Platform

Next.js · React · TypeScript · Tailwind CSS

Social Media Impact on Education

Interactive Tableau dashboard analyzing the impact of social media on education in Indonesia. Built a custom web scraping pipeline and reached the final round of COMPFEST 16 among nationwide participants.

National Competition · Final Round · Web Scraping

Tableau · Python · Selenium · BeautifulSoup

Klu: Sign Language Learning Platform

Full-stack sign language learning platform with real-time hand gesture recognition. Uses computer vision and machine learning for preprocessing and model training, with a React frontend.

Real-time Recognition · Computer Vision · Full-stack

React · Python · OpenCV · MediaPipe

Corrosion Rate Prediction Model

Machine learning model for ONNX Runtime that predicts corrosion rates for 9 materials under sulfidic/naphthenic acid conditions. Achieved an RMSE of 0.3.

RMSE 0.3 · Industrial Application · ONNX Runtime

Python · ONNX · Machine Learning · Scikit-learn

Oshi-no-Ko: 32-bit Kernel

Built a 32-bit OS kernel with memory paging, round-robin scheduling, hardware drivers (keyboard, VGA 13h, disk), and an EXT2 filesystem. Designed system calls and a user-space shell with core utilities.

VGA 13h Driver · EXT2 Filesystem · Shell Utilities

Operating Systems · C · Paging · Scheduling

Silberschatz: Mini DBMS

A mini DBMS to learn core internals. Includes a storage engine, buffer/page management, query execution, a simple optimizer, indexing, transactions, and a client-server interface.

Query Execution · Buffer Management · Client-Server

Database Systems · Storage Engine · Query Optimization · Indexing

Papers

Papers bring me closer to the theoretical side. Articles force clarity. Code demonstrates execution.

Agent-Based SQL Question Answering System Using Small Language Model

Aug 2025

Developed an agent-based Text-to-SQL system using Qwen-2.5-Coder-32B-Instruct with a multi-area architecture, MCP tool binding, and iterative validation loops. Evaluated on a Spider subset with hybrid similarity and SLM semantic equivalence metrics, and analyzed ambiguity/schema challenges with query reformulation strategies.

Text-to-SQL · Qwen-2.5-Coder · Agentic Systems · MCP · Spider Dataset · SQL

The Importance of Context and Avoiding Overengineering in Sentiment Analysis Problems

June 2025

Explored four text preprocessing strategies for sentiment analysis of Indonesian text, comparing them with a Gensim baseline. Contributed a comprehensive evaluation framework using TF-IDF and SVM classification.

NLP · Sentiment Analysis · TF-IDF · SVM · Indonesian Text

Modified Decision Tree Algorithm for Multivariate Nonlinear Data Imputation

January 2025

Proposed a custom decision tree model that handles missing values during data splitting for nonlinear multivariate imputation tasks. Achieved an average RMSE of 4.98 on a synthetic educational dataset.

Decision Trees · Data Imputation · Machine Learning · Missing Data

Singular Spectrum Analysis for Multivariate Time Series Forecasting

January 2025

Developed a forecasting model using SSA on daily food commodity prices in Bandung, Indonesia. Demonstrated SSA's ability to identify trends, seasonal patterns, and noise in non-stationary time series.

Time Series · SSA · Forecasting · SVD · Economic Data

Writing

Writing is how I express what I think about programming. Hobbies, observations, technical views.

Understanding Major Assignments in the AI Decade

A reflection on how major university assignments can provide new perspectives in the rapidly evolving AI era.

AI · Education · University · Reflection

Depth-First Search and Breadth-First Search

DFS and BFS are fundamental graph traversal algorithms that every programmer should at least know.

Algorithms · Computer Science · Programming · Data Structures
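
A minimal sketch of the two traversals, for readers who want the idea at a glance. The graph and the function names here are my own illustrative assumptions, not code taken from the article.

```python
from collections import deque

# Hypothetical example graph as an adjacency list (illustration only).
GRAPH = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D"],
    "D": [],
}


def dfs(graph, start):
    """Return nodes in depth-first (preorder) visit order, using an explicit stack."""
    visited, order = set(), []
    stack = [start]
    while stack:
        node = stack.pop()
        if node in visited:
            continue
        visited.add(node)
        order.append(node)
        # Push neighbors in reverse so the first-listed neighbor is explored first.
        stack.extend(reversed(graph[node]))
    return order


def bfs(graph, start):
    """Return nodes in breadth-first visit order, using a FIFO queue."""
    visited, order = {start}, []
    queue = deque([start])
    while queue:
        node = queue.popleft()
        order.append(node)
        for neighbor in graph[node]:
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(neighbor)
    return order
```

The only structural difference is the container: a stack goes deep first, a queue goes level by level.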

Machine Learning: Decision Tree, Information Gain, Entropy

The decision tree is a machine learning algorithm widely used for prediction. This article covers its basic concepts, including information gain and entropy.

Machine Learning · Decision Trees · Data Science · Algorithms
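
A small sketch of the two quantities named in the title, assuming the standard Shannon entropy in bits and a split given as a list of label subsets. The function names are hypothetical and not taken from the article.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum(
        (count / total) * math.log2(count / total)
        for count in Counter(labels).values()
    )


def information_gain(parent, children):
    """Entropy of `parent` minus the size-weighted entropy of the `children` subsets."""
    total = len(parent)
    weighted = sum(len(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted
```

A decision tree picks, at each node, the split with the highest information gain: a split that separates the classes perfectly recovers all of the parent's entropy, while a split that leaves each subset as mixed as the parent gains nothing.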

Experience: One Semester Studying Informatics at ITB

Experiences and reflections from my first semester studying Informatics Engineering at Institut Teknologi Bandung.

University · ITB · Student Life · Experience

Work record

Experience shows where I've applied my skills. Each role was about delivering specific results under real constraints.

May 2025 - Feb 2026

Part-time AI and Software Engineer

Fata Organa

Jakarta, Indonesia

Fata Organa is an Indonesia-based AI software company specializing in speech AI and applied machine learning. It is an exclusive partner of Empath, a division of Japan's CAC Corporation, developing technologies such as emotional speech analysis, LLM systems, and AI-powered enterprise applications.

  • Built an agentic SQL question-answering system for a Japanese project management platform, enabling natural-language access to internal databases and reducing engineering information lookup time by ~60%; implemented with Python, LangChain, and LLM-based query planning.
  • Automated the generation of Functional and Technical Specification documents using LLM pipelines, cutting manual documentation effort by ~70% and standardizing specification output across 10+ engineering modules.
  • Developed a developer-task recommendation engine leveraging agentic SQL-QA and structured project metadata, improving task allocation accuracy by ~35% and reducing manual task assignment time for project managers by ~50%.
  • Optimized small language model inference using Ollama, vLLM, and model quantization, achieving 2x faster response times and enabling reliable deployment in CPU-only environments (24 GB RAM) with <5% accuracy degradation.

Python · LangChain · LLMs · SQL · Ollama · vLLM

Sep 2025 - Oct 2025

AI Engineer

Legali AI

Remote, US

Legali AI is a San Francisco-based legal technology startup building AI-powered tools that help individuals navigate legal processes by organizing evidence, drafting legal documents, and preparing cases through an all-in-one litigation support platform.

  • Architected a multi-agent legal AI workflow that converts natural-language case descriptions into structured legal actions, enabling end-to-end self-service legal assistance for pro se users and reducing manual legal triage time by ~40%.
  • Built a jurisdiction-aware Retrieval-Augmented Generation (RAG) pipeline integrating document ingestion, OCR, hierarchical parsing, PII anonymization, embeddings, and region-filtered semantic retrieval, improving legal citation relevance by ~30% across multiple U.S. jurisdictions.
  • Developed an AI-driven legal document generation system that allows users to draft, revise, and export court-ready filings, reducing document preparation time from hours to minutes through template-guided prompting and legal clause retrieval.
  • Engineered scalable AI orchestration services with asynchronous processing, conversational memory, and source-traceable outputs, supporting 1,000+ document queries/day while maintaining reliable and explainable legal reasoning pipelines.

Multi-Agent Systems · RAG · OCR · Embeddings · Async Processing

Pro bono work

Unpaid work that sharpens skills and gives back. These roles tested my ability to design problems and validate solutions.

Nov 2024 - Present

Problem Setter

ARKAVIDIA 9.0

Developed and structured data science and machine learning challenges for Datavidia, covering a range of difficulties and real-world applications. Also carried out extensive data collection and feasibility testing for the problems.

Impact: Ensured high-quality and relevant datasets for competition tasks and validated problem fairness.

May 2024 - Jul 2024

Problem Setter

IMPACT ITB 4.0

Designed and formulated high-quality algorithmic and mathematical problems for the competition. Created problem sets across multiple domains and developed Higher-Order Thinking Skills (HOTS) problems to test participants' deep problem-solving skills.

Impact: Ensured a smooth and challenging experience for participants by providing fair and correct problems.

Figures

A way to remember what I've done. Visual memory of where I've been and what happened.

Selected figure 01

A pause within a shared moment.

Selected figure 02

Paths taken between obligations.

Selected figure 03

Tools arranged for quiet work.

Selected figure 04

Routine measured in steady stride.

Selected figure 05

Motion without intent to be seen.

Selected figure 06

Time held before moving on.

Selected figure 07

A moment passing without notice.

Selected figure 08

Scale felt through distance.

Selected figure 09

Presence caught in transit.

Selected figure 10

A frame that remains.

When the work aligns.

My work sits between software, data, and systems.

When a project needs clarity, structure, and long-term attention, I am available.

If the work makes sense, we can talk.

Muhammad Fathur Rizky (Fathur) | AI and Software Engineer | Fathurwithyou