Skip to content

Research overview

This page summarizes PaniniFS's active research tracks and their results.


Key results (state: February 2026)

Metric Value
Universal atoms 34 atoms (6 layers, 4 categories)
Languages covered 14 languages validated
Global coverage 76.8% (62 files, ~5.8M words)
EU coverage 7/7 languages ≥ 90% (calibrated corpus)
Wikipedia corpus 34/34 atoms present = 100% across 14 languages
Max breakthrough Japanese: 18.8% → 74.1% (+55.3pp)

See detailed coverage results →


Main tracks

1. Universal atoms and multilingual validation

2. Typological validation and experiments

  • Dhātu experiments v0.1 and typology — 20-language child-directed sample
  • Key discoveries:
  • Baby sign validation — pre-linguistic gestural primitives
  • Dhātu core set — 7 informational operators (COMM, ITER, TRANS, DECIDE, LOCATE, GROUP, SEQ)
  • Cross-language insight: Japanese kanji ↔ Chinese hanzi → atom independent of writing system

3. Human language and development

4. Semantic compression

  • Semantic compression — metrics, protocol, implementation
  • Toy corpus results: 12 sentences, coverage rate = 1.0, avg 3.67 primitives/encoding

5. Cloud strategies and infrastructure


Summary of semantic universals

PaniniFS's 34 atoms are organized into 4 ontological categories:

Category Atoms (selection) Sanskrit
PROCESS MOUVEMENT, COGNITION, COMMUNICATION, CRÉATION, EXISTENCE, SEEKING, FEAR, CARE… kriyā
RELATION RELATION, STRUCTURE, INVARIANCE, ORDRE, DOMINATION sambandha
QUALITY BON, GRAND, VRAI, INTENSE, ANCIEN, MESURE, PERCEPTION guṇa
ENTITY CHOSE, AGENT, CORPS, LIEU, MATIÈRE dravya

Theoretical cross-references: - NSM (Wierzbicka): GOOD, BAD, THINK, KNOW, FEEL, SAY, DO, HAPPEN, MOVE… - Jackendoff: GO, STAY, BE, CAUSE, HAVE, THING, PLACE, AMOUNT - Pustejovsky: FORMAL, AGENTIVE, TELIC, CONSTITUTIVE - Pāṇini: √gam, √jñā, √dṛś, √vac, √kṛ, √as, √labh…

See the complete table →


Full reading (book)

What's new (14 days)

See: What's new