Developing and evaluating a chatbot to support maternal health care
Developed a chatbot for maternal health in India using stage-aware triage and hybrid retrieval, achieving 86.7% emergency recall.
Smriti Jha, Vidhi Jain, Jianyu Xu et al.
Developed a chatbot for maternal health in India using stage-aware triage and hybrid retrieval, achieving 86.7% emergency recall.
Smriti Jha, Vidhi Jain, Jianyu Xu et al.
CRYSTAL benchmark evaluates multimodal reasoning transparency using Match F1 and Ordered Match F1, revealing systematic flaws in existing models.
Wayner Barrios, SouYoung Jin
Structured distillation reduces personalized agent memory tokens by 11x while preserving retrieval capabilities.
Sydney Lewis
The study enhances performance in non-verifiable LLM post-training using reasoning LLM judges, with gpt-oss-120b as the gold standard.
Yixin Liu, Yue Yu, DiJia Su et al.
Porfolio-CEGAR-SEQ algorithm optimizes object packing and scheduling in 3D printing, reducing the number of printing plates used.
Pavel Surynek