Ape210K

Introduction

Ape210K is a dataset comprising 210,488 Chinese mathematical word problems. Each problem in the dataset involves a single unknown variable. The dataset also includes 5,000 test samples for evaluation purposes.

Example

{
	"id": "20830",
	"segmented_text": "一 本 书 看 了 35% , 还 剩 多 少 没 看 .",
	"original_text": "一本书看了35%,还剩多少没看.",
	"ans": "65%",
	"equation": "x=1-35%"
}