A Math Formula Extraction and Evaluation Framework for PDF Documents