A family of methods has been developed for processing free-form text files using APL. We assume that a file has an arbitrary structure and contains embedded regions of statistical and numerical data. Each file is processed to extract the following information: (1) the embedded statistical and numeric regions; (2) the titles, keys (legends), and tic-mark identifiers associated with each region; and (3) dates and other pertinent information. Once these data are extracted, they are plotted as bar charts, line charts, pie charts, or a combination of these. An APL program that is automatically loaded and invoked performs these tasks. The program has been tested on hundreds of files with different structures, some of which contain multiple statistical regions.
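
To make the first extraction step concrete, the following is a minimal sketch of how embedded numeric regions might be located in free-form text. It is written in Python rather than APL, and it is not the authors' method: the token-ratio heuristic, the names `is_numeric_line` and `find_numeric_regions`, and the `threshold` and `min_rows` parameters are all assumptions made for illustration.

```python
import re

# Assumed number format: optional sign, digits with optional decimals, optional '%'.
NUMBER = re.compile(r"^[+-]?(\d+(\.\d*)?|\.\d+)%?$")


def is_numeric_line(line, threshold=0.5):
    """Return True when at least `threshold` of the tokens on the line are numbers.

    Hypothetical heuristic: commas are dropped so that "1,200" counts as a number.
    """
    tokens = line.replace(",", "").split()
    if not tokens:
        return False
    numeric = sum(1 for token in tokens if NUMBER.match(token))
    return numeric / len(tokens) >= threshold


def find_numeric_regions(lines, min_rows=2):
    """Return (start, end) index pairs, end exclusive, for runs of numeric lines."""
    regions = []
    start = None
    for i, line in enumerate(lines):
        if is_numeric_line(line):
            if start is None:
                start = i  # a new candidate region begins here
        else:
            if start is not None and i - start >= min_rows:
                regions.append((start, i))
            start = None
    # Close a region that runs to the end of the file.
    if start is not None and len(lines) - start >= min_rows:
        regions.append((start, len(lines)))
    return regions


sample = [
    "Quarterly sales report",
    "Region  Q1  Q2  Q3",
    "North   10  12  15",
    "South    8   9  11",
    "Notes: figures are in thousands.",
]
regions = find_numeric_regions(sample)
print(regions)
```

In this sketch, the lines immediately above a detected region (here "Region  Q1  Q2  Q3" and "Quarterly sales report") would be the natural candidates for keys and titles, although how the original program associates them with a region is not described in the abstract.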