[Featured Image: Open Data Kit | Civic Commons]
Context: Household survey in rural Tanzania
Task: Combine data from XML files and display in an Excel File
Challenges:
Data Provided:
The following methods were attempted in extracting data. Some successful, some not, with clear reasons as to why not.
Method 1: Use Excel
Excel’s XML to CSV or XLS function is very easy to use but issues were:
Method 2: Use MS Access
The idea was to first import into Access and then to CSV. Importing to Access worked exceptionally well. It automatically creates related tables for nested XML tags. However issues were:
Method 3: Install ODK Aggregate and use inbuilt stable version of ODK Briefcase:
ODK Briefcase is the companion tool for ODK Aggregate to import and export data from ODK generated files. However the inbuilt version could only:
Method 4: KoBo Post Processor
KoBo Post Processor is a third party tool to aggregate data from XML files. However the issues with this were:
Method 5: Stand Alone Beta Version of ODK Briefcase
Using ODK Briefcase v1.0 Beta 2.jar is the method which worked but since there wasn’t much documentation on this, it took awhile to figure out and find. There are only a couple of threads on the ODK Developers Google Group which mention this. There were obviously a ton of issues considering it’s in Beta mode but after a number of tweaks and re-positioning data it worked. Issues faced during this process and remedies:
Forms
Tanzanian Survey
Instances
TanzanianSurvey-11032011-1203
submission.xml
Ideally all of this could have been done by writing a simple script and writing to the same CSV file. However I was aiming to find something which could have been replicable by a non-tech individual. Looking forward to the next version of ODK Briefcase.