After working on question that was related to the field of duration, I found out that the type of it is String. I assume it was done for programs that have estimated durations (i.e 12-16 months). However, from working with the data it seems that some funders use it to write a string, for example, “12 months” or “A year”. This makes it harder process the data and require a lot of data cleaning.
On the other hand, the cases of estimated durations are rare (I saw only two cases in the whole corpus).
My suggestion -
- Change the type from String to Number.
- Three publishers will have to update their files to fit the schema, but their datasets are relatively small and this can be easily re-published.
I assume we can’t fix it before version 1.0, but I think this is a minor change to the standard.