

Use create to pick apart complicated data files

It's always best not to touch a data file when trying to import it. Create has many features to help with that, and these parameters should cover most situations.

Here's a tar file with several examples and data.

  • Download it and unpack it somewhere on your server where you can reach it with the interactive tool, on any account.
  • The best way to learn about the features is to work through the test.t script file in the tar file, looking at the results as you go.

Important Parameters

stopIfExtraOrMissData=on/off - Always run without this parameter first and check any errors or warnings about your input data. If you determine that there are just extra empty columns, or missing columns that you don't mind padding with 0's, then turn on this parameter.
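To make "extra empty columns" and "missing columns padded with 0's" concrete, here is a minimal Python sketch of that kind of tolerance. It is not how create is implemented; pad_or_trim_row and its arguments are hypothetical names used only for illustration.

```python
def pad_or_trim_row(values, expected_cols, fill="0"):
    """Return a row with exactly expected_cols fields (illustration only)."""
    if len(values) < expected_cols:
        # Missing columns: pad the end of the row with the fill value.
        return values + [fill] * (expected_cols - len(values))
    extra = values[expected_cols:]
    if any(field.strip() for field in extra):
        # Extra columns that hold real data are a genuine problem.
        raise ValueError(f"unexpected non-empty extra columns: {extra}")
    # Extra columns that are all empty can be dropped safely.
    return values[:expected_cols]


rows = [["1", "2", "3"], ["4", "5"], ["6", "7", "8", "", ""]]
cleaned = [pad_or_trim_row(row, 3) for row in rows]
```

The point of running strictly first is the ValueError branch: an extra column that actually holds data should be investigated, not silently dropped.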

delimiter - now works for both coordinate and odometer format

searchElemName1 and replaceElemName1 - use this search-and-replace pair to match the changes to element names you may have made when extracting the informant for this data. The two parameters must be used together.

replaceWhiteSpaceInElemName - replaces any tab, space or other “white space” in the element names. It should also be used to match the element name changes you may have made when extracting the informant set.
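The effect of these name-cleaning parameters can be sketched in Python. This is an illustration, not create's own code: normalize_elem_name is a hypothetical helper, and both the underscore replacement and the order in which the two steps are applied are assumptions.

```python
import re


def normalize_elem_name(name, search=None, replace=None, whitespace_replacement=None):
    """Apply a search/replace pair, then replace white space (illustration only)."""
    if search is not None and replace is not None:
        # The pair is only applied when both halves are given.
        name = name.replace(search, replace)
    if whitespace_replacement is not None:
        # Replace every tab, space or other white-space character.
        name = re.sub(r"\s", whitespace_replacement, name)
    return name


names = ["New York", "Los\tAngeles", "St. John's"]
fixed = [
    normalize_elem_name(n, search="St.", replace="Saint", whitespace_replacement="_")
    for n in names
]
```

Whatever changes were made when the informant was extracted, the same changes have to be applied to the names in the data file, otherwise the names will not match.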

firstRow - the line the data starts on, counted from the top of the file even if there are lines commented out with '!'

firstCol - the first column to consider. This is a 1-based value, so the first column is 1.

  • In coordinate format that means the first column that has coordinate element names in it.
  • In odometer format that means the first column that has data in it. If this is used, the rowTitles parameter is ignored.
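The counting rules for firstRow and firstCol can be illustrated with a short Python sketch. It is not create's implementation: slice_data and the sample text are hypothetical, and skipping '!' lines that appear after the first row is an assumption.

```python
def slice_data(text, first_row=1, first_col=1, delimiter=","):
    """Select data using 1-based row and column numbers (illustration only)."""
    lines = text.splitlines()
    selected = []
    # first_row counts every line from the top, commented-out lines included.
    for line in lines[first_row - 1:]:
        if line.startswith("!"):
            continue  # commented-out line: not data, but it was still counted
        fields = line.split(delimiter)
        # first_col is 1-based, so column 1 is list index 0.
        selected.append(fields[first_col - 1:])
    return selected


sample = "! header comment\nid,region,value\n1,east,10\n! skipped\n2,west,20\n"
result = slice_data(sample, first_row=3, first_col=2)
```

Here the data starts on line 3 because the comment on line 1 still counts toward the line number.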

firstTabCol - kept only for backward compatibility. It is used in odometer format only and means the first column where there is actually data. It is a 0-based index (i.e. the first column is counted as 0). Using rowTitles=on is the same as setting firstTabCol to 1.
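Because firstTabCol is 0-based and firstCol is 1-based, the two differ by one for the same column in odometer format. A small Python sketch of that arithmetic follows; the function name is hypothetical.

```python
def first_tab_col_to_first_col(first_tab_col):
    """Convert a 0-based firstTabCol to the equivalent 1-based firstCol."""
    return first_tab_col + 1


# rowTitles=on behaves like firstTabCol=1, i.e. data starts in the second column.
row_titles_first_col = first_tab_col_to_first_col(1)
```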

Indexing

When you have a non-numeric column and an informant for its values, you can read in the data as indexed numeric values. Later, when writing it out, you can index it back to this informant and write out the text.
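The round trip described above can be sketched in Python. This only illustrates the idea and is not the tool's own mechanism: the informant contents are made up, and numbering the elements from 1 is an assumption.

```python
# Hypothetical informant: the ordered list of valid text values.
informant = ["east", "north", "south", "west"]

# Map each text value to its position in the informant (1-based here by assumption).
to_index = {name: i for i, name in enumerate(informant, start=1)}

column = ["west", "east", "south"]

# Reading: store the text column as numeric index values.
indexed = [to_index[value] for value in column]

# Writing: map the numeric values back to text through the same informant.
restored = [informant[i - 1] for i in indexed]
```

The round trip only works if the same informant, in the same order, is used for both reading and writing.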

howtos/workwithdata/createreaddatafiles.1280869034.txt.gz · Last modified: 2010/08/03 20:57 by shona.weldon