User Tools

Site Tools


howtos:workwithdata:data_format_vs_file_format

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
howtos:workwithdata:data_format_vs_file_format [2009/11/17 15:12]
shona.weldon
howtos:workwithdata:data_format_vs_file_format [2011/06/24 19:14] (current)
chris.strashok
Line 1: Line 1:
 ====== Data Format vs File Format ====== ====== Data Format vs File Format ======
  
-Get notes from Michael ​with matrix ​of file and data format+The import and export tools mainly interact ​with files external to the model. ​ Using these files requires some understanding ​of the type of files and the layout of the data in the files. ​ The import and export tools conceptualize that by file format ​and data format ​respectively.
  
-Conceptually what is the difference one is physical format ​of the file vs the order and format within the file.+**File format** ​is the the makeup ​of the file itself, for example text is ASCII and readable by various text editors and is pretty universal, and there are also various binary types that depend more specifically on the program reading ​and writing the files. 
 + 
 +**Data ​format** is the logical order and organization of the data within the file.  
  
-The import and export tools mainly interact with files external to the model. ​ Using these files requires some understanding of the type of files and the layout of the data in the files. ​ The import and export tools conceptualize that by file format and data format respectively. 
  
 Here are is an example of each tool using the parameters related to format: Here are is an example of each tool using the parameters related to format:
Line 13: Line 14:
 export (myVar[]; dataFormat=tab,​ fileFormat=text,​ dataFile=myVarOutput.txt) export (myVar[]; dataFormat=tab,​ fileFormat=text,​ dataFile=myVarOutput.txt)
 </​code>​ </​code>​
- 
 ===== File Formats ===== ===== File Formats =====
 These are the supported file formats for each tool These are the supported file formats for each tool
 ^fileFormat^Description^import^export^ ^fileFormat^Description^import^export^
-| text| standard ​| yes | yes | +| text| ASCII and readable by any text editor| yes | yes | 
-| dbf | standard | yes | yes | +| dbf | standard ​database format ​| yes | yes | 
-| shape | produced by arc info | yes | yes | +| shape((http://​www.esri.com/​library/​whitepapers/​pdfs/​shapefile.pdf)) ​ARCInfo GIS format ​| yes | yes | 
-| mif | produced by map info | yes | yes | +| mif((http://​www.gissky.com/​download/​Download/​DataFormat/​Mapinfo_Mif.pdf)) ​MapInfo GIS format (geographic object definition)  ​| yes | yes | 
-| mid |  ​produced by map info| yes | yes |+| mid | MapInfo GIS format (attribute table) ​| yes | yes |
 | tool | A file written by whatIf?'​s tool languages in binary format | yes | yes | | tool | A file written by whatIf?'​s tool languages in binary format | yes | yes |
  
 ===== Data Formats ===== ===== Data Formats =====
 These are the supported data formats for each tool These are the supported data formats for each tool
-^dataFormat^Description^import^export^ +^dataFormat^Description^import^export
-| coordinate | The data is in delimited columns where the first columns specify the element names in the respective dimension sets and the data is in the last column| yes | yes | +| coordinate | The data is in delimited columns where the first columns specify the element names in the respective dimension sets and the data is in the last column. \\ \\ Pros: hunt & peck (order resilience). This is the preferred format.| yes | yes | [[howtos:​workwithdata:​coordExample | See Example]] ​
-| tool | A file written by whatIf?'​s tool languages in binary format | yes | yes | +| tool | A file written by whatIf?'​s tool languages in binary format | yes | yes 
-| odometer | Data is delimited as each data item is read it fills the object in the order of it's dimensions. ​ No validation that the data lands in the right cells is done. | yes | yes | +| odometer | Data is delimited as each data item is read it fills the object in the order of it's dimensions. ​ No validation that the data lands in the right cells is done. \\ \\ Pros: row and column headings not required, good for reading legacy TOOL output ​| yes | yes | [[howtos:​workwithdata:​odomExample | See Example]] ​
-| mapping | | yes | yes | +| mapping | | yes | yes 
-| record | | yes | yes |+| [[howtos:​workwithdata:​importing_and_tabulating_record-based_datasets|record]] The data is in delimited columns where the source data is read into a two dimensional variable consisting of a numerical record id and a user defined set of fields. \\ \\ Pros: Can convert text data into integer codes, extract data sets that are not continuous, can easily tabulate or cross-tabulate variables. \\ Cons: The source data must have unique column descriptions for each field that are usually related in some way.  ​| yes | yes | [[howtos:​workwithdata:​recordExample|See Example]] | 
 +| whatIfGEO (rename) | | yes | yes | | 
 + 
 +Provide more conceptual description of the different data formats, their pros and cons, etc. FIXME
howtos/workwithdata/data_format_vs_file_format.1258470770.txt.gz · Last modified: 2009/11/17 15:12 by shona.weldon