Difference between revisions of "Uploading Data"

From Explore Analytics: The Wiki

Revision as of 07:15, 17 June 2012

Explore Analytics makes it easy to upload data in CSV format. If your data is in Excel, first save it as a CSV file from Excel.

Automatic Detection

Explore Analytics automatically detects the following properties of the file being uploaded.

  • Compression - the CSV file may be compressed in ZIP format. If so, it is automatically uncompressed during the upload.
  • Encoding - text in the CSV file may use any character encoding, including encodings for Latin languages, Asian languages, or the universal Unicode encoding.
  • Delimiter - CSV files may use a comma to delimit (separate) fields; a tab, space, vertical bar, or semicolon may also be used as a delimiter.
  • Field Names - the first line in the CSV file may contain field names, but a CSV file is valid without them. Whether field names are present is detected automatically; if they are absent, you will be prompted to provide them during the upload.
  • Field Types - the following types are automatically detected:
    • datetime - date or date and time
    • decimal - decimal number. The number may be formatted (e.g., $1,234.10) or unformatted (e.g., 1234.1). A wide variety of localized formats are automatically detected.
    • integer - integer number
    • varchar - varying-length character: text or any character data such as codes or abbreviations
  • Use of double quotes - double quotes may optionally enclose fields that contain the delimiter character. For example, if the delimiter is a comma, the value 1,234.00 is written as "1,234.00".
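
To make the delimiter, field-name, and double-quote items above more concrete, here is a minimal Python sketch of how such detection could work. It is not Explore Analytics' actual implementation, which is not documented here. The function names (parse, detect_delimiter, has_field_names) and both heuristics are assumptions made for illustration only.

```python
import csv
import io

# Candidate delimiters named in the list above: comma, tab, semicolon,
# vertical bar, and space.
CANDIDATE_DELIMITERS = [",", "\t", ";", "|", " "]


def parse(sample: str, delimiter: str) -> list[list[str]]:
    """Parse text with csv.reader, which honors double-quoted fields."""
    reader = csv.reader(io.StringIO(sample), delimiter=delimiter)
    return [row for row in reader if row]  # skip blank lines


def detect_delimiter(sample: str) -> str:
    """Hypothetical heuristic: pick the candidate that splits every row
    into the same number of fields, preferring the widest split."""
    best, best_width = ",", 1
    for delimiter in CANDIDATE_DELIMITERS:
        widths = {len(row) for row in parse(sample, delimiter)}
        if len(widths) == 1:
            width = widths.pop()
            if width > best_width:
                best, best_width = delimiter, width
    return best


def looks_numeric(cell: str) -> bool:
    """True if the cell parses as a number once thousands commas are removed."""
    try:
        float(cell.replace(",", ""))
        return True
    except ValueError:
        return False


def has_field_names(rows: list[list[str]]) -> bool:
    """Hypothetical heuristic: the first row holds field names if it has
    no numeric cells while at least one later row does."""
    if len(rows) < 2:
        return False
    first, rest = rows[0], rows[1:]
    return (not any(looks_numeric(cell) for cell in first)
            and any(looks_numeric(cell) for row in rest for cell in row))


if __name__ == "__main__":
    sample = 'name,amount\nAlice,"1,234.00"\nBob,56.10\n'
    delimiter = detect_delimiter(sample)
    rows = parse(sample, delimiter)
    print(repr(delimiter), has_field_names(rows), rows[1])
```

Because the rows are parsed with csv.reader before the fields are counted, the comma inside "1,234.00" is not mistaken for a delimiter. A file containing only text columns would defeat the field-name heuristic; in a case like that a tool would need to ask the user.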

When you select a file to upload, it is analyzed to determine whether your review is required. If your review is not required, you can still choose to review and make adjustments to properties that were detected by selecting "Review and make changes" in the upload dialog.
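
The four field types listed above (datetime, decimal, integer, varchar) can also be illustrated with a short Python sketch. It shows one plausible way to classify values and is not the product's actual logic. The names classify and infer_column_type, the list of date formats, and the handling of only US-style number formatting are all assumptions; the localized formats mentioned above are not covered.

```python
import re
from datetime import datetime

# Assumed date layouts for this sketch; a real detector would know many more.
DATE_FORMATS = ["%Y-%m-%d", "%Y-%m-%d %H:%M:%S", "%m/%d/%Y"]

# Optional sign and dollar sign, digits with or without thousands commas,
# and an optional fractional part (captured as group 3).
NUMBER = re.compile(r"[+-]?\$?(\d{1,3}(,\d{3})+|\d+)(\.\d+)?")


def classify(value: str) -> str:
    """Classify a single value as integer, decimal, datetime, or varchar."""
    text = value.strip()
    match = NUMBER.fullmatch(text)
    if match:
        return "decimal" if match.group(3) else "integer"
    for fmt in DATE_FORMATS:
        try:
            datetime.strptime(text, fmt)
            return "datetime"
        except ValueError:
            continue
    return "varchar"


def infer_column_type(values: list[str]) -> str:
    """Combine per-value types into one column type, ignoring empty cells."""
    kinds = {classify(v) for v in values if v.strip()}
    if not kinds:
        return "varchar"
    if len(kinds) == 1:
        return kinds.pop()
    if kinds <= {"integer", "decimal"}:
        return "decimal"  # a mix of integers and decimals fits in decimal
    return "varchar"      # any other mix falls back to text


if __name__ == "__main__":
    print(infer_column_type(["$1,234.10", "1234.1", "7"]))
    print(infer_column_type(["2012-06-17", "2012-06-17 07:15:00"]))
    print(infer_column_type(["NY", "42"]))
```

Falling back to varchar whenever a column mixes incompatible types keeps the sketch conservative. A column classified this way is also the kind of result a user might want to adjust through "Review and make changes".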