Data Engineering Hub

CSV

A CSV (comma-separated values) file is a delimited text file that uses commas to separate values, with each line of the file holding one data record. A record can have multiple fields, which are also separated by commas. A CSV file usually stores tabular data (numbers and text) in plain text, so each line contains the same number of fields.
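The record and field structure described above can be seen with a minimal sketch using Python's standard `csv` module. The sample data (names, ages, cities) is made up for illustration, and `io.StringIO` stands in for a real file.

```python
import csv
import io

# Hypothetical sample data: a header line followed by two records.
text = "name,age,city\nAda,36,London\nGrace,45,New York\n"

# csv.reader yields each line (record) as a list of string fields.
rows = list(csv.reader(io.StringIO(text)))
header, records = rows[0], rows[1:]

# In well-formed tabular CSV, every record has as many fields as the header.
same_width = all(len(record) == len(header) for record in records)

print(header)
print(records)
print(same_width)
```

Note that every field is read back as a string, including the ages.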

Extension: .csv

CSV Advantages

  • Human readable/writeable
  • Widely used/supported by most applications
  • Can be read in a memory-efficient way, one record at a time, without loading the whole file
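As a rough illustration of the memory-efficiency point, the sketch below processes a CSV one row at a time instead of loading it all at once. The function name `sum_column` and the sample columns are hypothetical. In practice the input would typically be a file opened with `open(path, newline="")`.

```python
import csv
import io


def sum_column(lines, column):
    """Sum a numeric column while holding only one row in memory at a time."""
    total = 0.0
    # DictReader pulls rows lazily from the underlying iterable.
    for row in csv.DictReader(lines):
        total += float(row[column])
    return total


# io.StringIO stands in for an open file object here.
sample = io.StringIO("item,price\napple,1.50\npear,2.25\n")
total = sum_column(sample, "price")
print(total)
```

Because the reader is an iterator, memory use stays roughly constant regardless of how many rows the file contains.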

CSV Disadvantages

  • Data is uncompressed
  • Does not support binary data
  • Unquoted fields can easily “break” the format if they contain the delimiter
  • No built-in support for metadata, such as a schema or column types
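The delimiter problem from the list above can be reproduced in a few lines. This is a sketch with made-up data: joining fields by hand produces a line that parses into the wrong number of fields, while `csv.writer` (with its default minimal quoting) wraps the affected field in quotes so it survives a round trip.

```python
import csv
import io

# Hypothetical record whose first field contains the delimiter.
record = ["Smith, John", "42"]

# Naive join: the comma inside the name looks like a field separator.
naive_line = ",".join(record)
naive_fields = next(csv.reader([naive_line]))

# csv.writer quotes any field that contains the delimiter.
buffer = io.StringIO()
csv.writer(buffer).writerow(record)
quoted_line = buffer.getvalue()

# Reading the quoted line back recovers the original two fields.
round_trip = next(csv.reader(io.StringIO(quoted_line)))

print(len(naive_fields))
print(quoted_line.strip())
print(round_trip)
```

The round trip also shows the metadata limitation: the value `42` comes back as the string `"42"`, because the file carries no type information.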