2016-05-10 14:47 GMT+02:00 Moreno Andreo <moreno.andreo@xxxxxxxxxx>:
Il 10/05/2016 13:38, Nicolas Paris ha scritto:
Is this just to be quicker or are you going to add some business logic while converting CSV data?2016-05-10 13:04 GMT+02:00 Moreno Andreo <moreno.andreo@xxxxxxxxxx>:
Il 10/05/2016 12:56, Nicolas Paris ha scritto:
I always create binary files withHello,
What is the way to build a binary format (instead of a csv) ? Is there specification for this file ?
http://www.postgresql.org/docs/9.5/static/sql-copy.html
COPY table TO 'path/to/file' WITH BINARY
Fine, this works in this way :
postgresql -> binary
binary -> postgresql
The way I want is :
csv -> binary -> postgresql
As you mentioned ETL, I assume the second, as I don't think that converting CSV to binary and then loading it to PostgreSQL will be more convenient than loading directly from CSV... as quicker as it can be, you have anyway to load data from CSV.
Right, ETL process means huge business logic.
Right, ETL process means huge business logic.
get the data (csv or other) -> transform it -> produce a binary -> copy from binary from stdin
Producing 100GO CSVs, is a waste of time.
Binary file format is briefly described in the last part of the doc you linked, under "Binary format", and there's also reference to source files.
This is beyond my knowledge, ATM. I'm just starting with Java and JDBC is still in the TODO list, sorry... :-)
And if possible, transforming csv to binary throught java.
Cheers
Moreno.-
Documentation explains a bit. Moreover, I have found a detailled answer here :
My ultimate goal is to encapsulate it in a Talend component. (talend is an open-source java based ETL software).
Thanks, I ll keep you aware.
Thanks, I ll keep you aware.