On 24/11/05 4:42 pm, "Tom Lane" <tgl@xxxxxxxxxxxxx> wrote: > Adam Witney <awitney@xxxxxxxxxx> writes: >> On 24/11/05 4:19 pm, "Tom Lane" <tgl@xxxxxxxxxxxxx> wrote: >>> The question is, can you tell whether any data is actually missing? > >> Well each of these datasets are about 20,000 rows each... So I can tell >> which one is in (640792,12) and in (640799,1), they have the same dataset id >> value. Im assuming the missing ones in the middle will therefore be from the >> same dataset as well then... So I know which files did not get uploaded >> properly and I can re-upload them.... > > But can you tell whether there *were* any in the middle? It might be > worth counting the rows currently present for that dataset, then > reloading and counting again. Ah yes I see what you mean. I have already done a count(*) on them: bugasbase2=# select count(*) from mba_data_base where bioassay_id in (5176); count ------- 20000 (1 row) bugasbase2=# select count(*) from mba_data_base where bioassay_id in (5177); count ------- 19988 (1 row) bugasbase2=# select count(*) from mba_data_base where bioassay_id in (5203); count ------- 19928 And the two affected datasets (5177, 5203) are short on rows compared to a correct one (5176) Does this help identifying what went wrong? Thanks Adam -- This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean.