Re: Find min year and min value

Stefan Schwarzer <stefan.schwarzer@xxxxxxxxxxxx> · Thu, 4 Oct 2007 13:41:20 +0200

Having 500 statistical global national variables for about 240  
countries/territories. Need to do regional aggregations, per  
Capita calculations and some completeness computations on-the-fly.

id_variable   |    year    |    value     |    id_country

Both Steve and I have given you alternatives and reasons for  
choosing alternative schema. You haven't provided any additional  
information to really help us guide you in any particular direction  
from what we already have. For example, in the section from me  
which you quoted above, I wrote that this schema may be appropriate  
if the "value" column values are actually all of the same type  
(e..g, all currency amounts, all masses, all counts). You haven't  
said whether or not this is the case. We can't read your mind :)

Again, one thing that would help is if you use a more descriptive  
column name than "value" that gives an indication of what *kind* of  
values are in the column.

I thought (and did ask) about the possibility to put nevertheless  
- with the new table design - the variables into different tables,  
but nobody really got my on a track for that.

Steve first suggested it and I provided an example of what that  
would look like (using "gdp" and "fish_catches" tables) in the same  
post you quoted from above.

http://archives.postgresql.org/pgsql-general/2007-10/msg00108.php

Is this not what you mean?

Hmm... sorry that I didn't give you more info. I will see it I can do  
better now.

So, as mentioned above, we have 500 different variables (such as GDP,  
Fish Catch, Population, Forest Cover, Fertility Rate etc.). Many of  
them have indeed different units (Metric Tons, Thousand People, Sqkm,  
Persons per Year etc.). And many do cover different time periods  
(1970-2005; 2000-2002; 1970-75, 1975-80,..., 2000-05; ...).

So, "value" is something general - it can be (measured in) (thousand)  
Dollars, (Million) People, (Hundred) Hectares etc...

We update the variables something like once a year or every two  
years, in an ongoing manner. Updating means especially adding new  
years (as they are being published by World Bank, UNSD, etc.). So, no  
real "hard core" editing/adding, I'd say. A slow, but steady progress.

Queries, as mentioned too, are mainly simple "SELECt * FROM  
specific_variable_like_gdp". But as well on-the-fly aggregations to  
subregions and regions (such as West Africa, and Africa). And some  
calculations which enables to indicate the "completeness" of the  
aggregations (parameters being used are "Number of countries included  
in aggregation in respect to Number of Countries, Total Population,  
Land Area, GDP).

Does this help?

Thanks anyway already for all your feedback. Your really great people  
out there!!

Best wishes,

Stef

---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster