Either it wasn't great timing, or it was actually very good timing.
At a stage in its life when it is gradually getting more heavily questioned on the quality of data it is producing, Statistics NZ's admission of what I think was a truly calamitous error relating to food prices saw our official statistics provider turning the spotlight brightly on itself.
(The nub of the food price issue appears to have been that "due to human error" about 13 days worth of January data from one of the supermarket groups just simply wasn't included. Incredible.)
An official statistics provider is a very easy target. And for this reason (well, I would suspect it's the reason), economists and the like don't tend to publicly question, raise doubts, or criticise Stats NZ and the information it provides.
I can imagine that providing statistical information could easily feel like a thankless task. Produce spot-on data that nails the subject matter and people say, yeah fine. But produce something that looks like it has glitches and you become open to criticism.
So, to openly criticise Stats NZ is to seem, well, mean. Let's face it we are talking about people with feelings and professional pride. How do you feel if someone criticises your work?
All I can say though is, that as someone who has used the data provided by Stats NZ on a very regular basis, I have become increasingly frustrated and, yes, doubtful, about what is being produced.
One area of Stats NZ's work that has received a reasonable amount of discussion is its GDP figures. This is important stuff. Any number of people and organisations need to know what the economy is doing.
A key user of the information is the Reserve Bank, since it needs to know what its monetary policy - its Official Cash Rate settings - are doing to the performance of our economy.
Since a very bad mess-up in May 2024 when the RBNZ misread the tealeaves completely and gave a super 'hawkish' statement - at about the time our economy was actually starting to fall of the cliff - it's been noticeable that there's been much more dependence by the central bank on 'high frequency' data, which includes such things at the BNZ-Business NZ Performance of Manufacturing Index and and Performance of Services Index, among other things.
The outcome has been that the RBNZ hasn't messed up again, but it's not a satisfactory situation. Our central bank SHOULD be able to trust the official data sources.
Vital decision making processes depend on this
I use this example to stress that the information produced by Stats NZ is not just 'nice to have'. In many instances it is information that people are relying on to make big decisions. If the information is not good, decisions made might not be good - and that could have adverse consequences for us.
In May 2024 the RBNZ was talking about the possibility of interest rate hikes - when it should very much have been thinking about cuts. The subsequent huge U-turn in commentary by the bank in July, followed by the first OCR cut in August 2024 were based on what the central bank was gleaning from high frequency data - not from what the GDP data was saying, which at that point was missing the very sharp recession we had fallen into.
That's one negative example.
Here's a (generally) positive one. The Selected Price Indexes (SPI) monthly publication began in 2023 in response to a push, particularly from the RBNZ, for more timely inflation information - given that we as a country are an outlier in not having a monthly inflation read. To have just four definitive snapshots of inflation in a year is nowhere near adequate.
Anyway, the SPI started in 2023 and, to be honest, it all looked a bit rushed and the initial iteration was short on commentary and was just something of a data dump. BUT from the get-go there's no doubt this has been a very useful addition to the information provided by Stats NZ. And it seems to have been done by our data provider without any obvious additional resources made available. And that's probably important to bear in mind, given this week's events.
It seems to me that economists' forecasts of the future level of the Consumers Price Index, the official measure of inflation, have been a lot nearer the mark since the SPI was introduced. The economists like the SPI (which covers about 47% of the CPI's ingredients), because it covers some of the more volatile ingredients. Consider the fact that domestic air fares rocked 12.8% in the month of February. Economists can now factor that into their understanding of where the CPI may sit for the March quarter.
The Government of course tipped some more money in for Stats NZ in the Budget last year in order for the development of a monthly CPI measure. That's going to start next year and will be welcome.
An independent stock take
But I didn't really like the ad hoc way in which that was done. "Here's a fiver, go get us a monthly CPI." And indeed, I would certainly say the way in which the SPI was introduced was pretty ad hoc as well. And did that in any way contribute to the awful error Stats NZ admitted to this week?
I think it would be helpful, to say the least, to have an independent stock take of what it is Stats NZ produces, why it produces it, who it is for and whether it is fit for purpose.
The Government of course would have to play a key role. And obviously there's plenty of room for debate about which work Stats NZ does is particularly 'important'. I'm biased - I think the economic stuff is crucial. But others would argue strongly about the more socially focused work.
I think one vital area of focus for any inquiry should be to examine data that others - such as the RBNZ - lean on for making key decisions that have a key impact on NZers.
Right at the moment we definitely have a situation where the likes of the RBNZ are looking elsewhere for guidance. And if we end up in that situation, why spend around $250 million a year for a statistics provider at all? And no, I'm not suggesting getting rid of it. The question is rhetorical. We need to get better bang for the buck and, as far as I'm concerned, if one way of doing that is actually giving the stats provider MORE bucks, then so be it.
I think it is that important.
We need good official information. I think our official information has gone backwards in terms of quality and reliability in recent years. This trend needs to be arrested. An all-embracing independent assessment of what Stats NZ is, what it does, how it does it, and what it should be doing is needed. Sooner rather than later.
12 Comments
The question I would ask first is why the quality of statistics has declined?
One would think that with the improved (?) connectivity and computing power that statistics should be both more timely and accurate.
Exhibit A the last 2 censuses
Exhibit B deciding no more traditional census
https://www.rnz.co.nz/news/in-depth/564560/the-traditional-census-has-b…
And apparently no public accountability for the department.
How badly do things have to get screwed up before there are consequences?
Maybe contract out the census to someone with a track record for data collection and good statistical performance. Like the Singaporean public service. Or Google, for that matter.
The problem is simple. It's Wellington. A world of it's own, where performance is not judged by doing an effective job.
Wonder how long before AI is greatly faster, easier, more accurate, and more independent and objective (already possible now?). There's also far too much gate keeping about what's in the 'basket'.
Wonder how long before AI is greatly faster, easier, more accurate, and more independent and objective (already possible now?). There's also far too much gate keeping about what's in the 'basket'.
AI depends on data, but it is conceptually and practically distinct from data collection itself. And more importantly, as I often say, "garbage in, garbage out". AI is not a genie lamp.
Key distinctions between data collection and AI:
Data collection is the process of gathering, organizing, labeling, and storing information so that it can be used later. It’s about pipelines, formats, quality control, bias checks, and governance.
AI is a set of models and algorithms that learn patterns from that prepared data and then generate predictions, classifications, or content. It is a computational capability built on top of data, not the act of acquiring it.
And we have attempts to do better than the bureaucrats; for ex, Truflation in the U.S.
I lost faith long ago in the data that was coming out of stats NZ. Governments of all stripes are of the mindset; accentuate the positive, eliminate the negative. Talk it up, seasonally adjust and massage every statistic to the positive, and if we are positive enough the gullible kiwi will think its all true, start spending, and like magic the economy will improve. Every statistic and every prediction from economists and commentators alike have been spectacularly wrong for the last 5 years. A recent case in point was the RBNZ chief ecomist claing inflation will fall to 2%.( before Iran) running contrary to every indicator. Also yesterday the government are claiming the economy will grow 2.5% this year.
We need accurate measurements of what's going on, but what is being collected needs to be examined for necessity, and it needs to be done by mining existing collected data so there's not yet more form-filling overhead. It also needs to be kept apolitical, which is a lot harder but critical to credibility.
"Forgetting" to add in a great wodge of data kind of defies belief and suggests innovations, like using a checklist, are still to be adopted by Stats NZ. Not a good look.
OK I have some inside info here as I know people working in the org.
There is a massive amount of work in Stats NZ either been started (monthly CPI) or about to kick off which will re-platform their current legacy (often 30yo) systems into a modern platform. Which will be easier to manage/run with far better testing capabilities, far less people required. But it requires time and effort and a lot of projects to move critical stats over to this platform, which also comes with the requirement for better built data products, making them more reliable, timely and accurate. It also comes with a culture shift within the org towards reusability, standardisation and modularisation of core functions. For those asking AI questions, the modern platform has AI built into its core, they will need to certify the functions of it before models are used in anger (i.e. we don't want data breaches or our census data, for instance, to be used as training data for AI models, exposing our data).
So the issues written about in this article are being addressed. You have to recognise where Stats has come from and the issues it has. Stats has been funded via governments with short term funding models for decades. Basically a government decides they need XYZ stat now and then funding just that project. Which has meant they have 20 different systems all providing different functions, with differing levels of reliability/accuracy etc. And then run that new Stat forever on the same platform, never upgrading it (because next year the government wants stat ABC). There is likely a need to change their funding model to change the reliability of the stats they produce as a whole.
Stats NZ is moving toward a modern cloud data stack centred on Azure Databricks and .Stat Suite, alongside its existing Datahub and IDI environments.
As the article below explains,
In January, Statistics NZ told incoming minister Shane Reti its key operational platform, the Integrated Data Infrastructure (IDI), was struggling and in need of significant renewal.
The IDI was built on old, increasingly unstable technology and was exceeding the capacity of its original 2011 prototype design, the agency warned.
“It is often difficult to access, data ingestion is clunky and slow, and Stats NZ’s ability to refresh and update the ‘central linking spine’ underpinning the IDI is limited,” Stats NZ reported.
https://www.reseller.co.nz/article/4011273/spark-selected-to-deliver-ne…
https://www.stats.govt.nz/integrated-data/integrated-data-infrastructur…
Correct. I am in the know about this and its going to be revolutionary for them moving to a good data platform. The current one is garbage and increasingly so, as anyone who has used it as a researcher will also tell you. The data on the IDI is fantastic, the envy of many nations (in fact researchers come from all over the world to use the dataset). It's method to get the data set though is... antiquated, expensive and increasingly difficult to administer. Add to that the shared infrastructure the researchers have and you have a system that is also quite fragile.
Too too easy. Blame the Government. But it's not so.
The civil servants live in a much more benign and stable environment than most enterprises. They have every facility to keep up, but fumble it. Blame the Civil servants I think.
We welcome your comments below. If you are not already registered, please register to comment
Remember we welcome robust, respectful and insightful debate. We don't welcome abusive or defamatory comments and will de-register those repeatedly making such comments. Our current comment policy is here.