Time to refine the visualisation of students by postcodes started earlier this week. Have another set of data to work with.
- Remove the identifying data.
- Clean the data.
I had to remind myself of the options for the sort command – losing it. The following give some idea of the mess.
:1,$s/"\* Sport, Health & PE+Secondry.*"/HPE_Secondary/
:1,$s/Health & PE Secondary/HPE_Secondary/
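For repeat runs, the same clean-up could be scripted rather than typed into vim each time. The following is only a rough sketch: the two patterns are copied from the substitutions above (with `+` escaped, since Python treats it as a quantifier), and the function name `normalise_program` is made up for illustration. It assumes the labels sit inside straight double quotes in the export.

```python
import re

# Patterns mirroring the two vim substitutions above. The real export may
# contain other variants; these are the only two covered here.
PATTERNS = [
    re.compile(r'"\* Sport, Health & PE\+Secondry.*"'),
    re.compile(r'Health & PE Secondary'),
]


def normalise_program(line: str) -> str:
    """Replace known variants of the HPE secondary label with one token."""
    for pattern in PATTERNS:
        # count=1 matches vim's behaviour without the g flag:
        # only the first match on each line is replaced.
        line = pattern.sub("HPE_Secondary", line, count=1)
    return line
```

Applying it to each line of the file before sorting should give the same result as the vim commands, provided no other spellings of the program name turn up.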
- Check columns
Relying on a visual check in Excel – also to get a better feel for the data.
- Check other countries
Unlike the previous visualisation, the plan here is to recognise that we actually have students in other countries. The problem is that the data I’ve been given doesn’t include country information, so I have to enter it manually. For one of the programs, that gives the following.
4506 Australia
8 United Kingdom
3 Vietnam
3 South Africa
3 China
2 Singapore
2 Qatar
2 Japan
2 Hong Kong
2 Fiji
2 Canada
1 United States of America
1 Taiwan
1 Sweden
1 Sri Lanka
1 Philippines
1 Papua New Guinea
1 New Zealand
1 Kenya
1 Ireland
And all good.
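A per-country tally like the one above can also be produced in a few lines of Python once the country has been entered. This is a sketch under assumptions: the column name `country` and the function name `tally_countries` are hypothetical, and the rows are assumed to be dictionaries such as those produced by `csv.DictReader`.

```python
from collections import Counter


def tally_countries(rows, column="country"):
    """Return (country, count) pairs, largest count first.

    `rows` is any iterable of dict-like records, e.g. a csv.DictReader.
    `column` is a hypothetical name for the manually entered country field.
    """
    counts = Counter(row[column].strip() for row in rows)
    return counts.most_common()
```

Counting from the data rather than by eye also makes misspelt country names stand out, since each spelling shows up as its own row.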