An attempt to start another MOOC: Learn to code for data analysis from FutureLearn/OUUK. Interested in this one to perhaps start the migration from Perl to Python as my main vehicle for data munging, and also to check out the use of Jupyter notebooks as a learning environment.
- The approach – not unexpectedly – resonates. Very much like the approach I use in my courses, but done much better.
- The Jupyter notebooks work well for learning, could be useful in other contexts. Good example of the move toward a platform.
- The bit of Python I’ve seen so far looks good. The question is whether or not I have the time to come up to speed.
Intro video from a BBC journalist and now the software. Following a sequential approach, pared down interface, quite different from the standard, institutional Moodle interface. It does have a very visible and simple “Mark as complete” interface for the information. Similar to, but perhaps better than the Moodle book approach from EDC3100.
Option to install the software locally (using Anaconda) or use the cloud (SageMathCloud). Longer term, local installation would suit me better, but interested in the cloud approach. The instructions are not part of the course, seem to be generic instructions used for the OUUK.
Intro using a video, which on my connection was a bit laggy. SageMathCloud allows connection with existing accounts, up and going. Lots of warnings about this being a free service with degraded performance, and the start up process for the project is illustrating that nicely. Offline might be the better option. Looks like the video is set up for the course.
The test notebook loads and runs. That’s nice. Like I expected, will be interesting to see how it works in “anger”.
Python 3 is the go for this course, apparently.
Worried a little about installing another version of python. Hoping it won’t trash what I have installed, looks like it might not. Looks like the download is going to take a long time – 30 min+. Go the NBN!
Two notebooks a week: exercise and project. Encouraged to extend project. Exercises based on data from WHO, World Bank etc. Quizzes to check knowledge and use of glossaries. Comments/discussions on each page. Again embedded in the interface, unlike Moodle. Discussion threads expand into RHS of page.
Start with a question – point about data analysis illustrated with a personal story. Has prompts to expand and share related to that story. Encouraging connections.
Ahh, now the challenge of how to segue into first steps in programming and supporting the wide array of prior knowledge there must be. Variables and assignment, and a bit of Jupyter syntax. Wonder how the addition of Jupyter impacts cognitive load?
Variable naming and also starting to talk about syntax, errors etc. camelCase is the go apparently.
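A minimal sketch of what variables and assignment look like with camelCase names. The variable names and figures are made up for illustration and are not taken from the course notebooks. Worth noting that the general Python style guide (PEP 8) suggests snake_case for variable names, so camelCase looks like a course convention rather than a Python one.

```python
# Hypothetical figures, invented purely for illustration
deathsInCountry = 200
populationOfCountry = 1000000

# Assignment: the expression on the right is evaluated,
# then the result is stored under the name on the left
deathsPer100k = deathsInCountry * 100000 / populationOfCountry

print(deathsPer100k)
```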
And now for some coding. Mmm, the video is using Anaconda. Could see that causing some problems for some learners. And the discussion seems to illustrate aspects of that. Seems installing Anaconda was more of a problem. Hence the advantages of a cloud service if it is available.
Mmm, notebooks consist of cells. These can be edited and run. Useful possibilities.
Expressions. Again Jupyter adds its own little behavioural wrinkle that could prove interesting. If the last line in a cell is an expression, its value will be output. Can see that being a practice people try when writing standalone Python code.
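A small sketch of that wrinkle, using a made-up variable name. In a notebook cell the bare last line is displayed. Run as a standalone script, the same line is evaluated and quietly thrown away, so an explicit print is needed.

```python
total = 3 + 4

# As the last line of a notebook cell this value would be displayed.
# In a standalone script it is evaluated and then discarded.
total

# Outside the notebook the value has to be printed explicitly
print(total)
```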
Functions. Using established functions.
Onto a quiz. Comments on given answers include an avatar of the teaching staff.
Values and units. With some discussion to connect to real examples.
Pandas. The transition to working with large amounts of data. And another quiz, connected to the notebook. That’s a nice connection. Works well.
Range of pages an exercises looking at the pandas module. Some nice stuff here.
Do I bother with the practice project? Not now. But nice to see the notebooks can be exported.
Week 2 – Cleaning up our act
The BBC journalist giving an intro and doing an interview. Nodding head and all.
Ahh weather data. Becoming part of the lefty conspiracy that is climate change? :)
Comparison operators, with the addition of data frames. Which appears to be a very useful abstraction.
Bitwise operators. Always called these logical or boolean operators. Boolean isn’t given a lot of intro yet.
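A rough sketch of the difference in plain Python, with made-up variable names. On plain booleans `&` and `|` give the same answers as `and` and `or`. On integers they work bit by bit, which is where the name comes from. My assumption is that the course uses the bitwise name because pandas applies `&` and `|` element-wise to boolean columns, where `and` and `or` can't be used.

```python
isHot = True
isWet = False

# On plain booleans the bitwise and logical forms agree
bothBitwise = isHot & isWet
bothLogical = isHot and isWet
eitherBitwise = isHot | isWet

# On integers, & compares the operands bit by bit
bits = 0b1100 & 0b1010

# `and` does not combine bits: it returns one of its operands
# (the second one here, because the first is truthy)
logical = 0b1100 and 0b1010

print(bothBitwise, bothLogical, eitherBitwise, bits, logical)
```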
Ahh, the first bit of “don’t worry about the syntax, just use it as a template” advice. Looks like it’s using the equivalent of a hash that hasn’t yet been covered.
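For my own reference, a sketch of Python's equivalent of a Perl hash, the dict. The keys and values here are hypothetical and not the ones from the course template.

```python
# Hypothetical mapping from column name to type name; not the course's code
columnTypes = {'Date': 'datetime', 'Max Temp': 'float', 'Events': 'str'}

# Look up a value by key, much like $columnTypes{'Date'} in Perl
dateType = columnTypes['Date']

# Add a new key/value pair
columnTypes['Min Temp'] = 'float'

# Walk over the pairs, much like iterating over keys %columnTypes in Perl
for name, kind in columnTypes.items():
    print(name, kind)
```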