I would also recommend
Tom Mitchell's Machine Learning.
I have quite a lot of Haskell code for stuff in this book, which I'm currently
polishing for publication. (For example, I'm rather proud of my thirty-line
neural network code.)