Book: The Emerging Science of Machine Learning Benchmarks

(mlbenchmarks.org)

128 points | by jxmorris12 4 days ago

6 comments

erdemo 1 hour ago
https://mlbenchmarks.org/
This is the actual link to reach the book. There is no navigation link back to the index on the shared link.
[-]
- NeutralForest 1 hour ago
  Thanks, I had to do the same.
loveparade 12 hours ago
Very cool book. I think a reason why ML has seen so much progress despite benchmark overfitting/abuse is that results are "regularized" by real world applications and the Lindy effect. Methods, or research, that abuse benchmarks aren't adopted by follow-up research so they tend not to survive. And they aren't adopted because people try them but then find out that they don't generalize to other/newer benchmarks. So the system works not because of specific benchmarks, but because of how the community as a whole deals with benchmarks.
pakapica 1 hour ago
added to my reading list :)
trostaft 12 hours ago
If I'm recall correctly, this was also a keynote at MDS24? That was also a great talk, Hardt is an excellent speaker.
lazrgatr 15 hours ago
A little rule I live by is that if Moritz Hardt writes it, I will read it
[-]
- TrainedMonkey 12 hours ago
  Why is that?
  [-]
  - kaycey2022 8 hours ago
    You honestly don't know of Moritz Hardt?
    [-]
    - fxwin 57 minutes ago
      Why so snarky? I also didn't know who he was:
      I'm a director at the Max Planck Institute for Intelligent Systems. Prior to joining the institute, I was Associate Professor for Electrical Engineering and Computer Sciences at the University of California, Berkeley. My research contributes to the scientific foundations of machine learning and algorithmic decision making with a focus on social questions.[0]
      Also simply knowing of him doesn't answer the question.
      [0] https://mrtz.org/
    - khafra 52 minutes ago
      xkcd 1053, my friend.
salberts 7 hours ago
Read the preface.
1. It sounds like this book can be summarized in a practical blog post or a series of posts
2. Is using the term crisis so many times really necessary?