Book: The Emerging Science of Machine Learning Benchmarks

(mlbenchmarks.org)

128 points | by jxmorris12 4 days ago

6 comments

  • erdemo 1 hour ago
    https://mlbenchmarks.org/

    This is the actual link to reach the book. There is no navigation link back to the index on the shared link.

  • loveparade 12 hours ago
    Very cool book. I think a reason why ML has seen so much progress despite benchmark overfitting/abuse is that results are "regularized" by real world applications and the Lindy effect. Methods, or research, that abuse benchmarks aren't adopted by follow-up research so they tend not to survive. And they aren't adopted because people try them but then find out that they don't generalize to other/newer benchmarks. So the system works not because of specific benchmarks, but because of how the community as a whole deals with benchmarks.
  • pakapica 1 hour ago
    added to my reading list :)
  • trostaft 12 hours ago
    If I'm recall correctly, this was also a keynote at MDS24? That was also a great talk, Hardt is an excellent speaker.
  • lazrgatr 15 hours ago
    A little rule I live by is that if Moritz Hardt writes it, I will read it
    • TrainedMonkey 12 hours ago
      Why is that?
      • kaycey2022 8 hours ago
        You honestly don't know of Moritz Hardt?
        • fxwin 57 minutes ago
          Why so snarky? I also didn't know who he was:

          I'm a director at the Max Planck Institute for Intelligent Systems. Prior to joining the institute, I was Associate Professor for Electrical Engineering and Computer Sciences at the University of California, Berkeley. My research contributes to the scientific foundations of machine learning and algorithmic decision making with a focus on social questions.[0]

          Also simply knowing of him doesn't answer the question.

          [0] https://mrtz.org/

        • khafra 52 minutes ago
          xkcd 1053, my friend.
  • salberts 7 hours ago
    Read the preface.

    1. It sounds like this book can be summarized in a practical blog post or a series of posts

    2. Is using the term crisis so many times really necessary?