Hacker News new | past | comments | ask | show | jobs | submit login
Dry Bean Dataset: Images of 13,611 grains of 7 different registered dry beans (uci.edu)
63 points by polm23 on April 3, 2021 | hide | past | favorite | 14 comments



When I click through I only see a 4MB zip file. Where is the dataset?

It's great to have this kind of data available. So much computer vision research is done with a small number of datasets, e.g. imagenet, and there are lots of gaps to fill in on how sell these techniques apply to industrially relevant data like this.


With the pandemic, I'm finally eating a variety of beans, lentils, hearty rice, etc that I've always meant to. I love it.

I used to have a fuzzy logic rice cooker. I'd LOVE the same tech for legumes. I'm still in the trial and error phase.


I love green lentils with just salt and butter.

It's very easy to cook too. It does not stick to the pot until you boil off all of the water which is easy to avoid.


That’s some dedication to grains and beans.


These sort of datasets is not surprising to see. I watch "how it's made" videos on youtube and a lot of factory processes seem to have computer vision based sorting for rejecting rocks, debris, etc.


One of my professors consulted for Del Monte. They had an automated vision system that could spot damaged corn kernels as they flew down the conveyer belt and precisely knock them off into a hopper with a puff of air.


I have a small collection of bean-shaped rocks that I have found over time in my purchases of dry beans. Lesson, rinse and sort your beans! By sort I mean remove a 2-3 shrivelling beans per pound and a rock every 10 lbs.


You should see the other articles in the journal where it comes from! (https://www.sciencedirect.com/journal/computers-and-electron...)

It's cool to see people doing "real" things with software instead of serving the interests of surveillance capitalism.


Potatoes should be next


If you enter the site, there's a good description of the parameters of the data used.

And if you click to the parent directory, you'll find many, many folders with interesting thing.

P.S.: don't expect too much of the Adult folder


This is where that other article about one pixel object detector could come into play. Build a detector that maps these good bean properties to the color green and then you have a passive identifier than can check a hundred thousand beans per second.

Put a few parallel lines of these up and you could check a truckload of beans in no time.


Could you link the article you mean? This sounds pretty interesting.



Why only 7 type of beans(I haven't looked at the data sets themselves)?.

Frankly there are so many types of dry beans that these should be a informative comprehensive catalogue of information about them available somewhere, but I can't seem to find it.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: