Dry Bean Dataset: Images of 13,611 grains of 7 different registered dry beans

iujjkfjdkkdkf · on April 3, 2021

When I click through I only see a 4MB zip file. Where is the dataset?

It's great to have this kind of data available. So much computer vision research is done with a small number of datasets, e.g. imagenet, and there are lots of gaps to fill in on how sell these techniques apply to industrially relevant data like this.

specialist · on April 3, 2021

With the pandemic, I'm finally eating a variety of beans, lentils, hearty rice, etc that I've always meant to. I love it.

I used to have a fuzzy logic rice cooker. I'd LOVE the same tech for legumes. I'm still in the trial and error phase.

scotty79 · on April 3, 2021

I love green lentils with just salt and butter.

It's very easy to cook too. It does not stick to the pot until you boil off all of the water which is easy to avoid.

virtuallynathan · on April 3, 2021

That’s some dedication to grains and beans.

uyt · on April 3, 2021

These sort of datasets is not surprising to see. I watch "how it's made" videos on youtube and a lot of factory processes seem to have computer vision based sorting for rejecting rocks, debris, etc.

m-ee · on April 3, 2021

One of my professors consulted for Del Monte. They had an automated vision system that could spot damaged corn kernels as they flew down the conveyer belt and precisely knock them off into a hopper with a puff of air.

abcanthur · on April 3, 2021

I have a small collection of bean-shaped rocks that I have found over time in my purchases of dry beans. Lesson, rinse and sort your beans! By sort I mean remove a 2-3 shrivelling beans per pound and a rock every 10 lbs.

crispyambulance · on April 3, 2021

You should see the other articles in the journal where it comes from! (https://www.sciencedirect.com/journal/computers-and-electron...)

It's cool to see people doing "real" things with software instead of serving the interests of surveillance capitalism.

faeyanpiraat · on April 3, 2021

Potatoes should be next

woliveirajr · on April 3, 2021

If you enter the site, there's a good description of the parameters of the data used.

And if you click to the parent directory, you'll find many, many folders with interesting thing.

P.S.: don't expect too much of the Adult folder

jcims · on April 3, 2021

This is where that other article about one pixel object detector could come into play. Build a detector that maps these good bean properties to the color green and then you have a passive identifier than can check a hundred thousand beans per second.

Put a few parallel lines of these up and you could check a truckload of beans in no time.

wittenator · on April 3, 2021

Could you link the article you mean? This sounds pretty interesting.

jcims · on April 3, 2021

https://phys.org/news/2021-03-classification-single-pixel-de...

https://news.ycombinator.com/item?id=26633468

billfruit · on April 3, 2021

Why only 7 type of beans(I haven't looked at the data sets themselves)?.

Frankly there are so many types of dry beans that these should be a informative comprehensive catalogue of information about them available somewhere, but I can't seem to find it.