To install click the Add extension button. That's it.

The source code for the WIKI 2 extension is being checked by specialists of the Mozilla Foundation, Google, and Apple. You could also do it yourself at any point in time.

4,5
Kelly Slayton
Congratulations on this excellent venture… what a great idea!
Alexander Grigorievskiy
I use WIKI 2 every day and almost forgot how the original Wikipedia looks like.
Live Statistics
English Articles
Improved in 24 Hours
Added in 24 Hours
What we do. Every page goes through several hundred of perfecting techniques; in live mode. Quite the same Wikipedia. Just better.
.
Leo
Newton
Brights
Milds

80 Million Tiny Images

From Wikipedia, the free encyclopedia

80 Million Tiny Images is a dataset intended for training machine learning systems.[1] It contains 79,302,017 32×32 pixel color images, scaled down from images extracted from the World Wide Web in 2008 using automated web search queries on a set of 75,062 non-abstract nouns derived from WordNet. The words in the search terms were then used as labels for the images.[2] The researchers used seven web search resources for this purpose: Altavista, Ask.com, Flickr, Cydral, Google, Picsearch and Webshots.[2]

The 80 Million Tiny Images dataset was retired from use by its creators in 2020,[3] after a paper by researchers Abeba Birhane and Vinay Prabhu found that some of the labeling of several publicly available image datasets, including 80 Million Tiny Images, contained racist and misogynistic slurs which were causing models trained on them to exhibit racial and sexual bias.[4][5] Birhane and Prabhu also found that the dataset contained a number of offensive images.[5]

Following the release of the paper, the dataset's creators removed the dataset from distribution, and requested that other researchers not use it for further research and to delete their copies of the dataset.[3]

The CIFAR-10 dataset uses a subset of the images in this dataset, but with independently generated labels.[6]

YouTube Encyclopedic

  • 1/3
    Views:
    775
    5 544
    57 075
  • Learn an object recognition system with millions of parameters from a small number of training sets
  • Jensen Huang: ''I Regret Creating This Computer, It Started Acting On It's Own...''
  • Bizarre Discoveries Scientists Can’t Explain

Transcription

References

  1. ^ Quach, Katyanna (1 July 2020). "MIT apologizes, permanently pulls offline huge dataset that taught AI systems to use racist, misogynistic slurs". www.theregister.com. Retrieved 2020-07-02.
  2. ^ a b Torralba, Antonio; Fergus, Rob; Freeman, William T. (November 2008). "80 million tiny images: a large data set for nonparametric object and scene recognition" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 30 (11): 1958–1970. doi:10.1109/TPAMI.2008.128. ISSN 1939-3539. PMID 18787244. S2CID 7487588.
  3. ^ a b "80 Million Tiny Images". groups.csail.mit.edu. Retrieved 2020-07-02.
  4. ^ Ustik, Georgina (2020-07-01). "MIT removes huge dataset that teaches AI systems to use racist, misogynistic slurs". Neural | The Next Web. Retrieved 2020-07-02.
  5. ^ a b Prabhu, Vinay Uday; Birhane, Abeba (2020-06-24). "Large image datasets: A pyrrhic win for computer vision?". arXiv:2006.16923 [cs.CY].
  6. ^ A. Krizhevsky. Learning multiple layers of features from tiny images. Tech Report, 2009. University of Toronto


This page was last edited on 23 May 2024, at 08:48
Basis of this page is in Wikipedia. Text is available under the CC BY-SA 3.0 Unported License. Non-text media are available under their specified licenses. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc. WIKI 2 is an independent company and has no affiliation with Wikimedia Foundation.