talks
Data source: https://csvconf.com/
1 row
where abstract = "While algorithms and computing power get all the press, the special sauce behind many recent machine learning breakthroughs are meticulously labeled training data. Developing and maintaining these data sets as public goods is both an art and a science. In this talk I'll present a new set of best practices gleaned from interview with ~20 data set builders, maintainers, and funders. Topics include: encouraging collaboration between rival data teams; finding and addressing ethical issues with crowd labeling; launching competitions to spur data set use; and revenue generation models for sustainability.", datetime = "2019-05-08T16:00:00" and day = "May 8 2019" sorted by rowid
| Link | rowid | title | speaker | time | day | room | url | datetime | abstract | image |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 29 | 29 | How to Feed Your Robot: Building and Maintaining Open Machine Learning Datasets | Evan Tachovsky | 4:00 PM | May 8 2019 | Daisy Bingham Room | https://csvconf.com/speakers/#evan-tachovsky | 2019-05-08T16:00:00 | While algorithms and computing power get all the press, the special sauce behind many recent machine learning breakthroughs are meticulously labeled training data. Developing and maintaining these data sets as public goods is both an art and a science. In this talk I'll present a new set of best practices gleaned from interview with ~20 data set builders, maintainers, and funders. Topics include: encouraging collaboration between rival data teams; finding and addressing ethical issues with crowd labeling; launching competitions to spur data set use; and revenue generation models for sustainability. | https://csvconf.com/img/speakers-2019/etachovsky.jpg |
CREATE TABLE [talks] (
[title] TEXT,
[speaker] TEXT,
[time] TEXT,
[day] TEXT,
[room] TEXT,
[url] TEXT,
[datetime] TEXT,
[abstract] TEXT,
[image] TEXT
)
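
The schema above can be tried out locally. What follows is a minimal sketch, assuming Python's built-in `sqlite3` module and an in-memory database. It recreates the `talks` table, loads the single row shown on this page, and repeats the `datetime` and `day` filter, sorted by `rowid`. The helper names `build_demo_db` and `find_talks` are hypothetical, the abstract is shortened to its first sentence for brevity, and the query approximates the page's filter rather than reproducing the exact SQL it runs.

```python
import sqlite3

# Schema copied from the page.
SCHEMA = """
CREATE TABLE [talks] (
  [title] TEXT,
  [speaker] TEXT,
  [time] TEXT,
  [day] TEXT,
  [room] TEXT,
  [url] TEXT,
  [datetime] TEXT,
  [abstract] TEXT,
  [image] TEXT
)
"""


def build_demo_db():
    """Create an in-memory database holding the one row shown on the page."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    conn.execute(
        "INSERT INTO talks (rowid, title, speaker, time, day, room, url, "
        "datetime, abstract, image) VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
        (
            29,
            "How to Feed Your Robot: Building and Maintaining Open "
            "Machine Learning Datasets",
            "Evan Tachovsky",
            "4:00 PM",
            "May 8 2019",
            "Daisy Bingham Room",
            "https://csvconf.com/speakers/#evan-tachovsky",
            "2019-05-08T16:00:00",
            # Abstract shortened to its first sentence for this sketch.
            "While algorithms and computing power get all the press, the "
            "special sauce behind many recent machine learning breakthroughs "
            "are meticulously labeled training data.",
            "https://csvconf.com/img/speakers-2019/etachovsky.jpg",
        ),
    )
    return conn


def find_talks(conn, datetime_value, day_value):
    """Return (rowid, title, speaker, room) for talks matching both filters."""
    cursor = conn.execute(
        "SELECT rowid, title, speaker, room FROM talks "
        "WHERE datetime = ? AND day = ? ORDER BY rowid",
        (datetime_value, day_value),
    )
    return cursor.fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    for row in find_talks(demo, "2019-05-08T16:00:00", "May 8 2019"):
        print(row)
```

Passing the values as bound parameters (`?`) avoids quoting problems with long text values such as the abstract, which contains commas, semicolons and apostrophes.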