10 Years of Automated Category Classification for Product Data
Johannes Knopp
Artificial Intelligence, Deep Learning, Data Science, Infrastructure, Machine Learning, Data Engineering

10 years ago we built a classifier for categorizing product data. Let's take a journey through the lessons we learned over the years about building, maintaining, and modernizing the category classifier.

Airflow: your ally for automating machine learning and data pipelines
Enrica Pasqua, Bahadir Uyarer
Big Data, Infrastructure, Machine Learning, Data Engineering

Automate your machine learning and data pipelines with Apache Airflow

Applying deployment oriented mindset for building Machine Learning models
Marianna Diachuk
Data Science, Machine Learning

Getting stuck for months trying to deploy the model and fighting with data inconsistency and bugs? This talk will introduce the way to build the development process with deployment in mind.

Are you sure about that?! Uncertainty Quantification in AI
Florian Wilhelm
Artificial Intelligence, Deep Learning, Data Science, Machine Learning, Science

Are you sure about that?! Uncertainty Quantification in AI helps you to decide if you can trust a prediction or rather not.

Automated Feature Engineering and Selection in Python
Franziska Horn
Data Science, Machine Learning, Science, Data Engineering, Statistics

Automated feature engineering and selection in Python with the autofeat library.

Automating feature engineering for supervised learning? Methods, open-source tools and prospects.
Thorben Jensen
Artificial Intelligence, Algorithms, Data Science, Machine Learning, Data Engineering

How to automate the labor-intensive task of feature engineering for Machine Learning? This talk gives an overview on methods, presents open-source libraries for Python, and compares their performance.

Avoiding ML FOBO
Rachel Berryman, Dânia Meira
Algorithms, Business & Start-Ups, Data Science, Machine Learning

Is FOBO (Fear Of Better Option) preventing you from delivering practical ML products? Join 'Avoiding ML FOBO' to learn tips for cutting through the hype.

Build a Machine Learning pipeline with Jupyter and Azure
Daniel Heinze
Computer Vision, Deep Learning, DevOps, IDEs/ Jupyter, Machine Learning, APIs, Python

Build a Machine Learning pipeline with Jupyter and Azure: https://notebooks.azure.com/Starlord/projects/pycon-ml-jupyter-azure

CANCELLED: First steps in Julia
Felicia Burtscher
Artificial Intelligence, Algorithms, Deep Learning, Data Science, Networks, Machine Learning, Science

#julia_introduction. why julia is better than python. machine learning made eady with juliabox.

Creating an Interactive ML Conference Showcase
Harald Bosch
Artificial Intelligence, Computer Vision, Deep Learning, IDEs/ Jupyter, Machine Learning

Build a ML showcase using #transferlearning, #keras, #WebRTC, #python

Current affairs, updates, and the roadmap of scikit-learn and scikit-learn-contrib
Adrin Jalali
Artificial Intelligence, Community, Code-Review, Machine Learning

an update on recent scikit-learn changes, current affairs, and the roadmap

Data Literacy for Managers
Alexander CS Hendorf
Artificial Intelligence, Business & Start-Ups, Data Science, Machine Learning, Use Cases

Artificial Intelligence need to be better understood in enterprises. Close the communications gap between engineers and management. Making data litteracy happen in your organisation.

Decentralized and Privacy-Preserving ML via TensorFlow Federated
Peter Kairouz, Amlan Chakraborty
Artificial Intelligence, Deep Learning, Data Science, Machine Learning, Data Engineering

Meet TensorFlow Federated: an open-source framework for machine learning and other computations on decentralized data.

Deep Learning for Healthcare with PyTorch
Valerio Maggio
Artificial Intelligence, Deep Learning, Machine Learning, Science

This tutorial provides a general introduction to the PyTorch Deep Learning framework with specific focus on Deep Learning applications for Precision Medicine and Computational Biology.

Detecting and Analyzing Solar Panels in Switzerland using Aerial Imagery
Martin Christen
Big Data, Computer Vision, Deep Learning, Data Science, Machine Learning, Visualisation

Detecting Solar Panels from aerial imagery using #Python #DeepLearning #CrowdSourcing

Embrace uncertainty! Why to go beyond point estimators for valuable ML applications
Stefan Maier
Algorithms, Data Science, Machine Learning, Statistics

Usually, uncertainties of Machine Learning predictions are just regarded as a sign of poor prediction accuracy or as a consequence of lacking input features. This talk illustrates how modeling uncertainties can improve ML based decisions.

Equivariance in CNNs: how generalising the weight-sharing property increases data-efficiency
Marysia Winkels
Artificial Intelligence, Algorithms, Computer Vision, Deep Learning, Data Science, Machine Learning, Science

Equivariance in CNNs: how generalising the weight-sharing property increases data-efficiency

Fairness in decision-making with AI: a practical guide & hands-on tutorial using Aequitas
Pedro Saleiro
Data Science, Machine Learning, Use Cases

In this tutorial, we are going to deep dive into algorithmic fairness, from metrics and definitions to practical case studies, including bias audits using Aequitas (http://github.com/dssg/aequitas) in real policy problems where AI is being used

Fighting fraud: finding duplicates at scale
Alexey Grigorev
Data Science, Infrastructure, Machine Learning, Data Engineering

Fight fraudsters at scale: use machine learning to find duplicates in 10 million ads daily

Friend or Foe: Comparison of R & Python in Data Wrangling & Visualisation
Yuta Kanzawa
Data Science, Machine Learning, Visualisation, Statistics

R and Python are different in community and as language. Still, comparing them in their common fields such as data wrangling and visualisation, useRs and Pythonistas will deepen mutual understanding.

Gaussian Process for Time Series Analysis
Dr. Juan Orduz
Algorithms, Data Science, Machine Learning, Statistics

Gaussian process for regressions problems and time series forecasting

Gaussian Progress
Vincent Warmerdam
Artificial Intelligence, Algorithms, Data Science, IDEs/ Jupyter, Machine Learning, Statistics

gaussian progress. it's meta, but also the most normal conference title this year!

Get to grips with pandas and scikit-learn
Sandrine Pataut
Algorithms, Data Science, Machine Learning

Get to grips with pandas and scikit-learn: a first contact with data science using python

Hidden Markov Models for Chord Recognition - Intuition and Applications
Caio Miyashiro
Algorithms, Data Science, Machine Learning, Statistics

Come check out Caio's workshop on music+programming+stats on PyData

Hide Code, Minimize Dependencies, Boost Performance - The PyTorch JIT
Tilman Krokotsch
Artificial Intelligence, Deep Learning, Data Science, Machine Learning

PyTorch makes developing, training and debugging deep neural networks convenient. Learn how to export your trained model using its just-in-time (JIT) compiler to hide your network architecture, minimize code dependencies and use it in the C++ API. It's getting faster, too!

Interpretable Machine Learning: How to make black box models explainable
Alexander Engelhardt
Data Science, Machine Learning

In this talk, we'll find out how to interpret the predictions of otherwise black-box models.

Law, ethics and machine learning – a curious ménage à trois
Dr. Benjamin Werthmann
Artificial Intelligence, Big Data, Machine Learning

Find out and discuss how law and ethics should be included in a framework for machine learning that protects creativity and effectiveness

Lessons Learned as a Product Manager in Data Science
Tereza Iofciu
Business & Start-Ups, Data Science, Machine Learning

How many languages does the data science product manager need to speak?

Leveraging ML to obtain fine-grained (yet reliable) causal estimates from A/B tests and experiments
Maximilian Eber
Data Science, Machine Learning, Science, Statistics

How to use machine learning to evaluate randomised experiments and A/B tests

Leveraging the advantages of Bayesian Methods to build a data science product using PyMC3
Korbinian Kuusisto
Algorithms, Business & Start-Ups, Data Science, Machine Learning, Science, Statistics

How can one leverage the power of Bayesian methods to build a successful data science product?

Loss Function Theory 101
David Wölfle
Artificial Intelligence, Algorithms, Deep Learning, Data Science, Machine Learning, Statistics

This talk covers the theoretical background behind two common loss functions, mean squared error and cross entropy, including why they are used for machine learning at all, and what limitations you should keep in mind.

Machine learning with little data - from digital twin to predictive maintenance
Andreas Hantsch
Deep Learning, Machine Learning, Science

This talk is about the coupling of a digital twin model and a machine learning predictive maintenance algorithm in order to be able to detect anomalies in the operation of a not well-known hardware system.

Managing the end-to-end machine learning lifecycle with MLFlow
Tobias Sterbak
Data Science, Infrastructure, Machine Learning, Data Engineering

How to manage the end-to-end machine learning lifecycle with MLflow.

Practical DevOps for the busy data scientist
Dr. Tania Allard
Algorithms, Big Data, Data Science, DevOps, Machine Learning

Devops for the busy data scientist: learn how to leverage these practices to improve your workflows

Privacy-preserving Machine Learning for text processing
Sarah Diot-Girard
Artificial Intelligence, Data Science, Natural Language Processing, Machine Learning

Data privacy can be tricky when doing Natural Language Processing, join us to explore the different strategies you can use to keep your user data safer!

Production-level data pipelines that make everyone happy using Kedro
Yetunde Dada
Data Science, DevOps, Machine Learning, Data Engineering

Learn how easy it is to apply software engineering principles to your data science and data engineering code. Expect an overview of Kedro, a library that implements best practices for data pipelines with an eye towards productionizing ML models.

Python-Powered OSINT! Modernising Open Source Intelligence for Investigating Disinformation
Chiin-Rui Tan, Dare Imam-Lawal
Artificial Intelligence, Algorithms, Data Science, Machine Learning, Web, Data Mining / Scraping, Use Cases

Socio-Technical Python for OSINT! The old state discipline of gathering intelligence from open sources is today critical for investigating Disinformation but has lacked modernisation. A former UK Gov Head of DataSci presents a maturity model for updating legacy OSINT with Python!

Running An Open Source Project Like A Start Up
Cheuk Ting Ho
Business & Start-Ups, Community, Data Science, Machine Learning

My journey of running an open source project like a start up

skorch: A scikit-learn compatible neural network library that wraps pytorch
Benjamin Bossan
Deep Learning, Data Science, Machine Learning

Combine the best of sklearn and PyTorch by using skorch. This talk shows you why and how to use skorch and what cool features it has to offer.

Take control of your hearing: Accessible methods to build a smart noise filter
Peggy Sylopp, Aislyn Rose
Artificial Intelligence, Algorithms, Computer Vision, Deep Learning, Data Science, Machine Learning, Science

Control what you hear with deep learning and open audio databases. The developer and manager of \\NoIze//, a project supported by Prototype Fund, share what’s helped them build an open source smart, low-computational noise filter in Python.

The Sound of Silence: Online Misogyny and How we Model it
Teresa Ingram
Community, Natural Language Processing, Machine Learning

Opt Out of Online Sexism - From a Problem to Open Source Activism and the Mechanics of Change

Time Series Anomaly Detection for Bottling Machine Maintenance
Andrea Spichtinger
Algorithms, Data Science, Machine Learning

Anomaly detection in time series data from mechanical motors in bottling machines, set productive on an AWS edge device. #AnomalyDetection #UnsupervisedML #AWS

Transforming a Legacy System into a Bias-Mitigating AI Solution for Debt Repayment
Avaré Stewart
Artificial Intelligence, Data Science, Natural Language Processing, Machine Learning, Data Engineering

Unleash Intelligence in you Data Transform a Legacy System into Bias-Mitigating AI Solution for Debt Repayment with Tesseract, SpaCy, & AI Fairness 360

Using adversarial samples to break and robustify your Vision Neural Network Models
Irina Vidal Migallón
Artificial Intelligence, Computer Vision, Deep Learning, Machine Learning

How much time & risk do you have? Ways to robustify your vision NN model before you let it go live.

Using machine learning for Level Generation in Snake (video-game)
Filipe Silva
Data Science, Machine Learning

Using machine learning models for level generation in video-games

Version Control for Data Science
Alessia Marcolini
Artificial Intelligence, Data Science, Machine Learning

Versioning in Data Science projects can be pretty painful: are you able to track the data sets along with the code itself and some of the resulting models?

Why you don’t see many real-world applications of Reinforcement Learning.
Yurii Tolochko
Artificial Intelligence, Algorithms, Deep Learning, Machine Learning, Statistics

Why doesn’t RL show the same success as (un)supervised learning? Inherent difficulties facing RL and avenues for future work

Why you should (not) train your own BERT model for different languages or domains
Marianne Stecklina
Artificial Intelligence, Deep Learning, Data Science, Natural Language Processing, Machine Learning, Science

Language models like BERT can capture general language knowledge and transfer it to new data and tasks. However, applying a pre-trained BERT to non-English text has limitations. Is training from scratch a good (and feasible) way to overcome them?

Filter