archivist: Managing Data Analysis Results

Podobne dokumenty
deep learning for NLP (5 lectures)

Zarządzanie sieciami telekomunikacyjnymi

Machine Learning for Data Science (CS4786) Lecture11. Random Projections & Canonical Correlation Analysis

Miedzy legenda a historia: Szlakiem piastowskim z Poznania do Gniezna (Biblioteka Kroniki Wielkopolski) (Polish Edition)

TTIC 31210: Advanced Natural Language Processing. Kevin Gimpel Spring Lecture 9: Inference in Structured Prediction

Stargard Szczecinski i okolice (Polish Edition)

Wojewodztwo Koszalinskie: Obiekty i walory krajoznawcze (Inwentaryzacja krajoznawcza Polski) (Polish Edition)

Łukasz Reszka Wiceprezes Zarządu

Tychy, plan miasta: Skala 1: (Polish Edition)

Zakopane, plan miasta: Skala ok. 1: = City map (Polish Edition)

ARNOLD. EDUKACJA KULTURYSTY (POLSKA WERSJA JEZYKOWA) BY DOUGLAS KENT HALL

ERASMUS + : Trail of extinct and active volcanoes, earthquakes through Europe. SURVEY TO STUDENTS.

Strona główna > Produkty > Systemy regulacji > System regulacji EASYLAB - LABCONTROL > Program konfiguracyjny > Typ EasyConnect.

Cel szkolenia. Konspekt

Katowice, plan miasta: Skala 1: = City map = Stadtplan (Polish Edition)

Hard-Margin Support Vector Machines

Fig 5 Spectrograms of the original signal (top) extracted shaft-related GAD components (middle) and

The Overview of Civilian Applications of Airborne SAR Systems

An Empowered Staff. Patty Sobelman

ZGŁOSZENIE WSPÓLNEGO POLSKO -. PROJEKTU NA LATA: APPLICATION FOR A JOINT POLISH -... PROJECT FOR THE YEARS:.

OpenPoland.net API Documentation

What our clients think about us? A summary od survey results

Karpacz, plan miasta 1:10 000: Panorama Karkonoszy, mapa szlakow turystycznych (Polish Edition)

Wojewodztwo Koszalinskie: Obiekty i walory krajoznawcze (Inwentaryzacja krajoznawcza Polski) (Polish Edition)

Jak działa grawitacja?

Wojewodztwo Koszalinskie: Obiekty i walory krajoznawcze (Inwentaryzacja krajoznawcza Polski) (Polish Edition)

Machine Learning for Data Science (CS4786) Lecture 11. Spectral Embedding + Clustering

Zmiany techniczne wprowadzone w wersji Comarch ERP Altum

Network Services for Spatial Data in European Geo-Portals and their Compliance with ISO and OGC Standards

POLITYKA PRYWATNOŚCI / PRIVACY POLICY

INSTRUKCJE JAK AKTYWOWAĆ SWOJE KONTO PAYLUTION

Wojewodztwo Koszalinskie: Obiekty i walory krajoznawcze (Inwentaryzacja krajoznawcza Polski) (Polish Edition)

Systemy wbudowane. Poziomy abstrakcji projektowania systemów HW/SW. Wykład 9: SystemC modelowanie na różnych poziomach abstrakcji

Nowe podejście do składowania danych

MaPlan Sp. z O.O. Click here if your download doesn"t start automatically

Dolny Slask 1: , mapa turystycznosamochodowa: Plan Wroclawia (Polish Edition)

OSI Network Layer. Network Fundamentals Chapter 5. Version Cisco Systems, Inc. All rights reserved. Cisco Public 1

DOI: / /32/37

Ankiety Nowe funkcje! Pomoc Twoje konto Wyloguj. BIODIVERSITY OF RIVERS: Survey to teachers

Instrukcja obsługi User s manual

Evaluation of the main goal and specific objectives of the Human Capital Operational Programme

EXCEL PL PROGRAMOWANIE PDF

ANKIETA ŚWIAT BAJEK MOJEGO DZIECKA

TTIC 31210: Advanced Natural Language Processing. Kevin Gimpel Spring Lecture 8: Structured PredicCon 2

Domy inaczej pomyślane A different type of housing CEZARY SANKOWSKI

Emilka szuka swojej gwiazdy / Emily Climbs (Emily, #2)

OSI Network Layer. Network Fundamentals Chapter 5. ITE PC v4.0 Chapter Cisco Systems, Inc. All rights reserved.

Obwieszczenia i Zarządzenia władz Okupacynych (Sygn. 241), (Announcements and Orders) RG

Wiejskie organizacje pozarządowe

Wroclaw, plan nowy: Nowe ulice, 1:22500, sygnalizacja swietlna, wysokosc wiaduktow : Debica = City plan (Polish Edition)

EXAMPLES OF CABRI GEOMETRE II APPLICATION IN GEOMETRIC SCIENTIFIC RESEARCH

Towards Stability Analysis of Data Transport Mechanisms: a Fluid Model and an Application

Effective Governance of Education at the Local Level

Realizacja systemów wbudowanych (embeded systems) w strukturach PSoC (Programmable System on Chip)

Arrays -II. Arrays. Outline ECE Cal Poly Pomona Electrical & Computer Engineering. Introduction

Wojewodztwo Koszalinskie: Obiekty i walory krajoznawcze (Inwentaryzacja krajoznawcza Polski) (Polish Edition)

Poland) Wydawnictwo "Gea" (Warsaw. Click here if your download doesn"t start automatically

Healthix Consent Web-Service Specification

Revenue Maximization. Sept. 25, 2018

Krytyczne czynniki sukcesu w zarządzaniu projektami

Wybrzeze Baltyku, mapa turystyczna 1: (Polish Edition)

THE ADMISSION APPLICATION TO PRIVATE PRIMARY SCHOOL. PART I. Personal information about a child and his/her parents (guardians) Child s name...

UE przyjmuje nowy program Bezpieczny Internet : 55 mln euro, aby Internet stał się bezpieczny dla dzieci

Analysis of Movie Profitability STAT 469 IN CLASS ANALYSIS #2

Why do I need a CSIRT?

Technologie Komponentowe. Piotr Łukasik p /

Perspektywy PDF. ==>Download: Perspektywy PDF ebook By 0

Instructions for student teams

Proposal of thesis topic for mgr in. (MSE) programme in Telecommunications and Computer Science

Instrukcja konfiguracji usługi Wirtualnej Sieci Prywatnej w systemie Mac OSX

1. Ile czasu dziennie spędzasz z rodzicami?

Machine Learning for Data Science (CS4786) Lecture 24. Differential Privacy and Re-useable Holdout

Linear Classification and Logistic Regression. Pascal Fua IC-CVLab

MS Visual Studio 2005 Team Suite - Performance Tool

Projekt: Mikro zaprogramowane na sukces!

Administracja serwerami

USB firmware changing guide. Zmiana oprogramowania za przy użyciu połączenia USB. Changelog / Lista Zmian

Adult Education and Lifelong Learning

Pielgrzymka do Ojczyzny: Przemowienia i homilie Ojca Swietego Jana Pawla II (Jan Pawel II-- pierwszy Polak na Stolicy Piotrowej) (Polish Edition)

MoA-Net: Self-supervised Motion Segmentation. Pia Bideau, Rakesh R Menon, Erik Learned-Miller

Formularz dla osób planujących ubiegać się o przyjęcie na studia undergraduate (I stopnia) w USA na rok akademicki

Wydział Fizyki, Astronomii i Informatyki Stosowanej Uniwersytet Mikołaja Kopernika w Toruniu

Anonymous Authentication Using Electronic Identity Documents

Financial support for start-uppres. Where to get money? - Equity. - Credit. - Local Labor Office - Six times the national average wage (22000 zł)

Oprogramowanie R na potrzeby Uczelnianej Bazy Wiedzy w projekcie SYNAT

Sargent Opens Sonairte Farmers' Market

Baptist Church Records

[MS-10979] Course 10979C: Microsoft Azure Fundamentals (2 dni)

Installation of EuroCert software for qualified electronic signature

DM-ML, DM-FL. Auxiliary Equipment and Accessories. Damper Drives. Dimensions. Descritpion

you see decision. oznacza to, Whenever kiedy widzisz biznes, someone once made Za każdym razem, który odnosi sukces,

Agnieszka Lasota Sketches/ Szkice mob

2 nd ClimMani EU COST Action Meeting Poznań, Poland September, 2015

METHOD 2 -DIAGNOSTIC OUTSIDE

Życie za granicą Studia

DATA-S MONITORING ROZPROSZONY OŚWIETLENIA AWARYJNEGO DIVERSIFIED MONITORING OF EMERGENCY LIGHTING

Immigration Studying. Studying - University. Stating that you want to enroll. Stating that you want to apply for a course.

WYDZIAŁ NAUK EKONOMICZNYCH

Teaching activities. improving methods of teaching. improving social students skills. respect to each other. communication through games -

Transkrypt:

archivist: Managing Data Analysis Results https://github.com/pbiecek/archivist Marcin Kosiński 1,2, Przemysław Biecek 2 1 IT Research and Development Grupa Wirtualna Polska 2 Faculty of Mathematics, Informatics and Mechanics, University of Warsaw 14 October, 2015

About Grupa Wirtualna Polska The Leader Of The Polish Internet IT Research and Development large-scale online learning personalized news article recommendation e-mail targeting text mining web user behavior identification 2 / 12 Figure : wp.pl

archivist: What does it do? Main features allows to store and archive objects in repositories (stored on a local disk or via GitHub/Dropbox) provides handy tools facilitating objects search and recovery ideally performs as cache supports the philosophy of reproducible research 3 / 12

archivist: Why is it useful? Solves reproducible research problems sometimes raw data are large or with limited access computations take a lot of time or require specialized hardware reproducibility requires specific versions of packages 4 / 12

archivist: Cache Use Case getmaxdistribution <- function ( D = rnorm, N = 10, R = 1000000) { res <- replicate (R, max (D(N))) summary ( res ) } # or get directly from GitHub library ( archivist ) loadfromgithubrepo ( md5hash =" c", user =" MarcinKosinski ", repo =" Museum ") 5 / 12

archivist: Cache Use Case Using the archivist one can prepare a repository which stores calls and results of the cache() function to avoid their re-call in the future. cacherepo <- tempdir () createemptyrepo ( cacherepo ) system. time ( cache ( cacherepo, getmaxdistribution, rnorm, 10) ) user system elapsed 5.230 0.090 5.315 system. time ( cache ( cacherepo, getmaxdistribution, rnorm, 10) ) user system elapsed 0.009 0.000 0.008 6 / 12

archivist: Retrieving an Object Use Case Fragment of a poster Using archivist to retrieve an object aread ( MarcinKosinski / ISOBM /3b ) -> mutationsplot plot ( mutationsplot ) Figure : ggplot object Figure : downloaded object 7 / 12

archivist: Object s Pedigree Use Case createemptyrepo ( "FORUM_BI ", d e f a u l t = TRUE) i n v i s i b l e ( a o p t i o n s ( " s i l e n t ", TRUE) ) d a t a ( i r i s ) i r i s %a% d p l y r : : f i l t e r ( S e p a l. Length < 16) %a% lm ( Petal. Length~ Species, data =.) %a% summary ( ) > obj a h i s t o r y ( o b j ) i r i s [ ff575c261c949d073b2895b05d1097c3 ] > d p l y r : : f i l t e r ( S e p a l. Length < 16) [ 9 f 7 0 4 5 b 7 3 2 2 c d f 3 a 9 0 7 1 3 7 7 c 6 f e 9 c 1 7 5 ] > lm ( Petal. Length ~ Species, data =. ) [ 0 a82efeb8250a47718cea9d7f64e5ae7 ] > summary ( ) [ 671 a0b89fccdf02087acb002374a0fcd ] 8 / 12

archivist: Objects Exploration Within a Repository models <- asearch (" pbiecek / graphgallery ", patterns = c(" class :lm", " coefname : Sepal. Length ")) lapply ( models, coef ) [ [ 1 ] ] ( I n t e r c e p t ) S e p a l. Length 7.101443 1. 8 5 8 4 3 3 [ [ 2 ] ] ( I n t e r c e p t ) S e p a l. Length S p e c i e s v e r s i c o l o r S p e c i e s v i r g i n i c a 1.7023422 0. 6 3 2 1 0 9 9 2. 2 1 0 1 3 7 8 3. 0 9 0 0 0 2 1 9 / 12

archivist: How does it work? library (" archivist ") 10 / 12

archivist: Plans & Prototypes Automated repository creation on github, commit, push and a return of a hook to an object. archive (iris, " MarcinKosinski ", " archivist - Museum - RforeveR _ last3 ", USER _ EMAIL, USER _ PASSWORD, app _key, app _ secret, github _ token ) -> aread _ input # function returns a hook archivist :: aread (" MarcinKosinski / archivist - Museum - RforeveR _ last3 / ff575c261c949d073b2895b05d1097c3 ") archivist :: aread ( aread _ input ) -> x digest :: digest (x) [1] " ff575c261c949d073b2895b05d1097c3 " identical (iris,x) [1] TRUE 11 / 12

archivist: Learn More http://pbiecek.github.io/archivist/ 12 / 12