Skip to content
Toby Dylan Hocking edited this page Sep 4, 2025 · 385 revisions

These articles either focus on data.table (bold) or mention/use it (perhaps only briefly and you may need to search the article for "data.table"), ordered by date. If you know of an article that may be of interest to others, please add it here (**). You can also search all articles from the R blogosphere since c. 2009 on http://www.r-bloggers.com/. There is no filter applied: if the article exists and mentions data.table, positively or negatively, it is included on this page. Please watch out for benchmarks measured in milliseconds. Comparisons on such small scales often do not hold when scaled up to larger data because, for example, they over-represent call overhead and/or the dataset is so small it fits in CPU cache. A test repetition count (e.g. ntimes=) of 5 or more is often an indication that the test data size is too small. Please check that setkey() has been used and its time reported separately. Tutorials, slides and videos are over on the Videos & Slides page.

(**) all pages on this wiki have no write restrictions. You are encouraged to change content in this wiki yourself as you see fit. Changes will go live immediately with no oversight by any project member. If you spot any abuse, please check the edit history to see who made the edit and please inform us.

LinkTitleAuthor
2025.09Manipuler des données avec data.tablePierre-Yves Berrard, Lino Galiana et Olivier Meslin
2025.05Syntax conversion: data.table vs. base vs. dplyrVincent Arel-Bundock
2024.11Data wrangling with data.tableStata2R: Kyle Butts, Nick Huntington-Klein, and Grant McDermott
2024.11Julia DataFrames.jl comparison with data.tableauthors of DataFrames.jl docs
2024.11data.table.threadsAnirban Chetia
2024.10Comparing data.table reshape to duckdb and polarsToby Dylan Hocking
2024.10Benchmarking rolling window functions in RMikkel Roald-Arbøl
2024.09Mutation testing for data.tableAnirban Chetia
2024.08Collapse reshape benchmarkToby Dylan Hocking
2024.07Benchmarking a change in data.tableToby Dylan Hocking
2024.06data.table for the Google Summer of Code 2024 (Joshua Wu)Joshua Wu
2024.02Column assignment and reference semantics in data.tableToby Dylan Hocking
2024.02NSF project activitiesAnirban Chetia
2024.02new programming with data.tableJohn MacKintosh
2024.02more .I in data.tableJohn MacKintosh
2024.01.I in data.tableJohn MacKintosh
2024.01Reshape performance comparisonToby Dylan Hocking
2023.12Comparing data table to frame for row subsetToby Dylan Hocking
2023.12non-equi joins in data.tableJohn MacKintosh
2023.11Some pedagogical elements of computer programming for data science: A comparison of three approaches to teaching the R languageDavid Shilane, Nicole Di Crecchio, Nicole L. Lorenzetti
2023.11data.table CRAN diffs: Verifying consistency between CRAN and githubToby Dylan Hocking
2023.10data.table asymptotic timingsToby Dylan Hocking
2023.03A Coding Translation to Increase the Efficiency of Programmatic Data AnalysesDavid Shilane
2023.02Pivoting data in R with tidyr and data.tableJohn MacKintosh
2022.11dplyr 1.1.0 is coming soonDavis Vaughan
2022.11Handling larger than memory data with{arrow} and{duckdb}David Lucey
2022.11R Package Release History: Extracting and plotting data from CRAN web siteToby Dylan Hocking
2022.10Efficiency comparison of dplyr and tidyr functions vs base RManuel Teodoro Tenango
2022.08modifying columns in datatable with lapplyJohn MacKintosh
2022.08Simulating data from a non-linear function by specifying a handful of pointsKeith Goldfeld
2022.06Timing data.table OperationsThomas Shafer
2022.06Shuffling Columns With data.tableThomas Shafer
2022.06A quirk when using data.table?Kenneth Tay
2022.05Comparing performances of CSV to RDS, Parquet, and Feather file formats in RTomaž Kaštrun
2022.04Loading a large, messy csv using data.table fread with cli toolsDavid Lucey
2022.04Greatly revised edition of tidyverse skeptic
Original 2019.07 below: Ctrl-F "matloff"
Norm Matloff
2022.03Shiny: Fast Data Loading with fstPhilipp Probst
2021.12Optimising dplyrTom Jemmett
2021.11Should I Move to a Database?Roel M. Hogervorst
2021.10Most Starred and Forked GitHub Repos for Data Science and RKenneth Leung
2021.10fwf without the faffJohn MacKintosh
2021.10Simulating the Squid Game bridge scene in RJohn Paul Helveston
2021.09Calculating hotel occupancy with RJohn MacKintosh
2021.08Exploring Stock Market Listing Mortality since 1986David Lucey
2021.08Introducing the fastverse: An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data ManipulationSebastian Krantz
2021.08Well Well Well My ExcelJohn MacKintosh
2021.08Cutting down code in dplyr and data.tableJohn MacKintosh
2021.08Code performance in R: Working with large datasetsMira Céline Klein
2021.07Time Travel with py datatable 1.0Gregory Kanevsky
2021.06DTPlyr – easier data.table for DPLYR usersGary Hutson
2021.06Stress testing reshape operations on list columnsToby Dylan Hocking
2021.06Wide-to-tall Data Reshaping Using Regular Expressions and the nc PackageToby Dylan Hocking
2021.05Update about data reshaping and visualization in R and pythonToby Dylan Hocking
2021.05Hamburg RUG: A professional trading research system in RDaniel Brandt
2021.05The new R pipeElio Campitelli
2021.0410 Tips And Tricks For Data Scientists Vol.6George Pipis
2021.04Not data.table vs dplyr... data.table + dplyr!Matt Dancho
2021.03Some data.table tipsJohn MacKintosh
2021.03Data.Table – everything you need to know to get you started in RGary Hutson
2021.02I wrote one of the fastest DataFrame libraries (hacker news)Ritchie Vink
2021.02Joins vs case whens - speed and memory tradeoffsThomas Mock
2021.02The unequalled joy of non-equi joinsDavid Selby
2021.02Measuring and Monitoring Arrow's Performance: Some Updated R Benchmarks (response)Jonathan Keane & Neal Richardson
2021.02Bigger Data With Ease Using Apache Arrow, (response)(rebuttal)Neal Richardson
2021.01Fast and Easy Aggregation of Multi-Type and Survey Data in RSebastian Krantz
2021.01How to create a stock screenerMartin Bel
2020.12You only need library(data.table) / 你只需要library(data.table) (in Chinese)Xianying Tan (@shrektan)
2020.11Comparing Common Operations in dplyr and data.tableMartin Chan
2020.11non-equi merge in data.table and epidemiologyDenis Mongin
2020.10The ultimate R data.table cheat sheetSharon Machlis
2020.10What is R data.table and Why is R data.table? (In Korean, 한국어)HongDon Lee
2020.10Solving small problems with data.tableJohn MacKintosh
2020.10Python and R – Part 1: Exploring Data with DatatableDavid Lucey
2020.10Decomposition and Smoothing with data.table, reticulate, and spatstatTony ElHabr
2020.09The Fastest Way To Read And Write Files In RGeorge Pipis
2020.09The treedata.table PackageApril Wright, Cristian Román-Palacios, Josef Uyeda
2020.09Gotta go fast with "{tidytable}"Bruno Rodrigues
2020.09Task 2 - Retail Strategy and AnalyticsShrishti Vaish
2020.08Solving small data problems with data.tableJohn MacKintosh
2020.08Replicating .SD in Python DatatableSamuel Oranyeli
2020.08Let's Learn data.table (日本語)Uryu Shinya
2020.0887th TokyoR Meetup Roundup:{data.table}, Bioconductor, & more!Ryo Nakagawara
2020.075 handy options in R data.table’s freadSharon Machlis
2020.07Even more reshape benchmarksGrant McDermott
2020.07RvsPython #2: Pivoting Data From Long to Wide FormBenjamin Smith
2020.06A gentle introduction to data.table@atrebas
2020.06Reshape benchmarksGrant McDermott
2020.06Selecting and Grouping Data with Python DatatableSamuel Oranyeli
2020.05dtplyr speed benchmarksIyar Lin
2020.05Creating a data.table from C++David Zimmermann, Leonardo Silvestri, Dirk Eddelbuettel
2020.04Data manipulation libraries: Translating between data.table, pandas, dplyrToby Dylan Hocking
2020.04patientcounterJohn MacKintosh
2020.04Fastest data operations with least memory in tidy syntaxTian-Yuan Huang
2020.04W is for Write and Read Data – FastSara Locatelli
2020.03Use data.table the tidy way: An ultimate tutorial of tidyfstTian-Yuan Huang
2020.03R data.table symbols and operators you should knowSharon Machlis
2020.03Variable name in functions, it's easy with datatableLino Galiana
2020.02stringsAsFactorsKurt Hornik
2020.01Programming with data.tableJohn MacKintosh
2020.01Blazing Fast Data Wrangling With R data.tableThu Vu
2020.01New Timings for a Grouped In-Place Aggregation TaskJohn Mount
2020.01Base R, the tidyverse, and data.table: a comparison of R dialects to wrangle your dataJason Mercer
2019.124 great free tools that can make your R work more efficient, reproducible and robustJozef Hajnala
2019.12Why I don’t use the TidyverseHolger K. von Jouanne-Diedrich
2019.11dtplyr 1.0.0Hadley Wickham
2019.10Using ggplot2 Inside data.tableJohn Lashlee
2019.10Fast and Readable 'If Else' in RTysson Barrett
2019.10Data Joins: Speed and Efficiency of dplyr and data.tableTysson Barrett
2019.10Comparing Efficiency and Speed of data.table: Adding variables, filtering rows, and summarizing by groupTysson Barrett
2019.10Columnar File Performance Check-in for Python and R: Parquet, Feather, and FSTWes McKinney
2019.09Selecting the max value from each group, a case study: data.tableNathan Eastwood
2019.09Sentiment analysis at the Fringe, part 1Megan Stodel
2019.09{disk.frame} is epicBruno Rodrigues
2019.08A shallow benchmark of R data frame export/import methodsJulien Barnier
2019.08The R FactorOwen Jones
2019.08Hydra Chronicles, Part V: Loose EndsBrodie Gaslam
2019.08Everyone’s Favorite Blogpost: CSV BenchmarksJacob Quinn
2019.08No visible binding for global variableNathan Eastwood
2019.08Why Machine Learning is more Practical than Econometrics in the Real WorldAdrian Antico
2019.08What’s next for the popular programming language R?Dan Kopf
2019.08Wrangling 4.6M Rows with dtplyr (the NEW data.table backend for dplyr)Matt Dancho
2019.08mlr3-0.1.0Patrick Schratz
2019.07Hydra Chronicles, Part IV: Reformulation of StatisticsBrodie Gaslam
2019.07Multiple Columns to Multiple Colums at OnceRecle Etino Vibal
2019.07Long to Wide and Wide to Long Format ConversionGiovanni Pavolini
2019.07fread-benchmarks-rsuiteAlfonso R. Reyes
2019.07Bayesian Power Analysis with data.table, tidyverse, and brmsTyson Barrett
2019.07Making .SD your best friendJosé Morales
2019.07data.table's cube functionGiovanni Pavolini
2019.07How to use .SD in the data.table packageSharon Machlis
2019.07Why I Chose to Learn data.table (and such related things)Tyson Barrett
2019.07What R’s most popular tools say about the state of data scienceDan Kopf
2019.07data.table and Text Analysis: Analyzing the Four GospelsTyson Barrett
2019.07Analyzing data with data.tableGiovanni Pavolini
2019.07Why I love data.tableElio Campitelli
2019.07Why I like the TidyverseChris Muir
2019.07An opinionated view of the Tidyverse "dialect" of the R language, and its promotion by RStudio
Circa this revision on GitHub was in effect at the time and widely shared; e.g. HackerNews. Revision announced 2022.04.
Norm Matloff
2019.06Learning Japanese with data.table and ggplot2Atrebas
2019.06data.table by a dummyJohn MacKintosh
2019.06My Favorite data.table FeatureJohn Mount
2019.06Coke vs. Pepsi? data.table vs. tidy? Part 2)Beth Milhollin, Russell Zaretzki, and Audris Mockus
2019.06The Psychology of Flame WarsEdwin Thoen
2019.06data.table is Much Better Than You Have Been ToldJohn Mount
2019.06data.table is expressive and powerfulMichael Frasco
2019.06How data.table's fread can save you a lot of time and memory, and take input from shell commandsJozef Hajnala
2019.06Hydra Chronicles, part III: Catastrophic ImprecisionBrodie Gaslam
2019.06Hydra Chronicles, part II: beating data.table at its own gameBrodie Gaslam
2019.06An Overview of Python's Datatable packageParul Pandey
2019.06For and Against data.tableAaron Jacobs
2019.05Three reasons why I use data.tableMegan Stodel
2019.05Timing Working With a Row or a Column from a data.frameJohn Mount
2019.05Using Data Cubes with RKristian Larsen
2019.05cranlogs 2.1.1 is on CRAN!R-hub blog
2019.05R package installation on windows considered harmfulToby Dylan Hocking
2019.05Hydra Chronicles, part I: Pixie DustBrodie Gaslam
2019.04Using data.table with magrittr pipes: best of both worldsMartin Chan
2019.04What are the Popular R Packages?John Mount
2019.04Coke vs. Pepsi? data.table vs. tidy? Examining Consumption Preferences for Data ScientistsAudris Mockus
2019.03A data.table and dplyr tourAtrebas
2019.03Dependencies. Now with badges!Dirk Eddelbuettel
2019.03Unit Tests in RJohn Mount
2019.03Creating blazing fast pivot tables from R with data.table - now with subtotals using grouping setsJozef Hajnala
2019.02A strategy for faster group statisticsBrodie Gaslam
2019.02Verbose data.table and uncovering hidden cedta's data table awareness decisionsJozef Hajnala
2018.12Timing Grouped Mean Calculation in RJohn Mount
2018.12How to sort data by one or more columns with base R, dplyr and data.tableJozef Hajnala
2018.12Smartly select and mutate data frame columns, using dictRoman Pahl
2018.11Statistics Sunday: Reading and Creating a Data Frame with Multiple Text FilesSara Locatelli
2018.11Wrangling and Manipulation of Monthly Philippine Consumer Price IndexRecle Vibal
2018.10Now "fread" from data.table can read "gz" and "bz2" files directlyPradeep Mavuluri
2018.10How to perform merges (joins) on two or more data frames with base R, tidyverse and data.tableJozef Hajnala
2018.10How to import a directory of csvs at once with base R and data.table. Can you guess which way is the fastest?Jozef Hajnala
2018.10Some R Guides: tidyverse and data.table VersionsJohn Mount
2018.10Running the Same Task in Python and RJohn Mount
2018.10Limiting dependencies in R package developmentScott Chamberlain
2018.09R Tip: Give data.table a tryJohn Mount
2018.08Timings of a Grouped Rank Filter TaskJohn Mount
2018.08R Tip: Consider Radix SortJohn Mount
2018.08Meta-packages, nails in CRAN’s coffinJohn Mount
2018.07EARL London interviews – Patrik Punco, NOZ MedienMango Solutions
2018.07Speed up your R WorkJohn Mount
2018.06Python for data analysis… is it really simple?!?Ferenc Bodon
2018.06R and Data – When Should we Use Relational Databases?Claude Seidman
2018.06Re-referencing factor levels to estimate standard errors when there is interaction turns out to be a really simple solutionKeith Goldfeld
2018.06Most Starred R Packages on GitHubSteven Mortimer
2018.06Melt and Cast The Shape of Your Data-Frame: Exercisessindri
2018.06Sharpening The Knives in The data.table Toolbox: [Exercises] [Solutions]sindri
2018.06rqdatatable: rquery Powered by data.tableJohn Mount
2018.04An R vlookup? Not so silly ideaHanjo Oden
2018.04Benchmarking the six most used manipulations for data.tables in ROpremic
2018.04Down the AUC Rabbit Hole and into Open Source: Part 2Michael Frasco
2018.04Down the AUC Rabbit Hole and into Open Source: Part 1Michael Frasco
2018.04Quick R TutorialFrank Erickson
2018.03pandas vs. data.table – A study of data-frames – Part 2Tobias Krabel
2018.02Retail analytics: from hours to seconds using RBharani Subramaniam
2018.02pandas vs. data.table – A study of data-framesChristian Moreau
2018.02Julia vs R vs Python: string-sort performance + an unfinished journey to optimizing Julia's performanceZJ
2018.02dplyr, (mc)lapply, for-loop and speedMike Spencer
2018.02Speeding up spatial analyses by integrating sf and data.table: a test caseLorenzo Busetto
2018.02Packages for Getting Started with Time Series Analysis in RAbraham Mathew
2018.02DataExplorer: Fast Data Exploration With Minimum CodeBoxuan Cui
2018.01Supercharge your R code with wraprJohn Mount
2018.01Tidyverse and data.table, sitting side by side… and then base R walks inIñaki Úcar
2018.01Tidyverse and data.table, sitting side by side (Part 1)Dirk Eddelbuettel
2018.01Base R can be FastJohn Mount
2018.01Lightning fast serialization of datasets using the fst packageMark Klik
2018.01rquery: Fast Data Manipulation in RJohn Mount
2017.12A tour of the data.table package by creator Matt DowleDavid Smith
2017.12More Pipes in RJohn Mount
2017.12Team Rtus wins Munich Re Datathon with mlrJann Goschenhofer
2017.12Correlated log-normal chain-ladder modelMarkus Gesmann
2017.11How we built a Shiny App for 700 usersOlga Mierzwa-Sulima
2017.11An empirical study of group-by strategies in JuliaZJ
2017.11Using data.table and Rcpp to scale geo-spatial analysis with sfTim Appelhans
2017.11Creating integer64 and nanotime vectors in C++Dirk Eddelbuettel
2017.10The Impressive Growth of RDavid Robinson
2017.10Data.Table by Example – Part 3atmathew
2017.09Speed of data manipulations in Julia vs RZJ
2017.09Data.Table by Example – Part 2atmathew
2017.09Data.Table by Example – Part 1atmathew
2017.09Beyond the basics of data.table: Smooth data explorationSindri
2017.09Strategies to Speed-up R CodeSelva Prabhakaran
2017.08Is the Hadleyverse the only option?Billy Fung
2017.08Basics of data.table: Smooth data explorationSindri
2017.08Polygenic Risks Scores with data.table in RSahir Rai Bhatnagar
2017.08July(ish) UpdateJohn MacKintosh
2017.08R for System AdminstrationDirk Eddelbuettel
2017.07Compare data.table pipes and magrittr pipesGuanglai Li
2017.06data.table tutorial (with 50 examples)Deepanshu Bhalla
2017.06The data.table R Package Cheat SheetKarlijn Willems
2017.06Data Manipulation with data.table (part 2)Biswarup Ghosh
2017.06R in pRoduction: theRe be dRagons!Tim Sweetser and Kyle Schmaus
2017.06Improving Zillow’s Zestimate with 36 Lines of CodeEduardo Ariño de la Rubia
2017.06Data Manipulation with data.table (part 1)Biswarup Ghosh
2017.05plotly 4.7.0 now on CRANCarson Sievert
2017.05R⁶ — Idiomatic (for the People)Bob Rudis
2017.05Reading/writing biggish data, revisitedKarl Broman
2017.05dplyr in contextJohn Mount
2017.05Everyone knows that loops in R are to be avoided but vectorization is not always possibleKeith Goldfeld
2017.04R code to accompany Real-World Machine Learning (Chapter 6): Exploring NYC Taxi DataPaul Adamson
2017.04Fast data loading from files to ROlga Mierzwa-Sulima
2017.03Data Manipulation with Python Pandas and R Data.TableFisseha Berhane
2017.03Fast data lookups in R: dplyr vs data.tableMarek Rogala
2017.02Fitting logistic regression on 100gb dataset on a laptopDmitriy Selivanov
2017.02Large data, feature hashing and online learningDmitriy Selivanov
2017.02Moving largish data from R to H2O - spam detection with Enron emailsPeter Ellis
2017.01Discover your data (XGBoost vignette)Tianqi Chen, Tong He, Michaël Benesty, Yuan Tang
2017.01fst: Fast serialization of R data framesDavid Smith
2017.01fst: Lightning Fast Serialization of Data FramesMark Klik
2017.01R to the RescueJohn Mackintosh
2016.12Using R to prevent food poisoning in ChicagoDavid Smith
2016.12Behind the scenes of CRANMatt Dowle
2016.12nanotime 0.0.1: New package for Nanosecond Resolution Time for RDirk Eddelbuettel
2016.12Does replyr::let work with data.table?John Mount
2016.12data.table: Where Have You Been All My Life?JoAnn Rudd Alvarez
2016.12Organize your data manipulation in terms of “grouped ordered apply”John Mount
2016.12Comparing a MySQL Query with a Data Table in RDouglas Rice
2016.11data.table: squeeze the maximum speed when using data in RStanislav Chistyakov
2016.10Data Wrangling: Quick Guide for dplyr, data.table and R build-in data.frameVincent Cao
2016.09This Machine Learning Project on Imbalanced Data Can Add Value to Your ResumeManish Saraswat
2016.09Rolling a joinWill Rogers
2016.07Winning approach of the Facebook V Kaggle competitionTom Van de Wiele
2016.07New release of partools packageNorm Matloff
2016.07Bad Coder, Bad Coder!Norm Matloff
2016.06Intro to the data.table packageSteve Pittard
2016.06Boost Your Data Munging with RJan Gorecki
2016.06Improving Season on SeasonJames P. Curley
2016.06Understanding data.table Rolling JoinsRobert Norberg
2016.05From a (set.)seed grows a mighty datasetJonathan Carroll
2016.05Feather: fast, interoperable data import/export for RDavid Smith
2016.05Best packages for data manipulation in RFisseha Berhane
2016.05My Two favorite Packages for Data Manipulation in RFisseha Berhane
2016.05Use H2O and data.table to build models on large data sets in RManish Saraswat
2016.05The R Data I/O ShootoutEduardo Ariño de la Rubia
2016.05Red herring bitesMatt Dowle
2016.05data.table() vs data.frame() – Learn to work on large data sets in RManish Saraswat
2016.04Feather: it's about metadataWes McKinney
2016.04Fast csv writing for RMatt Dowle
2016.04I'll Keep Using RMichael Ekstrand
2016.04data.table objects should not be considered data.frame instances in R [retracted]John Mount
2016.04Learning R in Seven Simple StepsMartijn Theuwissen
2016.04Collapsing lists of data.frames with data.tableSteph Locke
2016.04Working with databases in RFisseha Berhane
2016.03Data table exercises: keys and subsettingHan de Vries
2016.03Performing SQL selects on R data framesFisseha Berhane
2016.02Read from hdfs with R. Brief overview of SparkRDmitriy Selivanov
2016.02Up to code? An algorithm is helping Chicago health officials predict restaurant safety violations (featured on TV at 06:40). [Tweet] [Code]PBS NewsHour
2016.01Strategies to Speedup R CodeSelva Prabhakaran
2015.12Our R package roundup 2015Christoph Safferling
2015.12Who’s downloading the forecast package?Rob J Hyndman
2015.12Solve common R problems efficiently with data.tableJan Gorecki
2015.11Efficient aggregation (and more) using data.tableDavid Kun
2015.11Scaling data.table with indexJan Gorecki
2015.11H2O World 2015 – Day 2 HighlightsAnmol Rajpurohit, KDnuggets
2015.11H2O World 2015Joseph Rickert
2015.11H2O.ai raises $20m series B to capitalize on rapid open source machine-learning growthMatt Aslett, 451 Research
2015.10R and Impala: it's better to KISS than using JavaGergely Daroczi
2015.10R: data.table – Finding the maximum rowMark Needham
2015.09Querying a 20 million line CSV file – data.table vs data frameMark Needham
2015.09Data ergonomics with data.table, iHub Nairobi, with supporting materialsHenk Harmsen
2015.09R Stories from the Trenches [Video] [Slides]Szilard Pafka
2015.09Advanced Tips and Tricks with data.tableAndrew Brooks
2015.08data.table cookbookSteph Locke
2015.07Overlap joins in R: a speed comparison with packages sqldf and data.tableZev Ross
2015.06Data Warehousing with RJan Gorecki
2015.06Auditing data transformationJan Gorecki
2015.06Back from R/Finance in ChicagoMarkus Gesmann
2015.05Fast data munging in RAlexander Konduforov
2015.05No THIS Is How You dplyr and data.table!Jeffrey Horner
2015.05Comparing data frames, data.table and dplyr with random walksDavid Smith
2015.05Working with "large" datasets, with dplyr and data.tableArthur Charpentier
2015.04Comparing the execution time between foverlaps and findOverlaps [data.table vs GenomicRanges]Katarzyna Wręczycka
2015.04Open Source Business Intelligence: Then and NowSteve Miller
2015.04Mapping Flows in R with data.table and latticeOscar Perpiñán Lamigueiro
2015.03Need for Processing Speed: data.tableOpenAnalytics
2015.03Getting Data From An Online SourceRobert Norberg
2015.02A data.table R tutorial by DataCamp: intro to DT[i, j, by]DataCamp
2015.02Minimal example for joining data.tablesMarkus Gesmann
2015.01Using the microbenchmark package to compare the execution time of R expressionsStephen Turner
2015.01Sessionizing Log Data Using data.tableRandy Zwitch
2015.01R in Business IntelligenceJan Gorecki
2014.12dplyr and a very basic benchmarkSzilard Pafka
2014.12JOINing data in R using data.tableRonald Stalder
2014.12Cheat Sheets for Data ScienceSteve Miller
2014.11Partying R Style with Sqor Sports, R on Azure, and data.tableJoseph Rickert
2014.11The data.table Cheat SheetDataCamp
2014.11Some R Highlights from H20 WorldJoseph Rickert
2014.10Complete data.table tutorial: data analysis the data.table wayDataCamp
2014.10data.table UniversitySteve Miller
2014.10Visualising the seasonality of Atlantic windstormsMarkus Gesmann
2014.08Scaling up data framesBen Lorica
2014.08data.table for RGrant Rettke
2014.08MongoDB – State of the RRaffael Vogler
2014.08VIDEO Matt Dowle's data.table talk from useR! 2014Eduardo Ariño de la Rubia
2014.08Pro Grammar and Devel HoperRomain Francois
2014.08Faster CSV Import with RPhill Clarke
2014.0710 R Packages to Win Kaggle CompetitionsXavier Conort
2014.07R – Data.Table Rolling JoinsBen Gorman
2014.07Dependencies of popular R packagesAndrie de Vries
2014.072014 useR! conference, days 1-2Karl Broman
2014.06The joy of joining data.tablesMarkus Gesmann
2014.06Concatenating a list of data framesAndrew
2014.05R/Finance 2014Steve Miller
2014.05Working with large data sets in R - data.table and dcastKamil Bartocha
2014.05Reading large data tables in RFabio Marroni
2014.04Exploring US healthcare dataVik Paruchuri
2014.04data.table vs dplyr in split apply combine style analysisBrodie G
2014.02Dueling R and Python FollowupSteve Miller
2014.02Efficiency of Importing Large CSV Files in Rstatcompute
2014.01Benchmark on baseball data: dplyr (0.1) and data.table (1.8.10) [tweet]Arun Srinivasan and Matt Dowle
2014.01R: the good partsJose Quesada
2014.01Two of my favorite data.table featuresBrandon Le Beau
2014.01When I use plyr/dplyr/data.tableEducate-R
2013.12Review: Kölner R Meeting 13 December 2013Markus Gesmann
2013.09A speed comparison of plyr, data.table and dplyrJake Russ
2013.08An R function like “order” from StataAnanda Mahto
2013.07Fig Data: 11 Tips on How to Handle Big Data in R (and 1 Bad Pun)Ulrich Atz
2013.07A Bottom-up Start on Big Data AnalyticsSteve Miller
2013.06Simulating Map-Reduce in R for Big Data Analysis Using Flights DataJitender Aswani
2013.06Improve The Efficiency in Joining Data with Indexstatcompute
2013.04FasteR! HigheR! StrongeR! – A Guide to Speeding Up R Code for Busy PeopleNoam Ross
2013.04Using data.table for binningOscar Perpiñán Lamigueiro
2013.03RMark: data.table merge vs core mergeXachriel
2013.02data.table or data.frame?DataParadigms
2013.01Another Benchmark for Joining Two Data Framesstatcompute
2013.01Efficiecy of Extracting Rows from A Data Frame in Rstatcompute
2013.01Efficiency in Joining Two Data Framesstatcompute
2012.12Surprising Performance of data.table in Data AggregationWensui Liu
2012.11Data.table rocks! Data manipulation the fast way in RMarkus Gesmann
2012.10Generate a panel data.table or data.frame to fill with dataThiemo Fetzer
2012.06Transforming subsets of data in R with by, ddply and data.tableMarkus Gesmann
2012.06Access data quickly and easily: data.table packageAnna Longari
2012.05data.table 1.8.1 - Now allows numeric columns and big-number (via bit64) in keys!Branson Owen
2012.03R code for Chapter 2 of Non-Life Insurance Pricing with GLMAllan Engelhardt
2012.02Elegant & fast data manipulation with data.tableCarl Boettiger
2012.01Say it in R with "by", "apply" and friendsMarkus Gesmann
2011.08Comparison of ave, ddply and data.tablePaul Hiemstra
2011.04Data Aggregation in R: plyr, sqldf and data.tableHayward Godwin
2011.03Applying functions on groups: sqldf, plyr, doBy, aggregate or data.table ?altuna
2011.03Fast(ish) extraction of exon locations from a BED12 file using data.tablealtuna
2011.03data.table: an R package everyone should useJason
2011.02By-Group Processing, the R data.table and the Power of Open SourceSteve Miller

Clone this wiki locally