Revisiting text processing with R and Python
Back in 2011, I covered the relative performance difference of the most popular libraries for text processing in R and Python. In case you can’t guess the answer, Python and NLTK won by a...
Plotting Oracle RMAN backup durations with R
How long does your Oracle RMAN backup take to complete? How does this vary over time? Are there patterns by week, week of month, or day of week? The gist below can help you evaluate questions...
Plotting average read and write operation size by ASM disk for Oracle
Throughput, throughput, throughput – for many databases, this is the performance measure of importance. When you are working with a fixed number of IOPS but see mixed workload types, system health...
Is the Tax Code the longest Title?
Last week, I shared that Dan Katz and I had finally published a draft of our paper, Measuring the Complexity of the Law: The U.S. Code. We’d previewed this research on Computational Legal Studies...
ipython notebook for R: Quickstart for Ubuntu
If you’re like me, you love ipython notebook but often write R. RStudio’s integrated RMarkdown is nice, but for some contexts like quick demos or basic training, a browser-based interface is...
Retrieving the VIX term structure in R
Much of my time lately has gone into analyzing and trading products in the volatility complex. As a result, I regularly watch the VIX term structure for continuations or deviations from trend. To...
Connecting R to an Oracle database with RJDBC
In many circumstances, you might want to connect R directly to a database to store and retrieve data. If the source database is an Oracle database, you have a number of options: ROracle RODBC RJDBC...
Summary of community detection algorithms in igraph 0.6
Based on Launchpad traffic and mailing list responses, Gabor and Tamas will soon be releasing igraph 0.6. In celebration, I’ll be publishing a number of helpful lists and tables I’ve put together to...
Wordcloud of the Arizona et al. v. United States opinion
Here’s one purely for fun – a wordcloud built from the Supreme Court’s opinion on Arizona et al. v. United States. Word clouds, though certainly not the most scientific of visualization techniques, are...
Debugging parameter mismatch across RAC database instances with R, dba_hist,...
Much of this morning went into investigating strange ADDM reports on a two-node Oracle RAC database. For...