Installing Scientific Packages for Python3 on MacOS 10.9 Mavericks
I just went through some pain (again) when I wanted to install some of Python’s scientific libraries on my second Mac. I summarized the setup and…
I just went through some pain (again) when I wanted to install some of Python’s scientific libraries on my second Mac. I summarized the setup and…
After I wrote the initial teaser article “SQLite – Working with large data sets in Python effectively” about how awesome SQLite databases are via sqlite3 in…
In the previous article we have discussed about the Data Envelopment Analysis technique and we have seen how it can be used as an effective non-parametric ranking algorithm. In this blog post we will develop an implementation of Data Envelopment Analys…
Data Envelopment Analysis, also known as DEA, is a non-parametric method for performing frontier analysis. It uses linear programming to estimate the efficiency of multiple decision-making units and it is commonly used in production, management and eco…
This is a quickguide showing how to use OpenEye software command line tools to align target molecules to a query based on substructure matches and how to…
In this article we will discuss how you can build easily a simple Facebook Sentiment Analysis tool capable of classifying public posts (both from users and from pages) as positive, negative and neutral. We are going to use Facebook’s Graph API Search a…
In previous articles we have discussed the theoretical background of Naive Bayes Text Classifier and the importance of using Feature Selection techniques in Text Classification. In this article, we are going to put everything together and build a simpl…
Let’s be honest, code testing is everything but a joyful task. However, a good unit testing framework makes this process as smooth as possible. Eventually…
I received many questions from people who want to quickly visualize their data via heat maps – ideally as quickly as possible. This is the major issue of…
My new project confronted me with the task of screening a massive set of large data files in text format with billions of entries each. I will have to…