PERSPECTIVE

Contextual Advertising for Consumer Banking

In 2017, Artificial Intelligence is one of the hottest topics in finance. Machine Learning is redefining processes in financial institutions and challenging some of the decade-old business models. Wealth management companies are using deep learning solutions for long-term value investments, and advisors are being replaced by chatbots, successfully covering up to 95% of…

Hive with LLAP for Fast Querying

One common use case in enterprise computing is repeated use of the same tables. Business queries tend to be heavy on operations like aggregation and focused on extracting a small amount of data out of a large data set. In recent years, most Hadoop customers have used a rudimentary approach of moving data from a Hadoop…

Sizing Apache Flume for Big Data Applications

Big data management applications for diverse businesses require the ability to collect, aggregate and move large amounts of data from many sources to a centralised data store. The increasing volume, velocity and variety of data from logs, click streams and sensors should all be handled similarly in a data lake architecture and other data architectures. Apache Flume is a distributed, highly reliable, and highly available system for meeting these needs. In this post, we discuss the case of si

Personal Assistants: The brain behind the voice

“Hey Siri! Remind me to submit this article tomorrow!” And indeed it did! Last year, Apple introduced hands-free Siri as an improvement to the previous version of its personal assistant, allowing you to interact with your phone using voice commands without actually touching it. The personal assistant is intelligently built to respond…

Visualizing Graphs in Apache Zeppelin using Neo4J

Graph models of data are invaluable tools to understand domains in the context of data analysis. In this post, we discuss one approach to merge the beautiful Zeppelin data analysis interface with graph models of domains, by using the D3 visualization library.

Behavioral Profiling in Banks with Cadence

We build practices, processes and experiences around our customers to deliver more value to them and to enhance their customer journey. But who are these customers? We might know the customers who generate the maximum profits and who are loyal to our services, but do we really “know” them well enough to understand their behavior and the reasons why they choose a particular service?

Effectively Sizing a NoSQL Database for Big Data Applications

The proliferation of unstructured data has made the collection and handling of large and varied data sets at scale a real challenge for organizations. Organizations often grapple with setting up and sizing NoSQL databases when managing unstructured data from organizational processes, systems and customers. We look at HBase, a modern NoSQL database that provides real-time read and write access to large datasets.

Understanding Independent and Identically Distributed Data

Background: Sensor data collected from various sources, be it from products or processes, can exhibit a tendency to vary over time. This is because sensors often measure system states over time, as data are continuously collected from these products and processes. There is business value in understanding whether such data will be suitable for aggregate…

Transforming Payments through Digitisation

In 1998, PayPal started as a firm that executed wireless payment transactions on a palm tablet. It would soon become a pioneer in the history of digital payments in the United States. Closer to home, PayTM’s recent TVC claims to be “the new way of life”. As per the ad, “Paytm karo” is the panacea for…

Advanced Sensing and Measurement Characterization for IoT Applications

Author’s Note: This is the second in a series of blog posts by the author about IoT sensor data and time series analysis. The previous post is here. The author will be speaking on these topics at Strata+Hadoop World 2016 in Singapore on December 8th 2016. For further details and to register for this event…