Geeks With Blogs
Josh Reuben OLAP / Data Mining
BigQuery QuickRef
Big Data keeps evolving. Stone Age Hadoop was alot of Java bolierplate for defining HDFS access, Mapper & Reducer. This was superceded by Bronze Age Spark, which provided a succint Scala unification of:ML pipelinesin-memory structured DataSets over RDDs via a SparkSession SQL APIDistributed Streams(Note: You can run such jobs easily in a dynamically scalable manner on Google Dataproc) Technology keeps evolving - the Big-Iron Age has arrived in the form of Google Cloud Platform's SPARK KILLER ......

Posted On Thursday, December 15, 2016 5:33 AM

Hive - HQL query over MapReduce
Overview Developed by Facebook HiveQL is a SQL-like framework for data warehousing on top of MapReduce over HDFS. converts SQL query into a series of jobs for execution on a Hadoop cluster. Organizes HDFS data into tables - attaching structure. Schema on Read Versus Schema on Write - doesn’t verify the data when it is loaded, but rather when a query is issued. full-table scans are the norm and a table update is achieved by transforming the data into a new table. HDFS does not provide in-place file ......

Posted On Tuesday, March 22, 2016 5:32 AM

Azure Futures - Distributed Computing and Number Crunching
"the biggest Azure customers today are the ones using HPC on-premises at the current time" - Orleans Framework for cloud computing - HPC on Azure - ......

Posted On Thursday, February 3, 2011 7:41 AM

SSAS DM Algorithms
· DM algorithms that come with SSAS o Decision Trees Algorithm: uses the values, or states, of the designated “input columns” to predict the states of the column that was designated as “predictable”. It identifies the attribute tree that best predicts the result. allows for interplay between attributes and provides a hierarchy of attribute definitions that can be used to take a decision. o Clustering Algorithm: grouping of the cases that contain similar characteristics. Identifies how the data forms ......

Posted On Monday, December 31, 2007 6:19 AM

SSAS BI Dev Studio OLAP Designers
The 2 main tools used in cube construction using BI Dev Studio are the Dimension Designer and the Cube Designer. To be effective at OLAP, you need to conceptualize your options - know what to do and where to do it. Dimension Designer - Use to edit various properties of an existing dimension, including the attributes, hierarchies, levels, and translations of the dimension, as well as to browse the data contained by the dimension. Contains 3 tabs: · Dimension Structure Tab - view and modify the attributes, ......

Posted On Monday, December 24, 2007 6:57 AM

Copyright © JoshReuben | Powered by: