Datafactory intro

November 28, 2019


What needs and requirements we help to address

Whether already appling data science and machine learning to business process automation or starting data-driven transformation, your company’s needs are in some layers of the ‘ML layers pyramid’ below or even across the entire pyramid. Our DataFactory products and Learning Tracks training programs will likely match some of the business cases that you address in DS/ML projects.

Please read about the mission of our company.


DataFactory Azure

DataFactory Azure leverages the power of Azure cloud databases and apps beyond DataFactory Enterprise whilst shifting the paradigm from ownership to usership and keeping maintenance costs low. The data storage component is built upon the Common Data Model schema to handle data from enterprise data sources like CRM, ERP, etc. Data pipelines are powered by Azure Data Factory, customizable apps by Power Apps.

DataFactory Azure is a proper match if:

  • your company is in lending, insurance, retail and wants to leverage the power of machine learning to optimize operations
  • you want to use ML apps as a service without the hassle of building complex IT infrastructure or hiring a team of data scientists
  • there are multiple data sources from business applications like ERP and CRM that need to be put to good use in predictive modeling
  • you prefer to rely on industry-standard data storage solutions with no costly vendor lock-ins and want to avoid custom-developed DWH

DataFactory Enterprise

DataFactory Enterprise is an on-premises data factory that covers the entire data life cycle from raw data processing to model deployment inside SQL Server and its SSIS, SSAS and ML Server components.

It could be the right option if:

  • you prefer to handle data in-house in a traditional way
  • you have a team of SQL Server developers and DBAs to develop and maintain
  • you want existing data infrastructure to be less costly, more streamlined, cohesive and easy to administer
  • you seek reliable instruments to build data pipelines and deploy predictive models and would like to avoid building custom-built data pipelines in Python or Scala using open-source components

DataStore Community

DataStore Community, an open-source edition of DataFactory Enterprise, is a light-weight database based on the DataStore schema and populated with data from credit bureau reports.

  • you want to store data from credit report XML files in a designated data silo
  • you need to integrate data from multiple business applications into a unified data silo for DS/ML projects, and have the resources to try and test the DataStore for that purpose
  • you have the resources to develop your own data pipelines using Microsoft’s cloud and on-premises tools such as SSIS and Azure Data Factory.

Learning Tracks

We teach the same curriculum and use the same code that we use ourselves for developing our DataFactory products.