2017 Meeting Schedule
CALL FOR SPEAKERS: If you are interested in speaking at one of our 2017 meetings, please let us know here.
Next meeting is on Oct. 19, 2017. Please register on Eventbrite.
321 South Inverness Drive
Englewood, CO 80112
We are excited to have two great presentations from local professionals in the Denver community. We have the unique opportunity to hear from individuals who have lived through highly relevant local projects.
Title: Implementing Metadata at the DOI
The Department of the Interior (DOI) is a Federal agency with over 90,000 employees across 10 Bureaus and 8 Agency Offices. Its primary mission is to protect and manage the Nation’s natural resources and cultural heritage; provide scientific and other information about those resources; and honor its trust responsibilities or special commitments to American Indians, Alaska Natives, and affiliated island communities.
Documenting data through metadata is essential in realizing the value of data as an enterprise asset. The completeness, consistency, and timeliness of metadata affect users’ ability to search for and discover the most relevant data for the intended purpose, and facilitate the interoperability and usability of these data among DOI bureaus and offices. Fully documented metadata describe and enhance data usability, quality, accuracy, provenance, and meaning.
The Department has organized a committee of Bureau-level points of contact to collaborate on the development of more consistent, standardized, and effective metadata management practices and guidance to support this shared mission and the information needs of the Department.
This presentation will discuss the roles and responsibilities, and the various processes and actions, that are important to DOI’s approach to metadata implementation across its three implementation phases.
Ray Obuch has been with the USGS Energy Program since 1985, supporting energy-related data and databases that are used in researching and assessing United States and international oil and gas resources. Ray is a USGS subject matter expert with regard to Oracle and layered database products such as ESRI’s ArcSDE and the geodatabase. Ray has also worked at the USGS Bureau level in support of the Open Data Initiative since 2009 and is a USGS representative on the Department of the Interior’s Data Service Team, which supports enterprise data management.
Title: Data Warehousing Implementation from Scratch
Building an enterprise-level data warehouse continues to be a challenging effort, and many fail to achieve their intended objectives. Sakthi Rangarajan had an opportunity to conceive, model, build and sustain an Enterprise Data Warehouse from scratch for a rental fleet management company. This presentation discusses the experience, processes followed and processes missed, as well as the critical challenges faced and lessons learned.
Sakthi Rangarajan is a Data Warehouse Architect with Infosol Inc. and has been part of many successful data warehousing implementations. He is passionate about data modeling, reporting and analytics. Before joining Infosol, he worked for MillerCoors, Accenture and Newmont Mining Corporation. Sakthi has a bachelor’s degree in engineering and is currently pursuing a master’s degree in technology management from the University of Denver.
North Office Building (NOB): Room 1194
311 W. 10th St.
Golden, CO 80401
Title: Predictive Analytics: A Statistical Primer
Predictive Analytics is complementary to familiar descriptive analytics (OLAP), providing the business with not just a postmortem of what has happened but a reliable prediction of what will happen. Predictive Analytics uses statistical methods to build powerful forecasting models whose value is dramatic.
This presentation will introduce the concept of Predictive Analytics, illustrate the value with some industry examples, and demonstrate statistical techniques such as multivariable regression and principal component analysis that are central to the science of forecasting. The presentation will also point out some common pitfalls and offer advice on getting started with PA.
BIO: Bob Conway has over twenty years of experience planning, designing and building integrated data warehouse/business intelligence solutions on various platforms and tools. His broad industry experience ranges from manufacturing, retail, financial services and telecommunications to oil and gas, automotive, healthcare and marketing. He developed the robust RAPID Architecture and Methodology and teaches workshops on this and other data management topics. He was an adjunct instructor at several graduate-level institutions and a frequent speaker at local and national conferences. His presentation style is entertaining and informative.
Second Featured Speaker: Lowell Fryman, DAMA Rocky Mountain Chapter VP of Education (Note: this is a change from the original agenda. The original speakers will be unable to attend the 04/20/2017 event.)
Title: Governing Hadoop and the Data Lake
There is a surprising variety of approaches and vendor solutions in the class of Hadoop architecture products. Yet, all of those solutions have some things in common. They are providing the enterprise with valuable data assets that we need to maximize. This seminar will discuss many alternatives for applying Data Governance processes and techniques to the data we have in the Hadoop platforms and Data Lakes.
Big Data is essential in conducting business in the digital age. Data Leaders must address the governance and processes necessary to understand and integrate data from the Hadoop clusters with their existing structured data for effective business management and analytics. Traditional data gathering, integration, and preparation can consume an estimated 80 to 90% of a data scientist’s development efforts.
This seminar will discuss methods and techniques for governance of data from Hadoop platforms to increase the speed of access and improve the quality of our analytics.
Attendees will learn:
- The alternative Hadoop architectures and the data usage challenges they pose
- Governance practices for identifying critical data and opportunities using this data
- Value of a catalog and data dictionary and techniques to find data
- The data preparation, integration and usage controls for Big Data
- Applying security, privacy and compliance usage techniques
9200 East Mineral Avenue
Centennial, CO 80112
Title: Data Masking in a Target Rich Environment
In this presentation, we will discuss the data masking implementation and strategy for FirstBank, the second-largest privately held bank in the US. We will discuss our approach and options available, from Dynamic vs. Static Data Masking to full Test Data Generation. We will discuss the perils and pitfalls of implementing masking, and its impact on the development and business culture. We will also go into the risk vs. reward discussions that helped guide what data we masked and the initiatives that required us to mask data in the first place. The project was long and arduous, with many stops and starts, but overall it ended up as a resounding success. Our goal is to discuss our approach and ways to focus the effort and implement it using a smaller scope and better-defined releases, so that you can implement data masking in much less time.
Second Speakers: Rama Chiluka & Shannon Prince of IHS Markit
Title: Realizing a Global View of Customer Data for B2B
Rama Chiluka, Data Architect, and Shannon Prince, Director Customer & Company Data, IHS Markit, will share their experience, knowledge, and lessons learned in a real case study about managing and mastering customer data for a global B2B company. IHS Markit (then IHS) went public in 2005 and grew through aggressive acquisitions (71 since 2004) leading up to the 2016 merger of equals with Markit. As a global company with multiple business lines serving customers in major industries and governments, the customer data from disparate systems needed to be integrated to provide meaningful information for critical decision making. After several years and significant investments in a customer data integration hub and ERP and CRM systems, the company was still challenged to maintain a global view of customer activity.
This presentation will cover:
- Intricacies and complexities of mastering B2B customer data in a highly acquisitive, multiple business line, global business environment
- A strategy to bring together ERP, CRM, MDM and Reference Data using governance, technology, people and processes
- How to use fuzzy matching algorithms in an MDM solution to build a crosswalk of customer data across disparate systems, plus other use cases
- How to manage and master company reference data and company hierarchies to enhance customer data
- Examples of using mastered customer data to enable sales, customer satisfaction, marketing, and product management operations and strategy
- A review of technologies and methods to deliver a global view of the customer from complex company data
- Plans for the future… merging two large companies and migrating to a new MDM solution
BLM Lakewood Federal Center
6th Avenue & Kipling St
Lakewood, CO 80225
Title: Agile Data Warehousing: It Should Not Be Scary
As we move more and more towards the need for everyone to do Agile Data Warehousing, we need a data modeling method that can be agile with us. Data Vault Data Modeling is an agile data modeling technique for designing highly flexible, scalable, and adaptable data structures for enterprise data warehouse repositories. It is a hybrid approach using the best of 3NF and dimensional modeling. It is not a replacement for star schema data marts (and should not be used as such). This approach has been used in projects around the world (Europe, Australia, USA) for over 10 years but is still not widely known or understood. The purpose of this presentation is to provide attendees with an introduction to the components of the Data Vault Data Model, what they are for and how to build them. The examples will give attendees the basics:
- What the basic components of a DV model are
- How to design and build structures incrementally, without constant refactoring
Title: Building an Agile Metadata Strategy: It’s not as Scary as You Think
According to a recent DATAVERSITY survey, over 80% of respondents agreed that metadata is as important as ever, if not more so. Yet with the rapidly changing technical landscape, many historic metadata strategies can seem archaic and out of date. With the rise of new technologies and approaches such as Big Data, graph databases, agile development, and data lakes, it can seem a daunting task to manage metadata in this new environment. Luckily, metadata management technologies and approaches are rapidly innovating as well, and there is a vast array of solutions to help you with your metadata journey. Join Donna Burbank, past president & advisor of the DAMA Rocky Mountain Chapter, as she discusses modern approaches to metadata management.