IBM MDM Interview Questions and Answers

IBM MDM Interview Questions and Answers

Last updated on 16th Oct 2020, Blog, Interview Question

About author

Prakash (Sr Manager - Director Level )

He is Possessing 7+ Years Of Experience in IBM. His Passion lies in Developing Entrepreneurs & Activities. Also, Rendered his intelligence to the Enthusiastic JOB Seekers.

(5.0) | 14512 Ratings 4498

In today’s scenario, INFORMATICA has achieved the tag of a most demanding product across the globe. Its products were newly introduced but they became popular within a short time period.

Over the years, INFORMATICA has been acting as a leader in the technology field, Data Integration. Based on ETL (Extract, Transfer, Load) architecture, this data integration tool has several products that focus on providing services for government organizations, financial & insurance companies, healthcare, and several other businesses.

1. What is MDM?

Ans:

Master data management (MDM) is a comprehensive method of enabling an enterprise to link all of its critical data to one file, called a master file, that provides a common point of reference. When properly done, MDM streamlines data sharing among personnel and departments.

2. Define Dimensional Modeling?

Ans:

There are two types of table involved in Dimensional Modeling and this model concept is different from the third normal form. Dimensional data model concept makes use of facts table containing the measurements of the business and dimension table containing the measurement context.

3. What are the data movement modes in Informatica?

Ans:

A data movement mode determines how the power center server handles the character data. We choose the data movement in the Informatica server configuration settings. Two types of data movement modes available in Informatica.

  • ASCII mode
  • Unicode mode

4. Are you seeing awareness translate into bigger budgets for MDM?

Ans:

It’s a matter of awareness and the problem becoming urgent. We are seeing budgets increased and greater success in closing deals, particularly in the Pharmaceutical and Financial services industries. Forrester predicts MDM will be $6 billion markets by 2010, which is a 60 percent growth rate over the $1 billion MDM market last year. Gartner forecasted that 70 percent of Global 2000 companies will have an MDM solution by the year 2010. These are pretty big numbers. 

5. What are the different ways to migrate from one environment to another in Informatica?

Ans:

  • We can export repository and import into the new environment
  • We can use Informatica deployment groups
  • We can Copy folders/objects
  • We can Export each mapping to XML and import in a new environment

6. What are the fundamental stages of Data Warehousing?

Ans:

Offline Operational Databases – Data warehouses in this initial stage are developed by simply copying the database of an operational system to an off-line server where the processing load of reporting does not impact on the operational system’s performance.

Offline Data Warehouse – Data warehouses in this stage of evolution are updated on a regular time cycle (usually daily, weekly or monthly) from the operational systems and the data is stored in an integrated reporting-oriented data structure.

Real-Time Data Warehouse – Data warehouses at this stage are updated on a transaction or event basis, every time an operational system performs a transaction (e.g. an order or a delivery or a booking, etc.)

Integrated Data Warehouse – Data warehouses at this stage are used to generate activity or transactions that are passed back into the operational systems for use in the daily activity of the organization.

 

7. Explain Transformation?

Ans:

It is a repository object that helps in generating, modifying or passing data. In a mapping, transformations make a representation of the operations integrated with service performs on the data. All the data goes by transformation ports that are only linked with maple or mapping.

8. Describe the foreign key columns in fact table and dimension table?

Ans:

Foreign keys of dimension tables are the primary keys of entity tables.

Foreign keys of facts tables are the primary keys of dimension tables.

9. What is a mapplet?

Ans:

A Mapplet is a reusable object that contains a set of transformations and enables to reuse that transformation logic in multiple mappings.

10. What are the Different methods of loading Dimension tables?

Ans:

There are two different ways to load data in dimension tables.

Conventional (Slow) – All the constraints and keys are validated against the data before, it is loaded; this way data integrity is maintained.

Direct (Fast) – All the constraints and keys are disabled before the data is loaded. Once data is loaded, it is validated against all the constraints and keys. If data is found invalid or dirty it is not included in the index and all future processes are skipped on this data. 

11. Define Informatica Power Center?

Ans:

Designed by Informatica Corporation, it is data integration software providing an environment that lets data loading into a centralized location like a data warehouse. From here, data can be easily extracted from an array of sources, also can be transformed as per the business logic and then can be easily loaded into files as well as relation targets.

12. What the term MDM means?

Ans:

MDM stands for Master Data Management. It is a comprehensive method used to enable an enterprise for linking all of its critical data to a single file also known as a master file, providing a common point of reference. When done in a proper manner, MDM helps in streamlining the process of data sharing among departments and personnel.

13. Describe the parallel degree of data loading properties in MDM?

Ans:

This specifies the parallelism’s degree that is set upon the base object table as well as its related tables. Although it doesn’t occur for all batch processes, it can have a positive consequence on performance once it’s used. Nevertheless, its use is restricted by the number of CPUs on the database server machine along with the amount of available memory. 1 is the default value.

Subscribe For Free Demo

Error: Contact form not found.

14. Has MDM gone mainstream? Do people “get it?

Ans:

There is a huge awareness of MDM. Gartner recently hosted an MDM conference for the first time [piggy-backing on its CRM conference], and they pulled in about 500 attendees.

As to whether they “get it,” it depends on who you’re talking to. Most of the IT people get it. Business users understand the moniker, but they might or might not understand MDM quite as well. I find that business users often require education in terms of what it can do for them and what value it brings. With IT people, it’s a different conversation; they want to know more about the features and how we differentiate ourselves from the competition.

15. What are the biggest technical and management challenges in adopting MDM?

Ans:

Technical folks often have a challenge in data governance in selling the project and getting the funding. Management is looking for a return on investment; they need MDM tied to quantifiable benefits that business leaders understand, like dollar amounts around ROI.

16. Describe various fundamental stages of Data Warehousing?

Ans:

There are various fundamental stages of Data warehousing. They are:

1. Offline Operational Databases: This is the first stage in which data warehouses are developed simply by copying operational system database to an offline server where the dealing out a load of reporting not put any impact on the performance of the operational system.

2. Offline Data Warehouse: In this stage of development, data warehouses are updated on a regular basis from the operational systems. Plus, all the data is stored in an incorporated reporting-oriented data structure.

3. Real-Time Data Warehouse: During this stage, data warehouses are updated on an event or transaction basis. Also, an operational system executes a transaction every time.

4. Integrated Data Warehouse: This is the last stage where data warehouses are used for generating transactions or activity passing back into the operating system for the purpose of use in an organization’s daily activity.

17. What is Dimensional Modeling?

Ans:

Dimensional data model concept involves two types of tables and it is different from the third normal form. This concept uses Facts table which contains the measurements of the business and Dimension table which contains the context (dimension of calculation) of the measurements.

18. What is Transformation?

Ans:

A transformation is a repository object that generates, modifies or passes data. Transformations in a mapping represent the operations the Integration Service performs on the data. Data passes through transformation ports that are linked in a mapping or mapplet.

19. Name various components of Informatica Power Center. ?

Ans:

There are various components of Informatica PowerCenter. They are as follows:

  • PowerCenter Repository
  • PowerCenter Domain
  • PowerCenter Client
  • Administration Console
  • Integration Service
  • Repository Service
  • Data Analyser
  • Web Services Hub
  • PowerCenter Repository Reports
  • Metadata Manager

20. What is Data Mining?

Ans:

Data Mining is the process of analyzing data from different perspectives and summarizing it into useful information.

21. What is a different type of repositories that can be created using the Informatica Repository Manager?

Ans:

Standalone Repository:

A repository which functions individually and is unrelated to any other repositories.

Global Repository :

This is a centralized repository in a domain. This repository can contain shared objects across the repositories in a domain. The objects are shared through global shortcuts.

Local Repository:

The local repository is within a domain. The local repository can connect to a global repository using global shortcuts and can use objects in it’s shared folders.

22. Describe various repositories that can be generated using Informatica Repository Manager.?

Ans:

There are various repositories that can be formed with the help of the Informatica Repository Manager. They are as follows:

  • Standalone Repository: It is a repository functioning individually as well as is not related to any other repositories.
  • Local Repository: This repository functions within a domain. It is able to connect to a global repository with the help of global shortcuts. Also, it can make use of objects in their shared folders.
  • Global Repository: This repository works as a centralized repository in a domain. It contains shared objects crossways the repositories in a domain.

23. What is Informatica PowerCenter?

Ans:

Powercenter is data integration software of Informatica Corporation which provides an environment that allows loading data into a centralized location such as data warehouse. Data can be extracted from multiple sources can be transformed according to the business logic and can be loaded into files and relation targets.

24. What is the Dimension table?

Ans:

The dimensional table contains textual attributes of measurements stored in the facts tables. The dimensional table is a collection of hierarchies, categories, and logic which can be used for the user to traverse in hierarchy nodes.

25. What is the expiration module of automatic lock-in Informatica MDM?

Ans:

In every 60 seconds, the hub console is refreshed in the current connection. A lock can be released manually by a user. In case the user switches to another database while having a hold of a lock, then the lock will be released automatically. In case the hub console is terminated by the user, then the lock will be expired after a minute.

26. Define Data Warehousing?

Ans:

It is the main depot of an organization’s historical data and its corporate memory, containing the raw material for the decision support system of management. What lead to the use of data warehousing is that it allows a data analyst to execute complex queried and analysis like data mining on the info without making any slow in an operational system. Collection of data in Data warehousing is planned for supporting decision making of the management. These warehouses contain an array of data presenting a coherent image of business conditions in time at a single point. Data Warehousing is a repository of information that is available for analysis and query.

Course Curriculum

Get Practical Oriented IBM MDM Training to Meet the Industry Needs

  • Instructor-led Sessions
  • Real-life Case Studies
  • Assignments
Explore Curriculum

27. What is the difference between the mapping parameter and variable?

Ans:

A Mapping Parameter is a static value that you define before running the session and its value remains until the end of the session. When we run the session PowerCenter evaluates the value from the parameter and retains the same value throughout the session. When the session run again it reads from the file for its value.

A Mapping Variable is dynamic or changes anytime during the session. PowerCenter reads the initial value of the variable before the start of the session and changes its value by using variable functions and before ending the session its saves the current value (last value held by the variable). Next time when the session runs the variable value is the last saved value in the previous session.

28. What is Fact table?

Ans:

Fact table contains measurements of business processes also fact table contains the foreign keys for the dimension tables. For example, if your business process is “paper production” then “average production of paper by one machine” or “weekly production of paper” would be considered as a measurement of the business process.

29. Differentiate between variable and mapping parameter?

Ans:

A Mapping variable is dynamic, i.e. it can vary anytime throughout the session. The variable’s initial value before the starting of the session is read by PowerCenter, which makes use of variable functions to change the value. And before the session ends, it saves the current value. However, the last value is held by the variable itself. Next time when the session runs, the value of the variable is the last saved value in the previous session.

A Mapping parameter is a static value, defined by you before the session starts and the value remains the same until the end of the session. Once the session runs, PowerCenter evaluates the parameter’s value and retains the same value during the entire session. Next time, when the session runs, it reads the value from the file.

30. What is OLTP & OLAP?

Ans:

OLTP is an abbreviation of Online Transaction Processing. This system is an application that modifies data the instance it receives and has a large number of concurrent users.

OLAP is an abbreviation of Online Analytical Processing. This system is an application that collects, manages, processes and presents multidimensional data for analysis and management purposes.

31. What is the different type of repositories that can be created using the Informatica Repository Manager?

Ans:

  • Standalone Repository – A repository which functions individually and is unrelated to any other repositories.
  • Global Repository – A centralized repository in a domain. This repository can contain shared objects across the repositories in a domain. The objects are shared through global shortcuts.
  • Local Repository – A Local repository is within a domain. The local repository can connect to a global repository using global shortcuts and can use objects in its shared folders.

32. What are the objects that you can’t use in a mapplet?

Ans:

  • COBOL source definition
  • Joiner transformations
  • Normalizer transformations
  • Non reusable sequence generator transformations.
  • Pre or post-session stored procedures
  • Target definitions
  • Power mart 3.5 styles Look Up functions
  • XML source definitions
  • IBM MQ source definitions

33. What is OLAP?

Ans:

OLAP is an abbreviation of Online Analytical Processing. This system is an application that collects, manages, processes and presents multidimensional data for analysis and management purposes.

34. Describe all the biggest management and technical challenges in adopting MDM?

Ans:

There is always a challenge for technical folks in data governance to sell the project and get the fund. There is always a look for ROI by management. They require MDM knotted to quantifiable benefits that are considered by business leaders such as dollar amounts around ROI.

35. Define Data Mining?

Ans:

It is a process that helps in analyzing data from several perspectives and also allows summarizing it into helpful information.

36. Name various data movement modes in Informatica?

Ans:

A data movement mode helps in determining how power center server takes care of the character data. Data movement is selected in the Informatica server configuration settings. There are two different data movement modes available in Informatica. They are:

  • Unicode Mode and ASCII Mode
  • Explain OLAP.
  • OLAP stands for Online Analytical Processing. It processes as an app helps that gathers, manages, presents and processes multidimensional data for management and analysis purposes.

37. How to delete duplicate record in Informatica?

Ans:

Following are ways to remove duplicate records

  • In the source, qualifier use select distinctly
  • Use Aggregator and group by all fields
  • Override SQL query in Source qualifier

38. Name the tables that are linked with staging data in Informatica MDM?

Ans:

There are various tables that are linked with staging data in Informatica MDM. They are:

  • Landing Table
  • Raw Table
  • Rejects Table
  • Staging Table
Course Curriculum

Best IBM MDM Certification Course from Real-Time Experts

Weekday / Weekend BatchesSee Batch Details

39. Explain various types of LOCK used in Informatica MDM 10.1?

Ans:

Two types of LOCK are used in Informatica MDM 10.1. They are:

  • Exclusive Lock: Letting just one user make alterations to the underlying operational reference store.
  • Write Lock: Letting multiple users make amendments to the underlying metadata at the same time.

40. Define different ways used in Informatica to migrate from one environment to another?

Ans:

  • A repository can be imported and exported to the new environment
  • Informatica deployment groups can be used
  • Folders/objects can be copied
  • Each mapping can be exported to XML and then be imported in the new environment. 

41.How popular is MDM across the people associated with different fields?

Ans:

People associated with IT and Computers get the concept easily. Business official understands MDM but not up to the level of former because businessmen are more interested in only knowing what benefit MDM can bring to their group. While IT and Computer people, know more about the features of MDM.

42. What is Data Warehousing?

Ans:

A Data Warehouse is the main repository of an organization’s historical data, its corporate memory. It contains the raw material for management’s decision support system. The critical factor leading to the use of a data warehouse is that a Data Analyst can perform complex queries and analysis, such as data mining, on the information without slowing down the operational systems. Data warehousing collection of data designed to support management decision-making. Data warehouses contain a wide variety of data that present a coherent picture of business conditions at a single point in time. It is a repository of integrated information, available for queries and analysis.

43. What are fundamental stages of Data Warehousing?

Ans:

  • Offline Operational Databases – Data warehouses in this initial stage are developed by simply copying the database of an operational system to an off-line server where the processing load of reporting does not impact on the operational system’s performance.
  • Offline Data Warehouse – Data warehouses in this stage of evolution are updated on a regular time cycle (usually daily, weekly or monthly) from the operational systems and the data is stored in an integrated reporting-oriented data structure.
  • Real Time Data Warehouse – Data warehouses at this stage are updated on a transaction or event basis, every time an operational system performs a transaction (e.g. an order or a delivery or a booking etc.)
  • Integrated Data Warehouse – Data warehouses at this stage are used to generate activity or transactions that are passed back into the operational systems for use in the daily activity of the organization.

44. What are the components of Informatica PowerCenter?

Ans:

Following are the various components of Informatica PowerCenter,

  • PowerCentre Domain
  • PowerCenter Repository
  • Administration Console
  • PowerCenter Client
  • Repository Service
  • Integration service
  • Web Services Hub
  • Data Analyzer
  • Metadata Manager
  • PowerCenter Repository Reports

45. What is Mapping?

Ans:

A mapping is a set of source and target definitions linked by transformation objects that define the rules for data transformation. Mappings represent the data flow between sources and targets

46. What is a Fact table?

Ans:

The fact table contains measurements of business processes and the fact table contains the foreign keys for the dimension tables. For example, if your business process is “paper production” then “average production of paper by one machine” or “weekly production of paper” would be considered as a measurement of the business process.

47. What is the difference between mapping parameter and variable?

Ans:

A Mapping Parameter is a static value that you define before running the session and its value remains until the end of the session. When we run the session PowerCenter evaluates the value from the parameter and retains the same value throughout the session. When the session runs again it reads from the file for its value.

A Mapping Variable is dynamic or changes anytime during the session. PowerCenter reads the initial value of the variable before the start of the session and changes its value by using variable functions and before ending the session it saves the current value (last value held by the variable). Next time when the session runs the variable value is the last saved value in the previous session.

48. How to delete a duplicate record in Informatica?

Ans:

Following are ways to remove duplicate records

  • In source, qualifier use select distinctly
  • Use Aggregator and group by all fields
  • Override SQL query in Source qualifier

49. What is the different type of repositories that can be created using Informatica Repository Manager?

Ans:

  • Standalone Repository – A repository which functions individually and is unrelated to any other repositories.
  • Global Repository – A centralized repository in a domain. This repository can contain shared objects across the repositories in a domain. The objects are shared through global shortcuts.
  • Local Repository – A Local repository is within a domain. Local repositories can connect to a global repository using global shortcuts and can use objects in its shared folders.
Data Warehouse Sample Resumes! Download & Edit, Get Noticed by Top Employers! Download

50. How to find all invalid mappings in a folder?

Ans:

Use the following query

51.Why is it often stated that a Data Warehouse is the main repository of an organization’s historical data?

Ans:

Management’s decision support system is served with the raw source by the Data Warehouses. The use of Data warehouse becomes essential because a Data Analyst can perform complex queries and analysis like data mining which makes use of a data warehouse. At a single point in time, we are able to present a clear image of business conditions with the help of Data warehousing which otherwise contains a wide variety of data.

Are you looking training with Right Jobs?

Contact Us

Popular Courses