Microsoft Exam

Similar documents
Python Machine Learning

Training Catalogue for ACOs Global Learning Services V1.2. amadeus.com

CS Machine Learning

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Ericsson Wallet Platform (EWP) 3.0 Training Programs. Catalog of Course Descriptions

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

M55205-Mastering Microsoft Project 2016

CREATING SHARABLE LEARNING OBJECTS FROM EXISTING DIGITAL COURSE CONTENT

Assignment 1: Predicting Amazon Review Ratings

Introduction to Causal Inference. Problem Set 1. Required Problems

Getting Started Guide

Student User s Guide to the Project Integration Management Simulation. Based on the PMBOK Guide - 5 th edition

ecampus Basics Overview

On-Line Data Analytics

Answer each question by placing an X over the appropriate answer. Select only one answer for each question.

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

The Moodle and joule 2 Teacher Toolkit

Intel-powered Classmate PC. SMART Response* Training Foils. Version 2.0

Aviation English Solutions

Linking Task: Identifying authors and book titles in verbose queries

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

A Framework for Customizable Generation of Hypertext Presentations

CS 446: Machine Learning

Visit us at:

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Driving Author Engagement through IEEE Collabratec

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

LOS ANGELES CITY COLLEGE (LACC) ALTERNATE MEDIA PRODUCTION POLICY EQUAL ACCESS TO INSTRUCTIONAL AND COLLEGE WIDE INFORMATION

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Postprint.

Hard Drive 60 GB RAM 4 GB Graphics High powered graphics Input Power /1/50/60

Connect Microbiology. Training Guide

Learning to Think Mathematically with the Rekenrek Supplemental Activities

Closing out the School Year for Teachers and Administrators Spring PANC Conference Wrightsville Beach April 7-9, 2014

Lecture 1: Machine Learning Basics

Human Emotion Recognition From Speech

Blackboard Communication Tools

Modeling function word errors in DNN-HMM based LVCSR systems

TeacherPlus Gradebook HTML5 Guide LEARN OUR SOFTWARE STEP BY STEP

Atlanta Police Study Guide

Circuit Simulators: A Revolutionary E-Learning Platform

Moodle MyFeedback update April 2017

COURSE LISTING. Courses Listed. Training for Cloud with SAP SuccessFactors in Integration. 23 November 2017 (08:13 GMT) Beginner.

Modeling function word errors in DNN-HMM based LVCSR systems

Pod Assignment Guide

Mathematics Scoring Guide for Sample Test 2005

Pre-Algebra A. Syllabus. Course Overview. Course Goals. General Skills. Credit Value

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Minitab Tutorial (Version 17+)

Learning Methods in Multilingual Speech Recognition

A Cost-Effective Cloud Service for E-Learning Video on Demand

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Online Marking of Essay-type Assignments

Strategy and Design of ICT Services

learning collegiate assessment]

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

A virtual surveying fieldcourse for traversing

Go Global with Fisher

Radius STEM Readiness TM

Patterns for Adaptive Web-based Educational Systems

Outreach Connect User Manual

The stages of event extraction

EXECUTIVE SUMMARY. Online courses for credit recovery in high schools: Effectiveness and promising practices. April 2017

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Office of Planning and Budgets. Provost Market for Fiscal Year Resource Guide

Using Moodle in ESOL Writing Classes

IMPROVE THE QUALITY OF WELDING

Truth Inference in Crowdsourcing: Is the Problem Solved?

Mining Association Rules in Student s Assessment Data

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

A Neural Network GUI Tested on Text-To-Phoneme Mapping

The Global Economic Education Alliance

Model Ensemble for Click Prediction in Bing Search Ads

Interaction Design Considerations for an Aircraft Carrier Deck Agent-based Simulation

Calibration of Confidence Measures in Speech Recognition

Best Practices in Internet Ministry Released November 7, 2008

Urban Analysis Exercise: GIS, Residential Development and Service Availability in Hillsborough County, Florida

Implementing a tool to Support KAOS-Beta Process Model Using EPF

Ricopili: Postimputation Module. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Helping your child succeed: The SSIS elementary curriculum

Education for an Information Age

Learning From the Past with Experiment Databases

Contra Costa College: HBCU Tour 2017 Due by Monday, January 9, Transfer Center SAB 227

Committee Member Responsibilities

SYLLABUS- ACCOUNTING 5250: Advanced Auditing (SPRING 2017)

Technology Plan Woodford County Versailles, Kentucky

TEACHING IN THE TECH-LAB USING THE SOFTWARE FACTORY METHOD *

Indian Institute of Technology, Kanpur

Automating the E-learning Personalization

Houghton Mifflin Online Assessment System Walkthrough Guide

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

KIS MYP Humanities Research Journal

Section 3.4. Logframe Module. This module will help you understand and use the logical framework in project design and proposal writing.

Syllabus of the Course Skills for the Tourism Industry

Transcription:

Volume: 37 Questions Question: 1 You are building an Azure Machine Learning Solution for an Online retailer. When a customer selects a product, you need to recommend products that the customer might like to purchase at the same time. The recommendation should be based on what other customers purchased the same product. Which model should you use? A. Collaborative Filtering B. Boosted Decision Tree Regression Model C. Two-Class boosted decision tree D. K-Means Clustering Question: 2 You are analyzing taxi trips in New York City. You leverage the Azure Data Factory to create data pipelines and to orchestrate data movement. You plan to develop a predictive model for 170 million rows (37 GB) of raw data in Apache Hive by using Microsoft R Serve to identify which factors contributes to the passenger tipping behavior. All of the platforms that are used for the analysis are the same. Each worker node has eight processor cores and 28 GB Of memory. Which type of Azure HDInsight cluster should you use to produce results as quickly as possible? A. Hadoop B. HBase C. Interactive Hive D. Spark Question: 3

A Travel agency named Margie s Travel sells airline tickets to customers in the United States. Margie s Travel wants you to provide insights and predictions on flight delays. The agency is considering implementing a system that will communicate to its customers as the flight departure near about possible delays due to weather conditions. The flight data contains the following attributes: * DepartureDate: The departure date aggregated at a per hour granularity. * Carrier: The code assigned by the IATA and commonly used to identify a carrier. * OriginAirportID: An identification number assigned by the USDOT to identify a unique airport (the flight s Origin) * DestAirportID: The departure delay in minutes. * DepDet30: A Boolean value indicating whether the departure was delayed by 30 minutes or more ( a value of 1 indicates that the departure was delayed by 30 minutes or more) The weather data contains the following Attributes: AirportID, ReadingDate (YYYY/MM/DDHH), SKYConditionVisibility, WeatherType, Windspeed, StationPressure, PressureChange and HourlyPrecip. You plan to predict flight delays that are 30 minutes or more. You need to build a training model that accurately fits the data. The solution must minimize over fitting and minimize data leakage. Which attribute should you remove? A. OriginAirportID B. DepDel C. DepDel30 D. Carrier E. DestAirportID Answer: B Question: 4 You are working on an Azure Machine Learning Experiment. You have the dataset configured as shown in the following table: You need to ensure that you can compare the performance of the models and add annotations to the results. You connect the Score Model modules from each trained model as inputs for the Evaluate Model module, and then save the result as a dataset.

Question: 5 You are working on an Azure Machine Learning Experiment. You have the dataset configured as shown in the following table: You need to ensure that you can compare the performance of the models and add annotations to the results. You save the output of the Score Model modules as a combined set, and then use the Project Columns modules to select the MAE. Question: 6 You are building an Azure Machine Learning experiment. You need to transform a string column into a label column for a Multiclass Decision Jungle module. Which module should you use? A. Select Columns Transform B. Group Categorical Values C. Convert to Indicator Values D. Edit Metadata

Answer: C Question: 7 DRAG DROP A Travel agency named Margie s Travel sells airline tickets to customers in the United States. Margie s Travel wants you to provide insights and predictions on flight delays. The agency is considering implementing a system that will communicate to its customers as the flight departure near about possible delays due to weather conditions. The flight data contains the following attributes: * DepartureDate: The departure date aggregated at a per hour granularity. * Carrier: The code assigned by the IATA and commonly used to identify a carrier. * OriginAirportID: An identification number assigned by the USDOT to identify a unique airport (the flight s Origin) * DestAirportID: The departure delay in minutes. * DepDet30: A Boolean value indicating whether the departure was delayed by 30 minutes or more ( a value of 1 indicates that the departure was delayed by 30 minutes or more) The weather data contains the following Attributes: AirportID, ReadingDate (YYYY/MM/DDHH), SKYConditionVisibility, WeatherType, Windspeed, StationPressure, PressureChange and HourlyPrecip. You need to remove the bias and to identify the columns in the input dataset that have the greatest predictive power. Which module should you use for each requirement? To answer drag the appropriate modules to the correct requirements. Answer:

Question: 8 You are designing an Azure Machine Learning workflow. You have a dataset that contains two million large digital photographs. You plan to detect the presence of trees in the photographs. You need to ensure that your model supports the following: * Hidden Layers that support a directed graph structure. * User-defined core components on the GPU You create a Machine Learning Experiment that implements the Multiclass Decision Jungle Module. Answer: B Question: 9 You plan to create a predictive analytics solution for credit risk assessment and fraud prediction in Azure Machine Learning. The Machine Learning workspace for the solution will be shared with other users in your organization. You will add assets to projects and conduct experiments in the workspace. The experiments will be used for training models that will be published to provide scoring from web services. The experiment tor fraud prediction will use Machine Learning modules and APIs to train the models and will predict probabilities in an Apache Hadoop ecosystem. You plan to configure the resources for part of a workflow that will be used to preprocess data from files stored in Azure Blob storage. You plan to use Python to preprocess and store the data in Hadoop. You need to get the data into Hadoop as quickly as possible. Which three actions should you perform? Each correct answer presents pan of the solution. NOTE: Each correct selection is worth one point. A. Create an Azure virtual machine (VM), and then configure MapReduce on the VM. B. Create an Azure HDInsight Hadoop cluster. C. Create an Azure virtual machine (VM), and then install an IPython Notebook server. D. Process the files by using Python to store the data to a Hadoop instance.