Lab Questions

Please record your answers to these questions according to the direction of your instructor. Questions are provided in multiple choice or open-ended format. Answers for instructors may be requested here.

Multiple Choice Questions

Lecture

Question
1 Data integration can be defined as...
  1. discovery, cleansing, monitoring, transforming and delivery of data from a variety of sources.
  2. using algebra to find the integral of a numeric attribute.
  3. comparing predicted data to observed data.
  4. using pivot tables/cross tabulation tables to find subtotals of grouped data.
2 What does ETL stand for?
  1. Enumerate, Translate, List
  2. Expand, Transform, Loop
  3. Extract, Transform, Load
  4. Exact Technical Loading
3 What distinguishes FME's ETL capabilities?
  1. Low computer memory requirements
  2. Multi-language support
  3. Required coding knowledge
  4. Ability to work with spatial data
4 In the terminology of FME, a translation...
  1. translates data from one format to another.
  2. translates scripts from one programming language to another.
  3. translates string data from one language to another.
  4. translates the geographic coordinates of data to a different coordinate system.
5 In the terminology of FME, a transformation...
  1. transforms data from one format to another.
  2. transforms data by moving it from one digital storage location to another.
  3. transforms data by showing it on a map.
  4. transforms data by changing the structure or content of the data.
6 The FME Data Inspector is a fully-featured GIS and cartography application. True or false?
  1. True
  2. False
7 What is the difference between structural and content data transformation?
  1. Structural transformation performs statistical analysis, while content transformation uploads data to a new location on the web.
  2. Structural transformation merges, divides, re-orders, or otherwise changes data structures, while content transformation changes the underlying values in a dataset.
  3. Structural transformation makes long datasets wide and vice versa, while content transformation adds metadata to the dataset.
  4. Structural transformation transforms the data into a different format, while content transformation performs spatial analysis on the data.

Lab

Question
8 In Exercise 1, which library has the highest circulation?
  1. Mount Pleasant
  2. Kitsilano
  3. Central Branch
  4. Firehall
9 In Exercise 3, what is the value of ZoneName for the westernmost Industrial zone? Hint: you can try using the Filter function (Tools > Filter Features) of Data Inspector to narrow down your choices.
  1. IC-1
  2. IC-2
  3. IC-3
  4. I-1
  5. I-2
  6. I-3
  7. M-1
  8. M-2
  9. M-3
  10. M-4
10 In Exercise 4, after editing the writer feature type schema, what is the type of the attribute VisitorCount?
  1. char
  2. integer
  3. smallint
  4. float
11 In Exercise 4, after editing the writer feature type schema, what is the type of the attribute ParkArea?
  1. char
  2. integer
  3. smallint
  4. float
12 Complete the Advanced Exercise for Exercise 5. Who might benefit if you include bookmarks and annotation in your workspace? Select all that apply.
  1. A client or customer provided with the workspace
  2. A coworker who has to edit the workspace
  3. Yourself in the future if you return to the workspace
  4. The end-user of the data produced by the workspace
13 Complete the Advanced Exercise for Exercise 6. What are the names of the smallest and largest parks?
  1. Smallest: Carolina Park; Largest: Stanley Park.
  2. Smallest: Pioneer Place (Pigeon Park); Largest: Vanier Park.
  3. Smallest: Pioneer Place (Pigeon Park); Largest: Stanley Park.
  4. Smallest: Jean Beaty Park; Largest: Vanier Park.
14 In Exercise 7, how many parks have greater-than-average areas (i.e. have a ParkArea that is >= AverageParkArea)? How many are below average? Hint: use a Tester transformer after the StatisticsCalculator transformer.
  1. 5 and 68
  2. 14 and 96
  3. 12 and 61
  4. 4 and 69
15 In Exercise 8, what is the average size of parks in the Kitsilano neighborhood?
  1. 24,469 m2
  2. 28,638 m2
  3. 23,986 m2
  4. 27,628 m2
16 Complete the Advanced Exercise for Exercise 9. Assuming you want to calculate the park areas using the B.C. Albers coordinate system and units, you should place the Reprojector transformer after the AreaCalculator. True or false?
  1. True
  2. False

Data Integration Scenario

Question
17 What percentage of Vancouver public art installations are located in the downtown neighborhood?
  1. 84%
  2. 47%
  3. 66%
  4. 62%
18 Which neighborhood has the longest total bike path length?
  1. Downtown
  2. Fairview
  3. Kitsilano
  4. Mount Pleasant
  5. Stanley Park
  6. Strathcona
  7. West End
19 Why do the Bike Paths and Public Art Installation reader feature types have their names displayed as <All>?
  1. The readers are in Merge Feature Type mode, reading all features in the dataset as a single feature type
  2. The readers are reading all the files in the specified folder
  3. The readers are set up to read all SHP and XLS files in the C:\FMEData2018\Data folder
  4. We changed the name of the feature types on purpose to read <All>
20 How many public art installations are in the Mount Pleasant neighborhood?
  1. 12
  2. 13
  3. 14
  4. 16

Open-ended Questions

Lecture

Question
1 Think of an existing business, organization, or technology that relies heavily on data (e.g. VRBO, the United Nation Statistics Division, or autonomous cars). What kind of data sources might it need to integrate? Describe three data sources, their formats, and what information they provide. How would data integration in this example relate to one of the nine reasons to integrate your data covered in the lecture?
2 What is spatial ETL?
3 In the terminology of FME, what is the difference between a translation and a transformation?
4 Is the FME Data Inspector a fully-featured GIS and cartography application? Why or why not?
5 What is the difference between structural and content data transformation?

Lab

Question
6 What is another use case for this workspace in Exercise 1 if the data were different? What organization or business could benefit from a similar data integration workspace?
7 In Exercise 3, what is the value of ZoneName for the westernmost Industrial zone?
8 In Exercise 4, why is VisitorCount stored with Type “integer” and variable while ParkArea and AverageParkArea are stored with Type “float”?
9 Complete the Advanced Exercise for Exercise 5. Why might bookmarking your workspaces be considered a best practice for FME users? Best practice is defined by Merriam-Webster as “a procedure that has been shown by research and experience to produce optimal results and that is established or proposed as a standard suitable for widespread adoption.”
10 Complete the Advanced Exercise for Exercise 6. What are the smallest, largest, and total park areas? What are the names of the smallest and largest parks? Don’t forget to include units. Where can you confirm the data’s units?
11 In Exercise 7, how many parks have greater-than-average areas (i.e. have a ParkArea that is >= AverageParkArea)? How many are below average? Hint: use a Tester transformer after the StatisticsCalculator transformer.
12 In Exercise 8, what is the average size of parks in the Kitsilano neighborhood?
13 Complete the Advanced Exercise for Exercise 9. Where should the Reprojector transformer be placed in the workspace and why is this important?

Data Integration Scenario

Question
14 What percentage of Vancouver public art installations are located in the downtown neighborhood?
15 Which neighborhood has the longest total bike path length?

16. Write a 200 word report on the data integration solution you created.

  • Include the brainstorming diagram or outline you created before carrying out the walkthrough analysis. How close were you? What did you not anticipate?
  • Which additional dataset did you use?
  • What problem or question does your solution address?
  • What analysis steps did you carry out?
  • Provide a deliverable of some kind to show the value of your project. This could be a map (screenshot of Data Inspector is ok), a table, or a description of a new organizational process made possible by the solution.

results matching ""

    No results matching ""