Last updated: 2021-07-02 22:34:02
Cover
Copyright Information
Credits
About the Author
About the Reviewers
www.PacktPub.com
Why subscribe?
Customer Feedback
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
Getting Started with Pentaho Data Integration
Pentaho Data Integration and Pentaho BI Suite
Introducing Pentaho Data Integration
Using PDI in real-world scenarios
Loading data warehouses or data marts
Integrating data
Data cleansing
Migrating information
Exporting data
Integrating PDI along with other Pentaho tools
Installing PDI
Launching the PDI Graphical Designer - Spoon
Starting and customizing Spoon
Exploring the Spoon interface
Extending the PDI functionality through the Marketplace
Introducing transformations
The basics about transformations
Creating a Hello World! Transformation
Designing a Transformation
Previewing and running a Transformation
Installing useful related software
Summary
Getting Started with Transformations
Designing and previewing transformations
Getting familiar with editing features
Using the mouseover assistance toolbar
Adding steps and creating hops
Working with grids
Designing transformations
Putting the editing features in practice
Previewing and fixing errors as they appear
Looking at the results in the execution results pane
The Logging tab
The Step Metrics tab
Running transformations in an interactive fashion
Understanding PDI data and metadata
Understanding the PDI rowset
Adding or modifying fields by using different PDI steps
Explaining the PDI data types
Handling errors
Implementing the error handling functionality
Customizing the error handling
Creating Basic Task Flows
Introducing jobs
Learning the basics about jobs
Creating a Simple Job
Designing and running jobs
Revisiting the Spoon interface and the editing features
Designing jobs
Getting familiar with the job design process
Looking at the results in the Execution results window
The Job metrics tab
Enriching your work by sending an email
Running transformations from a Job
Using the Transformation Job Entry
Understanding and changing the flow of execution
Changing the flow of execution based on conditions
Forcing a status with an abort Job or success entry
Changing the execution to be synchronous
Managing files
Creating a Job that moves some files
Selecting files and folders
Working with regular expressions
Summarizing the Job entries that deal with files
Customizing the file management
Knowing the basics about Kettle variables
Understanding the kettle.properties file
How and when you can use variables
Reading and Writing Files
Reading data from files
Reading a simple file
Troubleshooting reading files
Learning to read all kinds of files
Specifying the name and location of the file
Reading several files at the same time
Reading files that are compressed or located on a remote server
Reading a file whose name is known at runtime