RPG Academy: Database Modernization—Methodology, Part 4

RPG
Typography
  • Smaller Small Medium Big Bigger
  • Default Helvetica Segoe Georgia Times

Today’s topic is normalization. If your database were a piece of dirty laundry, normalization would be a kind of wash-rinse-dry process. But in this case, shrinking wouldn’t be a problem.

Normalization, explained in the first TechTip of this series, is more than just a theoretical concept. It’s time to put it to work in your database.

Normalizing the Database

In this stage of the modernization process, it would be a good idea to take some time and revisit the database design, review the notes you took earlier, and have another look at the Entity Relationship Diagram you drafted. Then, follow the normalization process and update your logical database model’s tables to conform with the second normal form, at least. The following steps are involved in this stage:

  1. Eliminate unnecessary columns from SQL tables—In the process of normalization, some unnecessary columns will be eliminated or moved to other tables. Many tables contain columns that were intended for some purpose, but over the course of business modifications are no longer used or were never used at all. This is the opportunity to identify and remove such columns. Be careful, and eliminate only those that you’re absolutely certain are not required.
  2. Update the data dictionary—The data dictionary that you started a few steps ago, when you defined standard abbreviations for table names, can now be updated with some of the following:
  • Object naming conventions
  • Column naming conventions
  • Function naming conventions
  • Application naming conventions
  • Standard abbreviations
  1. Establish data domains—This is the process of grouping columns with like attributes into classes or domains. You can implement the data dictionary with established domains using field reference files, since the SQL CREATE TABLE statement can now reference this file. This requires a small and not-very-obvious trick, so let me give you an example. Imagine that you have a field reference table named FRT and it contains the definition commonly used for a few data domains, such as:
    • Percentages, defined in column PERC, assuming the percentage varies from 0.01% to 100.00%
    • Coefficients, defined in column COEF, assuming the coefficient goes from 0.00001 to 1000.00000
    • Names, defined in column NAME, as a 50-character column
    • Descriptions, defined in column DESCRIPT, as a 250-character column

Now, let’s say I want to create a new table containing only columns defined in my field reference table. The respective CREATE TABLE statement would look something like this:

CREATE OR REPLACE TABLE new_table

(user_name, user_description, salary_coef, bonus_perc)

AS

(SELECT NAME, DESCRIPT, COEF, PERC FROM FRT)

WITH NO DATA;

Great, right? However, in real life, not everything comes predefined, so it’s highly probable that sooner or later (definitely sooner), you’ll come across a situation in which you’ll need to create a column that doesn’t match any definition in your field reference table. Here is where the trick I mentioned earlier is used. When using this type of CREATE TABLE statement, you can’t mix “standard” and “select-like” definitions, so you’re forced to tweak the SELECT statement to include the extra columns. The way to do it is simple, if you’re familiar with the CAST SQL function. Here’s a variation of the previous example, tweaked to include a couple of new columns (an INTEGER named employee_id and a very long VARCHAR, named soft_skills):

CREATE OR REPLACE TABLE new_table

(user_name, user_description, salary_coef, bonus_perc, employee_id, soft_skills)

AS

(SELECT NAME, DESCRIPT, COEF, PERC, CAST(0 AS INTEGER), CAST(‘’ AS VARCHAR(500)) FROM FRT)

WITH NO DATA;

It’s a bit more verbose, but it gets the job done. There are, however, some details worth mentioning: When you create a table via a SELECT statement, your new table will inherit more than just the definition of the data types and sizes of the columns in the SELECT statement. It also gets the nullability definition (NULL or NOT NULL) of the column and, optionally, other characteristics such as column defaults, identity column attributes, and implicitly hidden and even row change timestamp definitions. In order to “copy” these characteristics, you need to specify the respective INCLUDE clause. Let’s illustrate this by using the previous statement and changing it to also copy the column defaults (see the part in bold):

CREATE OR REPLACE TABLE new_table

(user_name, user_description, salary_coef, bonus_perc, employee_Id, soft_skills)

AS

(SELECT NAME, DESCRIPT, COEF, PERC, CAST(0 AS INTEGER), CAST(‘’ AS VARCHAR(500)) FROM FRT)

WITH NO DATA

INCLUDE COLUMN DEFAULTS;

The same can be used for the other characteristics, in the same way. In short, you can use a field reference table to enforce a certain uniformity over your database by defining the exact attributes of the most commonly used column types.

  1. Create or update the logical database model—The normalization process probably required the creation of some tables and changes in others. In turn, this caused new relationships between tables. All of this must be documented thoroughly. It’s a good time to create or update your ERD with the latest changes. Be sure to include the attributes and constraints you added when you migrated the DDS objects.
  2. Implement the model—Preferably using an automated tool such as IBM InfoSphere Data Architect, apply the changes made in the logical model to the physical model (the closest to your actual database).

Now you’ll need to adjust some programs, just like you did in your two-file-conversion PoC, because of the new tables that were created during the normalization phase. This won’t be the last time you’ll have to change them in this process. The next big step requires you to change them again, but it’s worth it.

In the next TechTip, I’ll start discussing the second big step of the database modernization process: moving business rules to the database.

Rafael Victoria-Pereira

Rafael Victória-Pereira has more than 20 years of IBM i experience as a programmer, analyst, and manager. Over that period, he has been an active voice in the IBM i community, encouraging and helping programmers transition to ILE and free-format RPG. Rafael has written more than 100 technical articles about topics ranging from interfaces (the topic for his first book, Flexible Input, Dazzling Output with IBM i) to modern RPG and SQL in his popular RPG Academy and SQL 101 series on mcpressonline.com and in his books Evolve Your RPG Coding and SQL for IBM i: A Database Modernization Guide. Rafael writes in an easy-to-read, practical style that is highly popular with his audience of IBM technology professionals.

Rafael is the Deputy IT Director - Infrastructures and Services at the Luis Simões Group in Portugal. His areas of expertise include programming in the IBM i native languages (RPG, CL, and DB2 SQL) and in "modern" programming languages, such as Java, C#, and Python, as well as project management and consultancy.


MC Press books written by Rafael Victória-Pereira available now on the MC Press Bookstore.

Evolve Your RPG Coding: Move from OPM to ILE...and Beyond Evolve Your RPG Coding: Move from OPM to ILE...and Beyond
Transition to modern RPG programming with this step-by-step guide through ILE and free-format RPG, SQL, and modernization techniques.
List Price $79.95

Now On Sale

Flexible Input, Dazzling Output with IBM i Flexible Input, Dazzling Output with IBM i
Uncover easier, more flexible ways to get data into your system, plus some methods for exporting and presenting the vital business data it contains.
List Price $79.95

Now On Sale

SQL for IBM i: A Database Modernization Guide SQL for IBM i: A Database Modernization Guide
Learn how to use SQL’s capabilities to modernize and enhance your IBM i database.
List Price $79.95

Now On Sale

BLOG COMMENTS POWERED BY DISQUS

LATEST COMMENTS

Support MC Press Online

$

Book Reviews

Resource Center

  •  

  • LANSA Business users want new applications now. Market and regulatory pressures require faster application updates and delivery into production. Your IBM i developers may be approaching retirement, and you see no sure way to fill their positions with experienced developers. In addition, you may be caught between maintaining your existing applications and the uncertainty of moving to something new.

  • The MC Resource Centers bring you the widest selection of white papers, trial software, and on-demand webcasts for you to choose from. >> Review the list of White Papers, Trial Software or On-Demand Webcast at the MC Press Resource Center. >> Add the items to yru Cart and complet he checkout process and submit

  • SB Profound WC 5536Join us for this hour-long webcast that will explore:

  • Fortra IT managers hoping to find new IBM i talent are discovering that the pool of experienced RPG programmers and operators or administrators with intimate knowledge of the operating system and the applications that run on it is small. This begs the question: How will you manage the platform that supports such a big part of your business? This guide offers strategies and software suggestions to help you plan IT staffing and resources and smooth the transition after your AS/400 talent retires. Read on to learn: