
Round-the-Clock Business Information System Management


Round-the-clock systems are more than just an intellectual exercise for an increasing number of companies. Some of the "modern" reasons why more companies, including smaller ones, are moving to 24x7 operations have become clichés. They include the need to support Web presences and global operations and the need to shoehorn lengthier maintenance tasks, made necessary by rapidly growing databases, into shrinking maintenance windows. Other reasons are mentioned less frequently but are still important. These include an insistence by some companies that the IT department never say no to CEOs, CFOs, marketing managers, and other authorized personnel who, without warning, want to use the VPN connection from their homes to investigate an idea that comes to them on the weekend, in the evening, or even in the middle of the night.

How High Is High?

When considering high availability and operations that extend beyond "nine-to-five," one question to ask is, how high is high? A decade or so ago, organizations had only three generic choices. They could do nothing to protect the availability of their data and applications and, instead, cross their fingers and hope that nothing went wrong. Few companies were that brave or that foolhardy.

More likely, they took nightly tape backups that allowed them to recover their data if disaster struck. Of course, back in the days of slow tape drives, the recovery process would probably have taken a few days. Possibly worse, they were able to recover data only up to the previous night. Updates entered after that point were lost in the event of a disaster. Systems generally had to be shut down during the nightly backup process, which was acceptable for traditional nine-to-five operations but not for the round-the-clock operations that are becoming more common. And neither of these solutions did anything to avert the downtime required for other regular maintenance, such as hardware and software upgrades and database reorganizations.

The third option was a full HA solution that provided real-time or near real-time replication of all data and objects to a hot-standby server. The HA software could also automatically fail over to the backup system when the primary system became unavailable. Or an administrator could initiate a switchover to accommodate maintenance on the primary server. Because the HA software could support a remote backup server, this alternative would also prevent business downtime in the event of a disaster. This was, and remains, the gold standard in HA. The only problem was that, until a few years ago, a full HA solution was too expensive for most small and medium-sized businesses.

When the first true AS/400 (now System i) HA options were introduced more than 25 years ago, they were directed primarily at large enterprises. They required considerable effort to implement, monitor, and manage. This administration workload, combined with high price tags, put them out of reach of many AS/400 shops. New options that have come on the scene in the intervening years have changed the economics. Almost all companies can now cost-justify an investment that significantly improves the availability of their data and applications beyond a solely tape-based backup strategy. This is partly because the total cost of ownership of HA solutions has fallen and partly because the market now includes lower-cost options that fall between tape backups and full HA on the availability spectrum.

Data Vaulting

Tape drives have become considerably faster over the years. Consequently, one of the drawbacks of tape, namely the time required to perform save and restore tasks, has diminished. Nonetheless, the time to restore a full data center from tape after a disaster may still be too much of a burden on an organization.

Another former problem with tape saves—the need to shut systems down during backup operations—has also been alleviated. System i has had a save-while-active function for some time. This eliminates the need for downtime during tape save operations, but it is not totally satisfactory for 24x7 operations because tape saves still greatly impair application performance while they are running.

Even ignoring the performance issues associated with the save-while-active function, tape backups are still far from ideal. Data is generally saved to tape and sent offsite once a day, typically at night. Assuming you use journaling, changes made throughout the day will be saved to the journal, but if that journal is local to the production system, a disaster that destroys the data center will likely also destroy the journal. Thus, you may lose up to a day's worth of data after a disaster. The data loss will be greater still if the most recent tape had not yet been shipped offsite or if it was corrupted.

Data vaulting offers an inexpensive way to overcome these liabilities. Data vaulting software captures changes made on the production system and saves them on disks on another system. Vaulting software can copy changes as they happen, batch them to be sent periodically, or allow you to choose between these two options.
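
The difference between continuous and batched capture can be illustrated with a small sketch. This is a toy model rather than any vendor's implementation; the `VaultLink` class, its `batch_size` parameter, and the change names are hypothetical and exist only to show which changes would be lost if a disaster struck before the next transmission.

```python
from dataclasses import dataclass, field


@dataclass
class VaultLink:
    """Toy model of a production system shipping changes to a vault.

    All names here are illustrative assumptions, not a real product API.
    """
    batch_size: int = 1  # 1 = send each change as it happens; >1 = batch
    pending: list = field(default_factory=list)  # captured, not yet sent
    vault: list = field(default_factory=list)    # stored on the vault system

    def record_change(self, change: str) -> None:
        """Capture a change and transmit once the batch is full."""
        self.pending.append(change)
        if len(self.pending) >= self.batch_size:
            self.flush()

    def flush(self) -> None:
        """Send every pending change to the vault."""
        self.vault.extend(self.pending)
        self.pending.clear()

    def lost_on_disaster(self) -> list:
        """Changes that exist only on the production side right now."""
        return list(self.pending)


continuous = VaultLink(batch_size=1)
batched = VaultLink(batch_size=3)
for n in range(1, 6):
    change = f"update-{n}"
    continuous.record_change(change)
    batched.record_change(change)

print(continuous.lost_on_disaster())  # []
print(batched.lost_on_disaster())     # ['update-4', 'update-5']
```

In this model, continuous capture leaves nothing at risk, while batching trades a small exposure window for fewer transmissions.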

The system hosting the vault can be local or remote, and it doesn't necessarily have to match the production system. For example, you might use a low-cost Linux or Windows server to run the vault for your production System i server. Unlike HA, the objective of vaulting is not to have a hot-standby server ready to take over operations immediately whenever necessary, but rather to capture changes made between tape saves so you can recover data close to a point of failure—or right up to the point of failure if using continuous data capture and transmission.

If you use a data vault, you will likely still want to use tape saves as a last line of defense in the protection of your data. The vault can help you with that as well. Tapes can be created from the vault, eliminating the impact that save operations normally have on production systems.

Data vaulting offers another advantage over tape saves. Using the vault to recover single or multiple objects that become corrupted or are accidentally deleted is generally fast and easy.

Single-Point Availability Solutions

Data vaulting can fill in some of the gaps of a tape-only backup strategy, but it doesn't prevent the regular and lengthy downtime that results from maintenance tasks such as database reorganizations or hardware and software migrations. However, there are affordable products on the market that can address these availability issues.

Database reorganizations are periodically necessary to free up space consumed by logically deleted records and to improve application performance. Traditionally, the reorganization tool provided with the database required that applications be shut down while it ran, but there are ways to overcome this problem. As the name implies, reorganize-while-active tools reorganize databases while production systems remain active.

These tools might perform the reorganization in-place or they might create a mirrored file that is reorganized. In the latter case, the tool keeps the mirrored file synchronized with the production database by replicating any changes applied to the production file while the reorganization is in progress. When the reorganization is complete, the mirrored file becomes the production file.
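
The mirrored-file approach can be sketched in a few lines. The following is a simplified illustration, assuming a file can be modeled as a mapping from record key to value, with `None` standing in for a logically deleted record; the function name and data are hypothetical, and real tools work on database files rather than in-memory dictionaries.

```python
def reorganize_with_mirror(production: dict, changes_during_reorg: list) -> dict:
    """Return a reorganized copy of `production` that can replace it.

    `production` maps record key -> value; None marks a logically
    deleted record. `changes_during_reorg` is a list of (key, value)
    pairs applied by applications while the copy is being built.
    """
    # Step 1: build the mirror from live records only, which reclaims
    # the space held by logically deleted records.
    mirror = {key: value for key, value in production.items() if value is not None}

    # Step 2: applications keep updating production; replicate each
    # change to the mirror so the two stay synchronized.
    for key, value in changes_during_reorg:
        production[key] = value
        if value is None:
            mirror.pop(key, None)   # a delete removes the record outright
        else:
            mirror[key] = value

    # Step 3: the caller swaps the mirror in as the new production file.
    return mirror


production = {"A": 1, "B": None, "C": 3}
changes = [("D", 4), ("A", None), ("C", 30)]
new_production = reorganize_with_mirror(production, changes)
print(new_production)  # {'C': 30, 'D': 4}
```

The reorganized file holds only live records, including the updates that arrived while the reorganization was running.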

When it comes to system resource usage, nothing is free. While-active reorganization tools consume some System i resources. Fortunately, sophisticated reorganize-while-active software allows you to schedule the reorganization processes to run during periods when the demand on the system is expected to be low. The tool may also be able to split the reorganization job into smaller tasks that can fit into the limited windows allotted to them.

The downtime traditionally required to convert databases when upgrading applications or to migrate databases when upgrading servers can be eliminated with tools that work in a fashion similar to the reorganize-while-active tools. Convert-while-active, upgrade-while-active, and migrate-while-active tools typically create a mirror database on the new hardware or in the new format and then keep that mirrored database synchronized until you are ready to begin using the new database.

Lower TCO for HA

Data vaulting and single-point solutions are steps on the road to high availability, but they are still far from the ultimate goal. In a true HA environment, HA software replicates all data and objects to a second server and keeps that server synchronized with the primary server in real-time or near real-time, allowing the second server to act as hot-standby backup. Functionality in the HA software can automatically switch users to the backup when the primary server is unavailable, or operators can manually tell it to initiate a switchover to accommodate maintenance on the primary server.
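
The two ways of moving users to the backup, automatic failover and operator-initiated switchover, can be contrasted in a minimal sketch. The article does not say how HA products detect that a primary is unavailable; the missed-heartbeat threshold below is an assumption made for illustration, as are the `HaPair` class and its method names.

```python
class HaPair:
    """Toy model of a primary server and its hot-standby backup."""

    def __init__(self, max_missed_heartbeats: int = 3):
        self.active = "primary"
        self.missed = 0
        self.max_missed = max_missed_heartbeats  # assumed detection rule

    def heartbeat(self, primary_responded: bool) -> str:
        """Record one health check and fail over if the primary is gone."""
        if self.active != "primary":
            return self.active  # already running on the backup
        self.missed = 0 if primary_responded else self.missed + 1
        if self.missed >= self.max_missed:
            self.active = "backup"  # automatic failover
        return self.active

    def switchover(self) -> str:
        """Operator-initiated role swap, e.g. for planned maintenance."""
        self.active = "backup" if self.active == "primary" else "primary"
        self.missed = 0
        return self.active


unplanned = HaPair()
for responded in [True, False, False, False]:
    unplanned.heartbeat(responded)
print(unplanned.active)  # backup

planned = HaPair()
print(planned.switchover())  # backup
print(planned.switchover())  # primary
```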

The need for redundant hardware and software, not to mention the cost of the HA software itself and the administrative burden that the early versions of that software imposed, used to put a full HA solution beyond the budgets of all but the largest of enterprises, but that's no longer the case.

Consider first the HA software. In the early days, it was designed specifically for large enterprises and came with a price tag to match. Over the years, new competitors entered the market with lower-priced products. To meet this competition, the early entrants created additional product editions designed specifically for small and medium-size shops. When used in fairly straightforward system environments, these less expensive editions typically offer the same HA sophistication as the more expensive editions, but they don't support some of the particularly complex system technologies and topologies that are generally used exclusively at larger enterprises.

The high cost of buying a second server that acts solely as a backup was another serious impediment for many small and medium-size companies. IBM has helped to lessen this burden. It offers many configurations of "Capacity BackUp" (CBU) Editions for its Model 520, 525, 570 and 595 System i servers. The CBU editions are available at a much lower price than their equivalent non-CBU editions, but, under normal circumstances, the CBU servers can be used only as a backup machine. While the primary system is handling normal operations, the only software other than the operating system that can legally run on the CBU machine is the HA software that maintains data and object redundancy.

When a disaster brings down the primary server or it must be taken offline for other reasons, its System i software licenses temporarily transfer to the CBU machine, allowing users to be switched to it without breaking any IBM licenses and without having to pay for second licenses. When the primary server is brought back online, the System i software licenses automatically revert to it. (It is likely also possible to transfer your application and other software licenses temporarily without having to buy a second license, but that depends on the terms of each vendor's license.)

The first generations of HA software were often cumbersome to install and required considerable monitoring and management. Small IT shops found it difficult to justify the extra headcount. This is no longer an issue for some of the products. Over the years, HA vendors have incorporated considerable automation into their products, making them self-installing and, to a large extent, self-managing. Autonomics also makes the more sophisticated of the products self-healing. In the background, they automatically check the integrity of replicated data and objects, correcting any problems without the need for operator intervention. Because of this increased automation and autonomics, monitoring and managing advanced HA software may require less than 15 minutes of an administrator's time each day.
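
One plausible way to picture the background integrity check is a checksum comparison followed by a resend of anything that differs. The sketch below assumes that approach; the article does not describe how any particular product verifies replicated objects, and the function names and sample objects are hypothetical.

```python
import hashlib


def digest(data: bytes) -> str:
    """Checksum used to compare an object on both servers."""
    return hashlib.sha256(data).hexdigest()


def audit_and_repair(primary: dict, backup: dict) -> list:
    """Compare each primary object with its replica and resync mismatches.

    Both arguments map object name -> bytes. Returns the names that were
    repaired. Objects that exist only on the backup are left alone in
    this sketch.
    """
    repaired = []
    for name, data in primary.items():
        if name not in backup or digest(backup[name]) != digest(data):
            backup[name] = data  # resend the object without operator action
            repaired.append(name)
    return repaired


primary = {"CUSTOMER": b"rows-v2", "ORDERS": b"rows-v7", "PRICES": b"rows-v1"}
backup = {"CUSTOMER": b"rows-v2", "ORDERS": b"rows-v6"}
print(audit_and_repair(primary, backup))  # ['ORDERS', 'PRICES']
print(backup == primary)                  # True
```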

Self-Managing Systems

Round-the-clock system management is about more than just maintaining availability 24x7. It's also about keeping systems continually tuned and problem-free. Some products adhere to this holistic philosophy of system management by providing an integrated set of tools that perform a variety of functions, including resource utilization reporting and analysis, system performance reporting and analysis, file reorganizations, and more.

If all such a system management product does is provide a convenient interface to integrate that functionality, it provides a productivity benefit, but it doesn't help the small to medium-size IT department that wants to manage its systems 24x7, without the need for round-the-clock onsite staff. To meet this higher objective, the software requires automation and autonomics. Ideally, the tool should have considerable intelligence, with the ability to automatically recognize storage problems—not just the need for file reorganizations, but also the existence of obsolete data that can be safely archived, among other issues—that may arise on all types of System i storage, including System i disks, IFS files, ASPs, and iASPs. It should recommend appropriate actions to resolve the issues, allow you to schedule those actions to run at convenient times, and, in some cases, if the appropriate options are set, it should execute those actions on its own, without the need for manual intervention.
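
The recognize, recommend, and optionally execute cycle described above might look something like the following sketch. The thresholds, the `FileStats` fields, and the file names are invented for illustration; a real tool would draw on far richer system data and would actually run the actions rather than just label them.

```python
from dataclasses import dataclass


@dataclass
class FileStats:
    """Hypothetical per-file measurements a management tool might collect."""
    name: str
    total_records: int
    deleted_records: int
    days_since_last_use: int


def recommend(stats, deleted_ratio_limit=0.25, obsolete_after_days=365):
    """Return the actions suggested for one file (thresholds are assumptions)."""
    actions = []
    if stats.total_records and stats.deleted_records / stats.total_records >= deleted_ratio_limit:
        actions.append("reorganize")
    if stats.days_since_last_use >= obsolete_after_days:
        actions.append("archive")
    return actions


def manage(files, auto_execute=False):
    """Build (file, action, status) rows; status reflects the auto_execute option."""
    plan = []
    for stats in files:
        for action in recommend(stats):
            # A real tool would schedule or run the action here.
            status = "executed" if auto_execute else "recommended"
            plan.append((stats.name, action, status))
    return plan


files = [
    FileStats("ORDERS", total_records=1000, deleted_records=400, days_since_last_use=2),
    FileStats("HIST2001", total_records=500, deleted_records=0, days_since_last_use=900),
]
print(manage(files))
# [('ORDERS', 'reorganize', 'recommended'), ('HIST2001', 'archive', 'recommended')]
```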

The bottom line is that IT managers at small and medium-size System i shops now have more options for affordable round-the-clock system management and HA than they did in the past. If your knowledge of the vendors' products is more than a few years old, a review of today's market offerings may yield considerable value for your organization.
