Viscosity
logo-black
  • Data
    • Database Services
      • Performance Assessment
      • Proactive Healthcheck
      • Database Migration & Consolidation
      • Performance Tuning
    • Microsoft Services
      • Microsoft Azure Services
      • Microsoft SQL Server Services
      • Microsoft Gold Azure Partner
    • Oracle Services
      • Oracle Database 19c
      • RAC
      • GoldenGate
      • Data Guard
      • Oracle & SQL Database
    • Viscosity Remote Services
  • Apps
    • App Dev Services
    • Oracle APEX
    • Viscosity AMP
    • Viscosity-ai
    • Shane-ai
  • Cloud
    • Cloud Services
    • Hybrid Cloud
    • Viscosity Edge Cloud
    • Virtualization & Cloud Expertise
    • Microsoft Azure Services
  • Infrastructure
    • Infrastructure Services
    • Exadata
      • Exadata Resale & Services
    • Oracle Database Appliance
      • Oracle Database Appliance Resale & Services
      • ODA Health Checks
    • Zero Data Loss Recovery Appliance
    • VMware to KVM Migration
  • Events
    • Upcoming Events
    • Virtual Training
    • 2025 Past Events
    • 2024 Past Events
    • 2023 Past Events
    • 2022 Past Events
  • About
    • About Us
    • News
    • Blogs
    • Publications
    • Contact
Hit enter to search or ESC to close
ENGAGE WITH US

Machine Learning , DBA , AI , emerging tech , ML DBA

The DBA's Next Great Job — Part I

By Rich Niemiec
June 07, 2023

A DBA tends to be the "person of last resort" in prioritizing and solving issues as they come in. The Oracle DBA has been in a constant state of transformation for decades. Sometimes, it's a job that directly works with Applications like financials; other times, custom applications or an APEX app for a phone takes the front seat, but administration & security never gets to rest. The Autonomous Database has offered to give the DBA a digital twin to help alleviate the demand for less important or less used databases (note the image on ATP= Autonomous Transaction Processing that took all of 2 minutes to create the database). A training application may not be as important as a stock market application for a trading company. Yet, a training application that trains an employee at just the right time to fix a valve at a refinery before it breaks (using prescriptive analytics) may be crucially important to an oil company.

 

Still, DBAs are generally overworked & underutilized, as I mentioned in a recent Forbes article. DBAs are underutilized in the world of Machine Learning (ML), where they can help provide the data needed for Data Scientists. One paper shows that data preparation represents 80% of Machine Learning projects, while Data Science is only around 20%. Yet, 80% of ML projects fail. While there are many reasons for failures, not having a good DBA is at the top of the list, and not having a DBA on the project is another (although usual reasons also include: bad data / not enough data / missing data...etc.). With the advent of ML & additional AI, such as robotics, VR (virtual reality), AR (augmented reality), GPS integration, etc., the DBA is more crucial than ever.

 

We'll discuss some new choices for DBAs to consider:

- Assist in Machine Learning to help the Data Scientist get better data (ML DBA)

- Dive into Machine Learning & start to leverage algorithms for the business (Data Scientist)

- Stay an administration DBA (DBA)

- Integrate things like VR/AR/GPS to enhance sales, security, or delivery (Emerging Technology)

- Integrate Robotics (Build the future)

- Move into Management of one or all of the above (Manage the team that does the above tasks)

 

Become the Machine Learning DBA or Data Scientist

Much of the job in Machine Learning (ML) is all about the data: moving, manipulating, augmenting, deleting duplicates, connecting, combining, and evaluating. As a DBA, you may have done this for over a decade, and it is probably your best skill. Data Scientists are often lost when it comes to the harder parts of preparing the data set. You will either be a key member of the team or add data science to your skills to be the team leader of that next ML project. The project starts with a well-formulated business problem: Who are our best customers by revenue, profit, repeat business, and referrals? Next, it's all about understanding & preparing the data: deleting bad data, combining good data sources, filling in missing data, finding big data that might fit, and getting data from other cloud providers. Now we can build and train a model (using 60% of the data) with function that calls for an algorithm (some math that does something for us). Let's use a function that classifies our good & bad customers using an algorithm (support vector machine). Then we'll use the other 40% of the data to test our model and perhaps pick a better (or faster) algorithm.

 

When you use Machine Learning notebooks in Oracle, it has examples of either SQL & PL/SQL, Python, or R. You choose the language you want to use. There are more examples of using each algorithm for a business problem than you'll need for a very long time. As you work your way through the different algorithm examples, you'll quickly see that you don't need to learn all of them at once. You just need to find the one that solves the problem that's important to your company. Perhaps it's the classification of good and bad customers. After that, you may want to find the attributes that make up a good or bad customer. Then you might want to go out to big data and find more customers like your best ones. The dmdtdemo.sql example is just one that Oracle has to show how easy it is to classify (the function you're performing) potential customers into two groups (either a customer that will buy a logo credit card with their favorite team or won't buy one). It uses an algorithm (decision tree) and has a SQL query to list the results for the salesperson who will focus on the best potential clients. Okay, you did have to write 15 minutes of SQL code to create the model & query the data, and you had to learn what function/algorithm would do this for you. But hey, now you're a data scientist (not really), and maybe they'll have to double your salary. Is this where you want to take your career - no more running/fixing backups in the middle of the night and getting calls day & night for half the pay (that's tough to leave behind)? You'll have to do pizza meet-ups at 4 pm and share successes with your latest hour of coding that found 20 new clients (that you've classified as good potentials) that will bring your company the next $1M deals. Oh yeah, the hard part was finding the data to teach your model (being a DBA was the key).

 

Stay the DBA & the Job You Have Now

Maybe retirement is not that far away. You've migrated to 19c, and the next version, 23c, maybe the last one you need to support. First, you need to train your digital twin (autonomous database) to manage all of those easy databases that nobody has time to manage. That's 2 minutes to create the database, then migrate the data (shouldn't be much since these are databases that nobody is taking care of), then get all the patches (oh, that's right, you’re using autonomous database - it does all of that for you). Next, start ensuring you're using all the best features in the database that really matter (the one they pay you to keep the system up 24/7 since it runs the business). Maybe you just enjoy being the DBA. As the problem solver of last resort, you often get to fix the latest impossible problem. It can be performance, security, integration, recoverability, availability, business sense, or just experience. You may just want to stay the DBA!

 

Some features you should be considering; use the best features that were in:

  • Key 12c Features to Leverage: Pluggable Databases and In-Memory Column Store
  • Key 18c Features to Try: Snapshot Carousel, PDB Switchover, & In-Memory External Tables
  • Key 19c Features to Try (current long-term version): 3 free PDBs, ADW/ATP, SQL Quarantine, Persistent Memory, Sharded RAC, AJD (Autonomous JSON Database) & Automatic Indexes. There are also key features from earlier versions that you should leverage in 19c/23c. You should upgrade to 19c soon if you're not there now. We can help!
  • Key 21c Features to Try: JavaScript in the DB, AutoML (Auto Machine Learning), OML4Py, SQL Macros, Persistent Memory, Blockchain Tables, Auto In-Memory Management, Hybrid In-Memory Scans, AutoUpgrade, Active Data Guard with Standby DB Result Cache
  • Key 23c Features to Play with (see my last 23c blog for much more on this): True Cache, Priority Transactions, Auto SQL Repair, JavaScript in the Database, Read-Only per-PDB standbys, Schema-Level Privileges, and a new Developer Role. While 23c has over 300 new features, don't forget that 21c brought 200 features to light. While some of these may be back-ported to 19c, you certainly want to be aware of some of the best ones.

Stay tuned for Part II! 

 

 

References:

  • DBAs Next Great Job, (presentation), Rich Niemiec
  • Larry Ellison Keynote at CloudWorld
  • Oracle 23c; Oracle CloudWorld, Andrew Mendelsohn
  • Oracle 23c; Oracle CloudWorld, Jenny Tsai-Smith & Dominic Giles
  • Preparing for Oracle 23c, Rich Niemiec
  • https://blogs.oracle.com/authors/mark-hornick
  • Oracle12c Release 2 Performance Tuning Tips & Techniques; Richard J. Niemiec; Oracle Press
  • viscosityna.com, www.oracle.com, & technet.oracle.com
  • All companies and product names are trademarks or registered trademarks of the respective owners
  • Rich Niemiec ©2023. This document cannot be reproduced without expressed written consent from Rich Niemiec or Viscosity NA, but may be reproduced or copied for article, presentation, and conference use by the author.
  • Neither Viscosity nor the author guarantees this document to be error-free. Please provide comments/questions to richniemiec@gmail.com – niemiec@viscosityna.com; I am always looking to improve!
All posts
About Author
Rich Niemiec

Rich Niemiec is the current Chief Innovation Officer of Viscosity North America. He is an Oracle ACE Director, a world-renowned IT expert, and was a co-founder and the CEO of TUSC, a Chicago-based systems integrator of Oracle-based business solutions started in 1988. TUSC was the Oracle Partner of the Year in 2002, 2004, 2007, 2008, 2010, & 2011 (Rolta TUSC). Rich is the past President of the International Oracle Users Group (IOUG) and the current President of the Midwest Oracle Users Group (MOUG). Rich is in the Chicago & E&Y National Entrepreneur Hall of Fame.

You might also like
Happy Birthday Oracle – June 16, 2024 – 47 years!
Happy Birthday Oracle – June 16, 2024 – 47 years!
June 07, 2023
The DBA's Next Great Job — Part II
The DBA's Next Great Job — Part II
June 07, 2023
SUBMIT YOUR COMMENT
logo for footer

Viscosity's core expertise includes:

Data Transformation, Emerging Technology, High Availability & Scalability Solutions, Cloud Migrations, Performance Tuning, Data Integrations, Machine Learning, APEX Development, and Custom Application Development.


Solutions

Resources

Partnerships

Careers

Clients

 

Contact
Email: sales@viscosityna.com

Telephone:
(469) 444-1380

Address:
3016 Communications Pkwy Suite 200, Plano, TX 75093

Copyright 2025. All Rights Reserved by Viscosity North America.