Robotic process automation for PG&E
The project aimed to automate the data retrieval process from PG&E, a utility company, by developing a bot using Robomotion. The bot performs a series of tasks including logging in to the PG&E website, downloading PDF files containing essential data, extracting relevant information, storing it in a database, and uploading the PDF files to both Google Drive and Digital Ocean Space. Additionally, it sends notifications to Discord at the start and resumption of the bot, and also in case of any errors occurring during the process.
Tasks Completed:
- Init Config:
Configured initial settings to ensure smooth operation of the bot.
- Connect to Database:
Established a connection to the database for storing extracted data.
- Excel Operations:
Implemented functionality to access Excel files containing necessary data.
- Login to Website & Get Accounts:
Automated login to the PG&E website and retrieval of account information.
- Download & Read PDF:
Enabled the bot to download PDF files associated with each account and read their contents.
- Perform OCR using Tesseract OCR:
Utilized Tesseract OCR for Optical Character Recognition on PDFs, extracting text data.
- Get Values From PDF & Prepare Data:
Extracted relevant information from PDF files and formatted it for database storage.
- Insert to Database:
Inserted the prepared data into the database for future reference and analysis.
- Upload PDFs to Google Drive and Digital Ocean Space:
Automated the process of uploading PDF files to both Google Drive and Digital Ocean Space for backup and accessibility.
- Sending Notification to Discord:
Implemented functionality to send notifications to Discord at the start and resumption of the bot, and also in case of any errors occurring during the process.