Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
audina:dailymaintenance:start [2011/10/28 12:13] smayr created |
audina:dailymaintenance:start [2011/11/29 12:59] (current) smayr [Web Server (www)] |
||
---|---|---|---|
Line 1: | Line 1: | ||
= System Daily Maintenance = | = System Daily Maintenance = | ||
+ | Author: Thai Tran | ||
- | == Exchange | + | == Exchange |
- | | + | |
- | Verify that environmental conditions are tracked and maintained. | + | |
- | Check temperature and humidity to ensure that environmental systems such as heating and air conditioning settings are within acceptable conditions, and that they function within the hardware manufacturer' | + | |
- | Ensure that your physical network and related hardware such as routers, switches, hubs, physical cables, and connectors are operational. | + | |
- | Check Backups | + | === Physical Environmental Checks === |
- | Make sure that the recommended minimum backup strategy of a daily online backup is completed. | + | |
- | Verify that the previous backup operation completed. | + | * Check temperature |
- | Analyze | + | * Ensure |
- | Verify | + | |
- | Performance | + | === Check Backups === |
- | % Processor Time | + | * Make sure that the recommended minimum backup strategy of a daily online backup is completed. |
- | Available MBs | + | * Verify that the previous backup operation completed. |
- | % Committed Bytes in Use | + | * Analyze and respond to errors and warnings during the backup operation. |
+ | * Verify that the transaction logs were successfully purged (if your backup type is purging logs). | ||
- | Event Logs | + | === Performance === |
- | Filter application and system logs on the Exchange server to see all errors. | + | * % Processor Time. |
- | Filter application and system logs on the Exchange server to see all warnings. | + | * Available MBs. |
- | Note repetitive warning and error logs. | + | * % Committed Bytes in Use. |
- | Respond to discovered failures and problems. | + | |
- | Exchange Database | + | === Event Logs === |
- | Check the number of transaction | + | * Filter application and system |
- | Verify that databases are mounted. | + | * Filter application and system logs on the Exchange server |
- | Make sure that public folder replication is up-to-date. | + | * Note repetitive warning and error logs. |
- | If full-text indexing is enabled, verify that indexes are up-to-date. | + | * Respond to discovered failures |
- | Test mailbox, verify the logon of each database | + | |
- | MAPI Client Performance and server availability | + | === Exchange Database === |
- | Examine System Monitor counters. | + | * Check the number of transaction |
- | Examine Event Viewer | + | |
- | Verify that a test account can log on to the Exchange server | + | * Make sure that public folder replication is up-to-date. |
- | Verify your Performance monitor RPC counters against a baseline - RPC average latency/RPC requests/ | + | * If full-text indexing is enabled, verify that indexes are up-to-date. |
+ | * Test mailbox, verify | ||
- | Check Queue viewer | + | === MAPI Client Performance and server |
- | Check queues for each server | + | * Examine System Monitor counters. |
- | Record queue size. | + | * Examine Event Viewer |
+ | * Verify that a test account can log on to the Exchange | ||
+ | * Verify your Performance monitor RPC counters against a baseline - RPC average latency/RPC requests/ | ||
- | Message Paths and Mail flow | + | === Check Queue viewer === |
- | Send messages between internal servers using test accounts. | + | |
- | Check and verify that messages deliver successfully. | + | * Record queue size. |
- | Send outgoing messages to non-local accounts. | + | |
- | Check and verify that outgoing messages deliver successfully. With the test account on the external host, verify that mail comes in. | + | |
- | Verify successful message transfer across connectors and routes. | + | |
- | Security Logs | + | === Message Paths and Mail flow === |
- | Mail Essential and Mail Security for exchange | + | * Send messages between internal servers using test accounts. |
- | View the security event log on Event Viewer and match security changes to known, authorized configuration changes. | + | * Check and verify that messages deliver successfully. |
- | Investigate unauthorized security changes discovered in security event log. | + | * Send outgoing messages to non-local accounts. |
- | Check security news for latest virus, worm, and vulnerabilities. | + | * Check and verify that outgoing messages deliver successfully. With the test account on the external host, verify that mail comes in. |
- | Update and fix discovered security problems and vulnerabilities. | + | * Verify successful message transfer across connectors and routes. |
- | Verify that SMTP does not relay anonymously, | + | |
- | Verify that SSL is functioning for configured secure channels. | + | === Security Logs === |
- | Update virus signatures daily. | + | * //Mail Essential// and //Mail Security for Exchange// |
- | Note: All the backups sync to the local hard drive. | + | |
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | |||
+ | Note: All the backups sync to the local hard drive. | ||
== CRM, OnContact Server == | == CRM, OnContact Server == | ||
- | Verify that SQL Services are running (SQL Agent) | + | * Verify that SQL Services are running (SQL Agent). |
- | Verify that SQL Agent jobs succeeded | + | |
- | Verify that spindles have free space | + | |
- | Verify that data and log files for each database have free space | + | |
- | Check Backups | + | === Check Backups |
- | Make sure that the recommended minimum backup strategy of a daily online backup is completed. | + | |
- | Verify that the previous backup operation completed. | + | |
- | Verify that full backups succeeded | + | |
- | Verify that transactional log Backups succeeded | + | |
- | Analyze and respond to errors and warnings during the backup operation. | + | |
- | Verify that the transaction logs were successfully purged (if your backup type is purging logs). | + | |
- | Performance | + | === Performance |
- | % Processor Time | + | |
- | Available MBs | + | |
- | % Committed Bytes in Use | + | |
- | Event Logs | + | === Event Logs === |
- | Filter application and system logs on the SQL to see all errors. | + | |
- | Filter application and system logs on the SQL server to see all warnings. | + | |
- | Note repetitive warning and error logs. | + | |
- | Respond to discovered failures and problems. | + | |
- | Note: All the backups sync to the local hard drive. | + | |
+ | Note: All the backups sync to the local hard drive. | ||
== Infusion server == | == Infusion server == | ||
- | Verify that SQL Services are running (ie. SQL Agent) | + | * Verify that SQL Services are running (ie. SQL Agent). |
- | Verify that SQL Agent jobs succeeded | + | |
- | Verify that spindles have free space | + | |
- | Verify that data and log files for each database have free space | + | |
- | Check Backups | + | === Check Backups |
- | Make sure that the recommended minimum backup strategy of a daily online backup is completed. | + | |
- | Verify that the previous backup operation completed. | + | |
- | Verify that full backups succeeded | + | |
- | Verify that transactional log Backups succeeded | + | |
- | Analyze and respond to errors and warnings during the backup operation. | + | |
- | Verify that the transaction logs were successfully purged (if your backup type is purging logs). | + | |
- | Performance | + | === Performance |
- | % Processor Time | + | |
- | Available MBs | + | |
- | % Committed Bytes in Use | + | |
- | Event Logs | + | === Event Logs === |
- | Filter application and system logs on the SQL to see all errors. | + | |
- | Filter application and system logs on the SQL server to see all warnings. | + | |
- | Note repetitive warning and error logs. | + | |
- | Respond to discovered failures and problems. | + | |
- | Note: All the backups sync to the local hard drive. | + | |
+ | Note: All the backups sync to the local hard drive. | ||
== Time Clock Server == | == Time Clock Server == | ||
- | Clock communication – general items | + | * Clock communication – general items. |
- | Clock communication – error messages | + | |
- | Clock communication – error situational problems | + | |
- | Make sure that the recommended minimum backup strategy of a daily online backup is completed. | + | |
- | Verify that the previous backup operation completed. | + | |
- | Verify that full backups succeeded | + | |
- | Verify that transactional log Backups | + | |
- | Analyze and respond to errors and warnings during the backup operation. | + | |
- | Verify that the transaction logs were successfully purged (if your backup type is purging logs). | + | |
Note: All the backups sync to the local hard drive. | Note: All the backups sync to the local hard drive. | ||
== File Server == | == File Server == | ||
- | Check application and system logs on the server to see all errors. | + | * Check application and system logs on the server to see all errors. |
- | Check application and system logs on the Exchange server to see all warnings. | + | |
- | Note repetitive warning and error logs. | + | |
- | Respond to discovered failures and problems. | + | |
- | Use daily data from event log and System Monitor | + | |
- | Check on disk usage. | + | |
- | Check on memory and CPU usage. | + | |
- | Check uptime and availability. | + | |
- | List the top generated, resolved, and pending incidents. | + | |
- | Create solutions for unresolved incidents. | + | |
- | Check anti-virus definition updates timely. | + | |
- | Check server and network status for the overall organization and segments. | + | |
- | Check organizational performance and availability. | + | |
- | Check risk analysis and evaluation including upcoming changes. | + | |
- | Check capacity, availability, | + | |
- | Review items that have not met target objectives. | + | |
- | Note: Backup on this server is sync to the NAS | + | |
+ | Note: Backup on this server is sync to the NAS. | ||
== Spark Server == | == Spark Server == | ||
- | Check disk space availability | + | * Check disk space availability. |
- | Check status of backups | + | |
- | Check that the pmon process is running | + | |
- | No changes to /etc/passwd /etc/shadow /etc/hosts / | + | |
- | Check the latest entries in the logs | + | |
Note: Manual backup users/ | Note: Manual backup users/ | ||
- | == SWdev Server (Software Development) == | + | == Software Development |
- | Check disk space availability | + | |
- | Check status of backups | + | |
- | Check that the pmon process is running | + | |
- | No changes to /etc/passwd /etc/shadow /etc/hosts / | + | |
- | Check the latest entries in the logs | + | |
== Web Server (www) == | == Web Server (www) == | ||
- | Check disk space availability | + | * Check disk space availability. |
- | Check status of backups | + | |
- | Check that the pmon process is running | + | * Backup folder is ''/ |
- | No changes to /etc/passwd /etc/shadow /etc/hosts / | + | * Backup script is ''/ |
- | Check the latest entries in the logs | + | * Backup to mirror drive is ''/ |
- | Note: Backup sync/mirror to the internal drive and NAS | + | * Backup script to mirror drive is ''/ |
+ | * Backup of mirrored '' | ||
+ | rsync --daemon --config=/ | ||
+ | root@www:~# cat rsync-swdev.sh | ||
+ | # | ||
+ | |||
+ | #rsync --verbose | ||
+ | # --recursive --times --perms --links --delete \ | ||
+ | # --exclude " | ||
+ | # 192.168.0.160: | ||
+ | |||
+ | # Website | ||
+ | rsync --archive --verbose --progress --stats --rsh=/ | ||
+ | --recursive --times --perms --links --delete --exclude=stats \ | ||
+ | 192.168.0.160:: | ||
+ | |||
+ | # Databases | ||
+ | rsync --archive --verbose --progress --stats --rsh=/ | ||
+ | --recursive --times --perms --links --delete \ | ||
+ | 192.168.0.160:: | ||
+ | |||
+ | # Root user home | ||
+ | rsync --archive --verbose --progress --stats --rsh=/ | ||
+ | --recursive --times --perms --links --delete \ | ||
+ | 192.168.0.160:: | ||
+ | |||
+ | # Subserver Repositories | ||
+ | rsync --archive --verbose --progress --stats --rsh=/ | ||
+ | --recursive --times --perms --links --delete \ | ||
+ | 192.168.0.160:: | ||
+ | </ | ||
+ | * Check that the '' | ||
+ | | ||
+ | | ||
+ | |||
+ | Note: Backup sync/mirror to the internal drive and NAS. | ||
+ | |||
+ | == System36 Client Emulator Server (Bosânova) == | ||
+ | * User manual and installation procedures: '' | ||
+ | * Check for emulator server services are running. | ||
+ | * Check for users’ connectivity. | ||
- | == Emulator Server (BoSanova) | + | == Router/ |
- | User manual | + | * Check system monitor, CPU usage, uptime, disk usage, system load, and performance. |
- | Check for emulator server services | + | * Check web security, black list, custom sites, and policies. |
- | Check for users’ | + | |
+ | * Assign and adjust network configuration settings related to the IP addresses were given are met. | ||
+ | | ||
- | == Router/ | + | == Suggestions |
- | Check system monitor, CPU usage, uptime, disk usage, system load, and performance. | + | * Need to re-design a new network infrastructure for better productivity, connectivity, eliminate downtime, and point of failures. |
- | Check web security, black list, custom sites, and policies | + | * All production servers need to be replaced at least once every five years. |
- | Check and monitor remote user/VPN settings | + | * Need to replace all the home built servers: '' |
- | Assign | + | * Need to rebuild |
- | Check for system logs, error messages, and system diagnostics | + | * Need to rebuild |
+ | * Need a new gateway router that can monitor Audina bandwidth, productivity, and threats from the outside world. | ||
+ | * Need new network switches. | ||
+ | * Need to re-wire | ||
+ | * Need to install a patch panel. | ||
+ | * Eliminate all the small network switches, this will cause the slowness and bottleneck of the network. | ||
+ | * Need to replace all QC computers except Sherry’s computer. | ||
+ | * Need to have a better Internet bandwidth for better productivity. | ||
- | == Suggestion == | + | NOTE: These suggestions |
- | Need to re-design a new network infrastructure for better productivity, | + | |
- | All production servers need to be replaced at least once every five years | + | |
- | Need to replace all the home build servers: Infusion, Oncontact, and Timeclock. These servers do not have hardware redundant functionality to handle production environment. | + | |
- | Need to rebuild and replace fileserver because of hardware failure and running out of space | + | |
- | Need to rebuild and upgrade exchange server to exchange 2010 with backup and restore software licenses. | + | |
- | Need a new gateway router that can monitor Audina bandwidth, productivity, | + | |
- | Need new network switches | + | |
- | Need to re-wire the whole network infrastructure | + | |
- | Need to install a patch panel | + | |
- | Eliminate all the small network switches, this will cause the slowness and bottleneck of the network | + | |
- | Need to replace all QC computers except Sherry’s computer | + | |
- | Need to have a better Internet bandwidth for better productivity | + | |
- | NOTE: These suggestions | + |