Subscribe to Personal AI $26.40/year

Because of rapid changes in technology some of these information maybe not valid anymore. But its a good snapshot in time how much knowledge was stored in my brain memory that I can always update. This is also my test that A.I + chatbot can help me learn once (meaning stored in invbat.com cloud) and o.k to forget because I can always go back If I needed my collected knowledge.
| 1. Amazon Web Services (AWS) is offering free usage tier of MySQL database | ||||||||
| 2. RDS ? means Relational Database Services | ||||||||
| 3. VPC ? means Virtual Private Cloud for Sales, VPC for Marketing, VPC for Operation, VPC for Engineering | ||||||||
| 4. AZ ? means Availability Zone | ||||||||
| 5. Public end user talk to Web Server not directly to the database . This is another security implementation for securing database. Then web server talk to database like Aurora database, Redshift database, DynamoDB and Storage like S3. | ||||||||
| 6. Security services or service group in AWS are like firewall to protect database | ||||||||
| 7. MySQL default port number is 3306. It is best practice to leave it as a default. The reason is if you accidentally change the port to another number some application that are globally configured for port 3306 for MySQL will be broken meaning they will return error message. | ||||||||
| 8. AWS RDS by default do automatic database backup and the default day is 7 days. It means if your database was deleted today or got corrupted and not working anymore due to malware you don't need to worry because you know you can restore your database using old good database. Do not set the database backup to 0 (zero day) because it means disable the automatic backup. | ||||||||
| 9. CLI ? means command line interface | ||||||||
| 10. AWS EC2 instance do not manage the database. To modify and manage your EC2 database go to RDS to manage your database. | ||||||||
| 11. Automatic failover ? means when the master database is not in service due maintenance automatic failover means the database query will failover or go to the replica database and your service availability is not affected. Replica database is synchronize to master database. Replica database is read only. | ||||||||
| 12. ARN ? means Amazon Resource Number | ||||||||
| 13. Use the DNS connection and not the IP address because IP address do not do automatic failover to replica database. | ||||||||
| 14. If your Database mainly used in online analytical processing (OLAP) and get updated every 30 days for example , it is possibly o.k. to set the back up setting in zero (0) day. Why ? zero day setting means no automatic back up. It is still o.k because the source data comes from other data source therefore there is no need to create a back up. Another advantage of zero (0) day backup setting is it increases the performance of your online analytical processing (OLAP). | ||||||||
| 15. Restore to point in time, it means if you know the point in time database was deleted or corrupted, you can restore the database from that point in time. | ||||||||
| 16. myDatabase_ss ? the suffix ss means SnapShot. Snapshot database is like a manual backup. It takes time if the database is big. | ||||||||
| 17. The benefit of learning how to create a snapshot of database as your backup database is you can share your snapshot database to different availability zone or different region for localization | ||||||||
| 18. When your database is taking more time to process or return answer to query it could be many users are connected to your database. To speed up the return of your search query you watch your database default basic performance monitoring dashboard and check how many user are connected to your database (look for DB connection dashboard). If many connected user are only read access user, then to solve the speed performance problem, you can add replica database to offload some processing from the master database. | ||||||||
| 19. Flat file database ? it means similar to comma separated value (csv) files from Excel or other similar file that contains record of information | ||||||||
| 20. Relational database, think of it as a lot of tables of datasets that are link together by primary key id and foreign key id. Primary key id for example is itemID under product table. It is a unique ID to retrieve any data from the Product table database. Foreign key id is a unique id that associates, is related to, linked to other table for example OrderID is a foreign key use to link the Order table database of record. Another example of foreign key, CustomerID is a foreign key of order table to link the Customer table database. | ||||||||
21. Product Table
|
||||||||
22. Order Table
|
||||||||
23. Customer table
|
||||||||
| 24. Terminology use in table database or tabular database literature . (tabular database compare to columnar database ?) | ||||||||
| 25. Rows - row of records, imagine the Excel row. It contains the data or record. The fancy term for rows is tuples | ||||||||
| 26. Columns - the name of the record or data, It is also called field name , attributes name, properties name. See above example | ||||||||
| 27. Tables - are also called relations that is why it is called relational database | ||||||||
| 28. Tables - are also called entities or objects | ||||||||
| 29. View - are the result of joining table to answer business question. We use SQL to join table so that it can answer specific business question. For example show me the list of top ten product with monthly sales over $10,000. View table is a curated table, pre-built joined table, data pipeline leading to a dashboard to answer specific important business question. | ||||||||
| 30. Aggregation - another term for joining tables is data aggregation. This is the hard part because database primary key id and foreign key id are not always available in siloed legacy databases. In order to aggregate or join the table from several siloed legacy database you need to extract them and place in common platform so you can join them in order to answer new ad-hoc business question. | ||||||||
| 31. Normalization - this is a hard to explain terminology in relational database. But the concept is simple. You do normalization when you evaluate column field names, attributes or properties of a Table database. For example take a Customer Table : When you evaluate the field name, or attributes or properties, you will ask the question does it make sense to include First_Name, Last_Name, Address, Telephone Number, SSN, E-mail and so on. | ||||||||
| 32. 1 NF? means First normalization format review of column field names, properties or attributes | ||||||||
| 33. 2 NF? means second normalization format review of column field names, properties or attributes | ||||||||
| 34. What is the use of this knowledge of 4 NF ? It is important to know because it affects the read time from database when you are doing query from the table database. Usually at 4 NF the read time takes longer because your are gathering or joining more column field names. When you are writing or appending your table 4NF is much faster because you are not writing redundant data. | ||||||||
| 35. AWS EC2 Instance database - this database solution takes more steps . step 1. You launch an instance of database step 2. Install the database service step 3. Open appropriate ports in security group step 4. connect to the database | ||||||||
| 36. AWS hosted service database much easy solution. step 1. You launch the database step 2. Connect to the database | ||||||||
| 37. Why bother learning EC2 instance database ? answer: Because EC2 instance is user defined, it can be used to install you old version of your database server in order for your application to keep serving your customer while you are migrating your old version of database server to newer version | ||||||||
| 38. Explain why it is better to use managed database service? Answer: Because you don't need to worry about database software version update and security patches update. They are done automatically by AWS hosted service | ||||||||
| 39. High availability of database - Solution 1 is clustering of database server. You are provisioning multiple database server. In AWS terminology Multiple server means multiple instances | ||||||||
| 40. Think of clustering as grapes fruit. Grapes fruit has many individual grape but connected on one stem. Individual grape represent your individual database instance connected to one network or stem. | ||||||||
| 41. When you read or hear the word clustering of database server, you must remember four things 1. multiple database server (multiple database instances) 2. One master database but have multiple replication 3. It increases database service availability 4. It do automatic failover to replica database. | ||||||||
| 42. Scalability - increase storage instances, processing memory and network capacity. | ||||||||
| 43. a read replica can be used to offload read only operation coming from user that has read only access privilege | ||||||||
| 44. RDS database support "at rest " encryption . "At rest" encryption must be implemented at creation time of database or at restoration time. | ||||||||
| 45. CRUD in database ? mean create , read, update, and delete task | ||||||||
| 46. Aurora database is built by AWS. It is a relational database. It is optimize for online transaction processing (OLTP) . Compare OLTP to online analytical processing (OLAP). OLTP database like Aurora are usually the critical database that needs to have a backup. While OLAP database are usually data extract use for analytical purposes to answer specific business question like compliance reporting | ||||||||
| 47. Aurora database default is 10 GB and it increment by 10 GB automatically up to 64 TB. Aurora can have 15 replica with automatic failover and can support 5 MySQL with no automatic failover. | ||||||||
| 48. Redshift database is another database built by AWS. It is used for data warehouse application. OLAP databases are usually stored in data warehouse databases like Redshift database. | ||||||||
| 49. Redshift single node can handle up to 160 TB of data. If your data requires more than 160 TB , you need to provision multiple node. | ||||||||
| 50. Multiple node has a leader node . The leader node handle the connection and queries. While the multiple node do data storage, execute queries and do calculation. Redshift database is very fast. Explain why Redshift is a very fast database? Redshift is a columnar database. Columnar database is like Excel spreadsheet. Because it is not joining tables (for tabular database) , it just reading, it read very fast which is very important in any OLAP. | ||||||||
| 51. Redshift also capable of doing data compression , another explanation for its performance speed. It allows massively parallel processing (MPP). | ||||||||
| 52. Redshift support SSL transit encryption. The encryption key is managed through AWS key management. | ||||||||
| 53. Redshift operates in one availability zone. Then you can take a snapshot backup and there restored the backup snapshot in different AZ. | ||||||||
| 54. DynamoDB is another database built by AWS. It is a NoSQL database service meaning it is not a relational database. One of its advantage is millisecond latency at any scale. At any scale means whether I have 100,000 user or 5 million user, the DynamoDB will response to serve the query result in millisecond delay. This could be INVBAT.COM - A.I. + Chatbot future database platform to serve growing user. | ||||||||
| 55. DynamoDB uses solid state drive (SSD) storage and the database are spread across 3 distinct data centers. | ||||||||
| 56. DynamoDB uses two read consistency type. number 1 is eventual consistent read. This read could have a delay of few seconds. The second one is strongly consistent read. This type of read has a few millisecond delays. | ||||||||
| 57. Amazon online store uses DynamoDB. |
Why do you need a personal augmented intelligence (AI) chatbot? Because it is useful in data storage, information retrieval, and fast computation with less error.
IN-V-BAT-AI uses explainable Artificial Intelligence (AI) to automate repetitive solved problem or routine calculation so we can focus our brain power to solve harder new problem then automate again once it is solved.

INVBAT.COM - A.I. is a disruptive innovation in computing and web search technology. For example scientific calculator help us speed up calculation
but we still need to remember accurately the formula and the correct sequence of data entry. Here comes the disruptive innovation from INVBAT.COM-A.I. ,
today the problem of remembering formula and the correct sequence of data entry is now solved by combining formula and calculation and make it on demand
using smartphone, tablet, notebook, Chromebook, laptop, desktop, school smartboard and company big screen tv in conference room with internet connection.