Skip to main content

Needs for Graph Database

Needs for Graph Database:

We are living in the era of data, data is treated more precise than gold and platinum. Most of the enterprises are trying to get more insight about the data they have it as an operational / warehouse / analytical. 

graph example

 Ref.: https://dist.neo4j.com/wp-content/uploads/graph-example.png

To get more insight into the data, it is required to see the relationship among the data points. The challenge is how to establish the relationship among data points and the answers is Graph database. Relational databases can't help to establish the relationship among data points, due to their rigid schema, and consistent schema.

Relational Database issues for data set:

Number of Joins: 

While fetching data from relational databases, we join many tables, these joins are complex, and consume considerable amount of computing resources, which increase the query response times.

Self- joins:

For database ware house / business intelligence systems using RDBMS, self-JOIN are common for hierarchy and tree representation of data such as employee, and manager. When we traverse relationship by joining themselves, it results in an inefficient approach to retrieve the data.

Schema Changes:

Relational databases are not designed for frequent schema changes, and pivots. We are living in the era of agility, which requires frequent schema changes and flexibility.

Slow Queries:

Even though expert DBA put all efforts, use all tricks such as materialized view (computing past results ahead of time) , de normalize the entities to speed up the query, still queries are not fast enough to server the current business needs.

Graph database has the ability to address the issues, challenges of the RDBMS, let us explore 

Benefits of Graph database:

Agility:

In current agile software development process / method, test-driven development is an essential part. Modern Graph databases have features to server friction-less development, and graceful system maintenance.

Flexibility:

As we all know the speed does matter in current throat cut competition for the business, IT and data architect has to move at the speed of business. The structure and schema of graph data model is flexible and run with the needs of business. The IT team can add to the features require to the existing graph structure without endangering existing functionality.

Performance:

Graph database can deliver consistence performance even though the data grows every year. Graph database can handle very efficiently the data relationship. Graph database can deliver performance by several magnitude compared to RDBMS.

Ref.: https://neo4j.com/blog/why-graph-data-relationships-matter/

Comments

Popular posts from this blog

MySQL InnoDB cluster troubleshooting | commands

Cluster Validation: select * from performance_schema.replication_group_members; All members should be online. select instance_name, mysql_server_uuid, addresses from  mysql_innodb_cluster_metadata.instances; All instances should return same value for mysql_server_uuid SELECT @@GTID_EXECUTED; All nodes should return same value Frequently use commands: mysql> SET SQL_LOG_BIN = 0;  mysql> stop group_replication; mysql> set global super_read_only=0; mysql> drop database mysql_innodb_cluster_metadata; mysql> RESET MASTER; mysql> RESET SLAVE ALL; JS > var cluster = dba.getCluster() JS > var cluster = dba.getCluster("<Cluster_name>") JS > var cluster = dba.createCluster('name') JS > cluster.removeInstance('root@<IP_Address>:<Port_No>',{force: true}) JS > cluster.addInstance('root@<IP add>,:<port>') JS > cluster.addInstance('root@ <IP add>,:<port> ') JS > dba.getC...

InnoDB cluster Remove Instance Force | Add InnoDB instance

InnoDB cluster environment UUID is different on node: To fix it stop group replication, remove instance (use force if require), add instance back Identify the node which is not in sync: Execute following SQL statement on each node and identify the node has different UUID on all nodes. mysql> select * from mysql_innodb_cluster_metadata.instances; Stop group replication: Stop group replication on the node which does not have same UUID on all nodes. mysql > stop GROUP_REPLICATION; Remove instances from cluster: Remove all secondary node from the cluster and add them back if require. $mysqlsh JS >\c root@<IP_Address>:<Port_No> JS > dba.getCluster().status() JS > dba.getCluster () <Cluster:cluster_name> JS > var cluster = dba.getCluster("cluster_name"); JS >  cluster.removeInstance('root@<IP_Address>:<Port_No>'); If you get "Cluster.removeInstance: Timeout reached waiting......" JS > cluster.removeInstance(...

MySQL slave Error_code: 1032 | MySQL slave drift | HA_ERR_KEY_NOT_FOUND

MySQL slave Error_code: 1032 | MySQL slave drift: With several MySQL, instance with master slave replication, I have one analytics MySQL, environment which is larger in terabytes, compared to other MySQL instances in the environment. Other MySQL instances with terabytes of data are running fine master, slave replication. But this analytics environment get started generating slave Error_code :1032. mysql> show slave status; Near relay log: Error_code: 1032; Can't find record in '<table_name>', Error_code: 1032; handler error HA_ERR_KEY_NOT_FOUND; the event's master log <name>-bin.000047, end_log_pos 5255306 Near master section: Could not execute Update_rows event on table <db_name>.<table_name>; Can't find record in '<table_name>', Error_code: 1032; Can't find record in '<table_name>', Error_code: 1032; handler error HA_ERR_KEY_NOT_FOUND; the event's master log <name>-bin.000047, end_l...