Why not use SQL instead of GraphQL? Unicorn Meta Zoo #1: Why another podcast? Announcing the arrival of Valued Associate #679: Cesar ManaraWhy use services (REST/SOAP) instead of a library?Why to use web services instead of direct access to a relational database for an android app?Why creating a new MDX language instead of extending SQL?Why use JSP instead of servlets?How to efficiently query movies with characters via The Movie DB API?Is using REST consumed by a Javascript framework on the front-end the same as using microservices?GraphQL - Should I expose link tables?Is there a valid alternative to HTTP headers for sending auth credentialsHow to store blog post in database - React & GraphQLIs GraphQL REST compliant (almost)?
How to keep bees out of canned beverages?
"Whatever a Russian does, they end up making the Kalashnikov gun"? Are there any similar proverbs in English?
Is Electric Central Heating worth it if using Solar Panels?
Map material from china not allowed to leave the country
What *exactly* is electrical current, voltage, and resistance?
How to find if a column is referenced in a computed column?
Is there any pythonic way to find average of specific tuple elements in array?
How would this chord from "Rocket Man" be analyzed?
Has a Nobel Peace laureate ever been accused of war crimes?
Check if a string is entirely made of the same substring
"Rubric" as meaning "signature" or "personal mark" -- is this accepted usage?
How much of a wave function must reside inside event horizon for it to be consumed by the black hole?
First instead of 1 when referencing
When do you need buffers/drivers on buses in a microprocessor design?
Contradiction proof for inequality of P and NP?
std::unique_ptr of base class holding reference of derived class does not show warning in gcc compiler while naked pointer shows it. Why?
A faster way to compute the largest prime factor
Israeli soda type drink
Can you stand up from being prone using Skirmisher outside of your turn?
Why didn't the Space Shuttle bounce back into space as many times as possible so as to lose a lot of kinetic energy up there?
Can I criticise the more senior developers around me for not writing clean code?
My admission is revoked after accepting the admission offer
tikz-feynman: edge labels
My bank got bought out, am I now going to have to start filing tax returns in a different state?
Why not use SQL instead of GraphQL?
Unicorn Meta Zoo #1: Why another podcast?
Announcing the arrival of Valued Associate #679: Cesar ManaraWhy use services (REST/SOAP) instead of a library?Why to use web services instead of direct access to a relational database for an android app?Why creating a new MDX language instead of extending SQL?Why use JSP instead of servlets?How to efficiently query movies with characters via The Movie DB API?Is using REST consumed by a Javascript framework on the front-end the same as using microservices?GraphQL - Should I expose link tables?Is there a valid alternative to HTTP headers for sending auth credentialsHow to store blog post in database - React & GraphQLIs GraphQL REST compliant (almost)?
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;
Recently I learned about GraphQL which claims to be superior to RESTful. However, I started wondering why don't we simply put SQL statements into an HTTP GET request.
For example, in GraphQL I would write
Movie(id: "cixos5gtq0ogi0126tvekxo27")
id
title
actors
name
Which isn't much simpler than its SQL counterpart
SELECT id, title FROM movies WHERE id = cixos5gtq0ogi0126tvekxo27;
SELECT actors.name FROM actors, actors_movies WHERE actors.id == movies.actor_id AND movie.id == cixos5gtq0ogi0126tvekxo27;
Maybe we can URL-encode the query and send to the server
GET endpoint?q=SELECT%20id%2C%20title%20FROM%20movies%20WHERE%20id%20%3D%20cixos5gtq0ogi0126tvekxo27%3B%0ASELECT%20actors.name%20FROM%20actors%2C%20actors_movies%20WHERE%20actors.id%20%3D%3D%20movies.actor_id%20AND%20movie.id%20%3D%3D%20cixos5gtq0ogi0126tvekxo27%3B HTTP/1.1
Yes, the query URL can be too long, but you can put it into the body of a POST request if you don't care about REST compliance. (By the way, I think the HTTP RFC need be revised for REST to make sense: capping the length of query strings mixes implementation with specification at the very beginning)
Directly issuing SQL from the client also has the advantage of
- No server-side code/library is required to parse GraphQL, reducing development time.
- No server-side overhead is needed to parse GraphQL, reducing runtime.
- SQL statements are much more flexible than GraphQL because (in most cases) the latter will reduce to SQL anyway.
- Everyone knows SQL.
So, what the advantages GraphQL have over SQL?
architecture database web-development api web-services
|
show 17 more comments
Recently I learned about GraphQL which claims to be superior to RESTful. However, I started wondering why don't we simply put SQL statements into an HTTP GET request.
For example, in GraphQL I would write
Movie(id: "cixos5gtq0ogi0126tvekxo27")
id
title
actors
name
Which isn't much simpler than its SQL counterpart
SELECT id, title FROM movies WHERE id = cixos5gtq0ogi0126tvekxo27;
SELECT actors.name FROM actors, actors_movies WHERE actors.id == movies.actor_id AND movie.id == cixos5gtq0ogi0126tvekxo27;
Maybe we can URL-encode the query and send to the server
GET endpoint?q=SELECT%20id%2C%20title%20FROM%20movies%20WHERE%20id%20%3D%20cixos5gtq0ogi0126tvekxo27%3B%0ASELECT%20actors.name%20FROM%20actors%2C%20actors_movies%20WHERE%20actors.id%20%3D%3D%20movies.actor_id%20AND%20movie.id%20%3D%3D%20cixos5gtq0ogi0126tvekxo27%3B HTTP/1.1
Yes, the query URL can be too long, but you can put it into the body of a POST request if you don't care about REST compliance. (By the way, I think the HTTP RFC need be revised for REST to make sense: capping the length of query strings mixes implementation with specification at the very beginning)
Directly issuing SQL from the client also has the advantage of
- No server-side code/library is required to parse GraphQL, reducing development time.
- No server-side overhead is needed to parse GraphQL, reducing runtime.
- SQL statements are much more flexible than GraphQL because (in most cases) the latter will reduce to SQL anyway.
- Everyone knows SQL.
So, what the advantages GraphQL have over SQL?
architecture database web-development api web-services
38
Little Bobby Tables.
– Philip Kendall
Apr 6 at 7:05
1
1. I can still DoS you with arbitrarily complicated SQL queries. 2. There's no chance a malicious actor would ever obtain a valid key...
– Philip Kendall
Apr 6 at 7:16
1
@PhilipKendall You are right, but using GraphQL (or REST or whatever) doesn't solve these problems either, right?
– nalzok
Apr 6 at 7:18
7
@nalzok: SQL is Turing-complete, which means it is impossible to validate statically.
– Jörg W Mittag
Apr 6 at 7:45
3
This is very simple to understand why it's a terrible idea. Implement it yourself. At some point, you will realise that your are investing the time mostly in 1 thing: security. Not too later you will feel somewhat upset because you are implementing a caped TOAD. Then you will realise how hard is mapping rows all over the system and you will try to reinvent the ORM wheel on both sides: client and server. By the time you give up, your PM will ask you for report: how is the users' service going? Is it done?"...
– Laiv
Apr 6 at 12:24
|
show 17 more comments
Recently I learned about GraphQL which claims to be superior to RESTful. However, I started wondering why don't we simply put SQL statements into an HTTP GET request.
For example, in GraphQL I would write
Movie(id: "cixos5gtq0ogi0126tvekxo27")
id
title
actors
name
Which isn't much simpler than its SQL counterpart
SELECT id, title FROM movies WHERE id = cixos5gtq0ogi0126tvekxo27;
SELECT actors.name FROM actors, actors_movies WHERE actors.id == movies.actor_id AND movie.id == cixos5gtq0ogi0126tvekxo27;
Maybe we can URL-encode the query and send to the server
GET endpoint?q=SELECT%20id%2C%20title%20FROM%20movies%20WHERE%20id%20%3D%20cixos5gtq0ogi0126tvekxo27%3B%0ASELECT%20actors.name%20FROM%20actors%2C%20actors_movies%20WHERE%20actors.id%20%3D%3D%20movies.actor_id%20AND%20movie.id%20%3D%3D%20cixos5gtq0ogi0126tvekxo27%3B HTTP/1.1
Yes, the query URL can be too long, but you can put it into the body of a POST request if you don't care about REST compliance. (By the way, I think the HTTP RFC need be revised for REST to make sense: capping the length of query strings mixes implementation with specification at the very beginning)
Directly issuing SQL from the client also has the advantage of
- No server-side code/library is required to parse GraphQL, reducing development time.
- No server-side overhead is needed to parse GraphQL, reducing runtime.
- SQL statements are much more flexible than GraphQL because (in most cases) the latter will reduce to SQL anyway.
- Everyone knows SQL.
So, what the advantages GraphQL have over SQL?
architecture database web-development api web-services
Recently I learned about GraphQL which claims to be superior to RESTful. However, I started wondering why don't we simply put SQL statements into an HTTP GET request.
For example, in GraphQL I would write
Movie(id: "cixos5gtq0ogi0126tvekxo27")
id
title
actors
name
Which isn't much simpler than its SQL counterpart
SELECT id, title FROM movies WHERE id = cixos5gtq0ogi0126tvekxo27;
SELECT actors.name FROM actors, actors_movies WHERE actors.id == movies.actor_id AND movie.id == cixos5gtq0ogi0126tvekxo27;
Maybe we can URL-encode the query and send to the server
GET endpoint?q=SELECT%20id%2C%20title%20FROM%20movies%20WHERE%20id%20%3D%20cixos5gtq0ogi0126tvekxo27%3B%0ASELECT%20actors.name%20FROM%20actors%2C%20actors_movies%20WHERE%20actors.id%20%3D%3D%20movies.actor_id%20AND%20movie.id%20%3D%3D%20cixos5gtq0ogi0126tvekxo27%3B HTTP/1.1
Yes, the query URL can be too long, but you can put it into the body of a POST request if you don't care about REST compliance. (By the way, I think the HTTP RFC need be revised for REST to make sense: capping the length of query strings mixes implementation with specification at the very beginning)
Directly issuing SQL from the client also has the advantage of
- No server-side code/library is required to parse GraphQL, reducing development time.
- No server-side overhead is needed to parse GraphQL, reducing runtime.
- SQL statements are much more flexible than GraphQL because (in most cases) the latter will reduce to SQL anyway.
- Everyone knows SQL.
So, what the advantages GraphQL have over SQL?
architecture database web-development api web-services
architecture database web-development api web-services
edited Apr 7 at 5:45
whatsisname
25.1k136788
25.1k136788
asked Apr 6 at 7:04
nalzoknalzok
1839
1839
38
Little Bobby Tables.
– Philip Kendall
Apr 6 at 7:05
1
1. I can still DoS you with arbitrarily complicated SQL queries. 2. There's no chance a malicious actor would ever obtain a valid key...
– Philip Kendall
Apr 6 at 7:16
1
@PhilipKendall You are right, but using GraphQL (or REST or whatever) doesn't solve these problems either, right?
– nalzok
Apr 6 at 7:18
7
@nalzok: SQL is Turing-complete, which means it is impossible to validate statically.
– Jörg W Mittag
Apr 6 at 7:45
3
This is very simple to understand why it's a terrible idea. Implement it yourself. At some point, you will realise that your are investing the time mostly in 1 thing: security. Not too later you will feel somewhat upset because you are implementing a caped TOAD. Then you will realise how hard is mapping rows all over the system and you will try to reinvent the ORM wheel on both sides: client and server. By the time you give up, your PM will ask you for report: how is the users' service going? Is it done?"...
– Laiv
Apr 6 at 12:24
|
show 17 more comments
38
Little Bobby Tables.
– Philip Kendall
Apr 6 at 7:05
1
1. I can still DoS you with arbitrarily complicated SQL queries. 2. There's no chance a malicious actor would ever obtain a valid key...
– Philip Kendall
Apr 6 at 7:16
1
@PhilipKendall You are right, but using GraphQL (or REST or whatever) doesn't solve these problems either, right?
– nalzok
Apr 6 at 7:18
7
@nalzok: SQL is Turing-complete, which means it is impossible to validate statically.
– Jörg W Mittag
Apr 6 at 7:45
3
This is very simple to understand why it's a terrible idea. Implement it yourself. At some point, you will realise that your are investing the time mostly in 1 thing: security. Not too later you will feel somewhat upset because you are implementing a caped TOAD. Then you will realise how hard is mapping rows all over the system and you will try to reinvent the ORM wheel on both sides: client and server. By the time you give up, your PM will ask you for report: how is the users' service going? Is it done?"...
– Laiv
Apr 6 at 12:24
38
38
Little Bobby Tables.
– Philip Kendall
Apr 6 at 7:05
Little Bobby Tables.
– Philip Kendall
Apr 6 at 7:05
1
1
1. I can still DoS you with arbitrarily complicated SQL queries. 2. There's no chance a malicious actor would ever obtain a valid key...
– Philip Kendall
Apr 6 at 7:16
1. I can still DoS you with arbitrarily complicated SQL queries. 2. There's no chance a malicious actor would ever obtain a valid key...
– Philip Kendall
Apr 6 at 7:16
1
1
@PhilipKendall You are right, but using GraphQL (or REST or whatever) doesn't solve these problems either, right?
– nalzok
Apr 6 at 7:18
@PhilipKendall You are right, but using GraphQL (or REST or whatever) doesn't solve these problems either, right?
– nalzok
Apr 6 at 7:18
7
7
@nalzok: SQL is Turing-complete, which means it is impossible to validate statically.
– Jörg W Mittag
Apr 6 at 7:45
@nalzok: SQL is Turing-complete, which means it is impossible to validate statically.
– Jörg W Mittag
Apr 6 at 7:45
3
3
This is very simple to understand why it's a terrible idea. Implement it yourself. At some point, you will realise that your are investing the time mostly in 1 thing: security. Not too later you will feel somewhat upset because you are implementing a caped TOAD. Then you will realise how hard is mapping rows all over the system and you will try to reinvent the ORM wheel on both sides: client and server. By the time you give up, your PM will ask you for report: how is the users' service going? Is it done?"...
– Laiv
Apr 6 at 12:24
This is very simple to understand why it's a terrible idea. Implement it yourself. At some point, you will realise that your are investing the time mostly in 1 thing: security. Not too later you will feel somewhat upset because you are implementing a caped TOAD. Then you will realise how hard is mapping rows all over the system and you will try to reinvent the ORM wheel on both sides: client and server. By the time you give up, your PM will ask you for report: how is the users' service going? Is it done?"...
– Laiv
Apr 6 at 12:24
|
show 17 more comments
4 Answers
4
active
oldest
votes
Basically, abstraction.
SQL requires your clients to know your exact database structure, which is not good. On top of that, analysing the SQL in order to perform special operations based on the value sent as the input is a really difficult thing to do. There are entire softwares which are pretty much responsible only for that. Do you know what those are? If you have guessed the databases, you are right.
Thanks to not exposing the SQL directly, you are not limiting the consumer of the API to the internal representation of your database. You easily expose only what you want to expose.
And since clients of the API depend only on the abstraction, you are free to have as many layers as possible between the API input and the actual database (security, caching, loading data from multiple databases on a single request,...).
For public services, exposing a database directly is pretty much never the right approach. If you however have a few internal systems, sure, your approach might make sense but even then it might just be easier to connect to application A's database directly from Application B by giving the database credentials to the Application B, rather than trying to come up with a custom HTTP interface for the database SQL language.
Why can't I just compare the URL (or SQL query) against keys in Redis
before performing the actual query on the RDBMS?
Because it's not easy. Even if someone uses a very simple query, such as:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
how do you make sure the result is properly cached? This query includes newlines, but someone could just as well write the query in the following way:
SELECT st.id, jt.name FROM some_table st INNER JOIN join_table jt ON jt.some_table_id = st.id WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
and it's still supposed to be cached in the same way as the one above. I have specifically included a where in which a string search contains a new line, so simply finding line endings and replacing them with a space is not going to work here, parsing the request correctly would be much more complicated.
And even if you do fix that, another query could switch the order of conditions and the query would look like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world'
and another request could contain a redundant WHERE
argument, like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world' AND st.stype = 'STANDARD'
All of those queries are still supposed to return the same result, should be cached in the same way. But handling all of the possible options is pretty much impossible. That's why you cannot simply compare the URL against keys in Redis.
This is a nice answer, but please see the update.
– nalzok
Apr 7 at 4:16
add a comment |
In theory there is no reason you can't expose an SQL interface like this.
In practice SQL is far too powerful to be effectively limited to the security scope you want to expose.
Even if you allow only read access, a bad query can still hog resources.
Other languages such as graphQL are designed to be exposed. They are merely giving users a filter option on what they could already see.
The benefit of using these languages is that they have gone through all the things you would want to stop users doing in SQL and taken them off the table.
1
Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
– nalzok
Apr 7 at 3:14
But i can write a recursive SQL query that will lock your table and prevent other users from running any queries at all
– Ewan
Apr 7 at 11:30
4
the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
– Ewan
Apr 7 at 11:38
add a comment |
As others have mentioned, exposing SQL directly in the api is a very bad option. GraphQL, despite it's name, is not an abstraction for SQL, but for any data store or even other services.
If you are looking for an abstraction that is closer to SQL, you might want to have a look at odata (if you happen to work in .NET backends, though maybe other implementations exist).
add a comment |
if you want expose SQL like GraphQL, you will could need something like GraphQL, because you will need hide the important information and select what you want to show in the API, this for security.
GraphQl and SQL are different things, SQL is the language to query DataBase and GraphQL is only to manage the data from API,in API you will need make yours schemas to show and querys to manage it, etc.
in any API you will need make those things for simply security, but if you want something that is free data access maybe it would work, you know so many alternatives in the software world
add a comment |
protected by gnat Apr 7 at 8:49
Thank you for your interest in this question.
Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).
Would you like to answer one of these unanswered questions instead?
4 Answers
4
active
oldest
votes
4 Answers
4
active
oldest
votes
active
oldest
votes
active
oldest
votes
Basically, abstraction.
SQL requires your clients to know your exact database structure, which is not good. On top of that, analysing the SQL in order to perform special operations based on the value sent as the input is a really difficult thing to do. There are entire softwares which are pretty much responsible only for that. Do you know what those are? If you have guessed the databases, you are right.
Thanks to not exposing the SQL directly, you are not limiting the consumer of the API to the internal representation of your database. You easily expose only what you want to expose.
And since clients of the API depend only on the abstraction, you are free to have as many layers as possible between the API input and the actual database (security, caching, loading data from multiple databases on a single request,...).
For public services, exposing a database directly is pretty much never the right approach. If you however have a few internal systems, sure, your approach might make sense but even then it might just be easier to connect to application A's database directly from Application B by giving the database credentials to the Application B, rather than trying to come up with a custom HTTP interface for the database SQL language.
Why can't I just compare the URL (or SQL query) against keys in Redis
before performing the actual query on the RDBMS?
Because it's not easy. Even if someone uses a very simple query, such as:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
how do you make sure the result is properly cached? This query includes newlines, but someone could just as well write the query in the following way:
SELECT st.id, jt.name FROM some_table st INNER JOIN join_table jt ON jt.some_table_id = st.id WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
and it's still supposed to be cached in the same way as the one above. I have specifically included a where in which a string search contains a new line, so simply finding line endings and replacing them with a space is not going to work here, parsing the request correctly would be much more complicated.
And even if you do fix that, another query could switch the order of conditions and the query would look like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world'
and another request could contain a redundant WHERE
argument, like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world' AND st.stype = 'STANDARD'
All of those queries are still supposed to return the same result, should be cached in the same way. But handling all of the possible options is pretty much impossible. That's why you cannot simply compare the URL against keys in Redis.
This is a nice answer, but please see the update.
– nalzok
Apr 7 at 4:16
add a comment |
Basically, abstraction.
SQL requires your clients to know your exact database structure, which is not good. On top of that, analysing the SQL in order to perform special operations based on the value sent as the input is a really difficult thing to do. There are entire softwares which are pretty much responsible only for that. Do you know what those are? If you have guessed the databases, you are right.
Thanks to not exposing the SQL directly, you are not limiting the consumer of the API to the internal representation of your database. You easily expose only what you want to expose.
And since clients of the API depend only on the abstraction, you are free to have as many layers as possible between the API input and the actual database (security, caching, loading data from multiple databases on a single request,...).
For public services, exposing a database directly is pretty much never the right approach. If you however have a few internal systems, sure, your approach might make sense but even then it might just be easier to connect to application A's database directly from Application B by giving the database credentials to the Application B, rather than trying to come up with a custom HTTP interface for the database SQL language.
Why can't I just compare the URL (or SQL query) against keys in Redis
before performing the actual query on the RDBMS?
Because it's not easy. Even if someone uses a very simple query, such as:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
how do you make sure the result is properly cached? This query includes newlines, but someone could just as well write the query in the following way:
SELECT st.id, jt.name FROM some_table st INNER JOIN join_table jt ON jt.some_table_id = st.id WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
and it's still supposed to be cached in the same way as the one above. I have specifically included a where in which a string search contains a new line, so simply finding line endings and replacing them with a space is not going to work here, parsing the request correctly would be much more complicated.
And even if you do fix that, another query could switch the order of conditions and the query would look like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world'
and another request could contain a redundant WHERE
argument, like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world' AND st.stype = 'STANDARD'
All of those queries are still supposed to return the same result, should be cached in the same way. But handling all of the possible options is pretty much impossible. That's why you cannot simply compare the URL against keys in Redis.
This is a nice answer, but please see the update.
– nalzok
Apr 7 at 4:16
add a comment |
Basically, abstraction.
SQL requires your clients to know your exact database structure, which is not good. On top of that, analysing the SQL in order to perform special operations based on the value sent as the input is a really difficult thing to do. There are entire softwares which are pretty much responsible only for that. Do you know what those are? If you have guessed the databases, you are right.
Thanks to not exposing the SQL directly, you are not limiting the consumer of the API to the internal representation of your database. You easily expose only what you want to expose.
And since clients of the API depend only on the abstraction, you are free to have as many layers as possible between the API input and the actual database (security, caching, loading data from multiple databases on a single request,...).
For public services, exposing a database directly is pretty much never the right approach. If you however have a few internal systems, sure, your approach might make sense but even then it might just be easier to connect to application A's database directly from Application B by giving the database credentials to the Application B, rather than trying to come up with a custom HTTP interface for the database SQL language.
Why can't I just compare the URL (or SQL query) against keys in Redis
before performing the actual query on the RDBMS?
Because it's not easy. Even if someone uses a very simple query, such as:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
how do you make sure the result is properly cached? This query includes newlines, but someone could just as well write the query in the following way:
SELECT st.id, jt.name FROM some_table st INNER JOIN join_table jt ON jt.some_table_id = st.id WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
and it's still supposed to be cached in the same way as the one above. I have specifically included a where in which a string search contains a new line, so simply finding line endings and replacing them with a space is not going to work here, parsing the request correctly would be much more complicated.
And even if you do fix that, another query could switch the order of conditions and the query would look like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world'
and another request could contain a redundant WHERE
argument, like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world' AND st.stype = 'STANDARD'
All of those queries are still supposed to return the same result, should be cached in the same way. But handling all of the possible options is pretty much impossible. That's why you cannot simply compare the URL against keys in Redis.
Basically, abstraction.
SQL requires your clients to know your exact database structure, which is not good. On top of that, analysing the SQL in order to perform special operations based on the value sent as the input is a really difficult thing to do. There are entire softwares which are pretty much responsible only for that. Do you know what those are? If you have guessed the databases, you are right.
Thanks to not exposing the SQL directly, you are not limiting the consumer of the API to the internal representation of your database. You easily expose only what you want to expose.
And since clients of the API depend only on the abstraction, you are free to have as many layers as possible between the API input and the actual database (security, caching, loading data from multiple databases on a single request,...).
For public services, exposing a database directly is pretty much never the right approach. If you however have a few internal systems, sure, your approach might make sense but even then it might just be easier to connect to application A's database directly from Application B by giving the database credentials to the Application B, rather than trying to come up with a custom HTTP interface for the database SQL language.
Why can't I just compare the URL (or SQL query) against keys in Redis
before performing the actual query on the RDBMS?
Because it's not easy. Even if someone uses a very simple query, such as:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
how do you make sure the result is properly cached? This query includes newlines, but someone could just as well write the query in the following way:
SELECT st.id, jt.name FROM some_table st INNER JOIN join_table jt ON jt.some_table_id = st.id WHERE st.name = 'hello
world' AND st.type = 'STANDARD'
and it's still supposed to be cached in the same way as the one above. I have specifically included a where in which a string search contains a new line, so simply finding line endings and replacing them with a space is not going to work here, parsing the request correctly would be much more complicated.
And even if you do fix that, another query could switch the order of conditions and the query would look like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world'
and another request could contain a redundant WHERE
argument, like this:
SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world' AND st.stype = 'STANDARD'
All of those queries are still supposed to return the same result, should be cached in the same way. But handling all of the possible options is pretty much impossible. That's why you cannot simply compare the URL against keys in Redis.
edited Apr 6 at 19:16
answered Apr 6 at 12:06
AndyAndy
7,72611638
7,72611638
This is a nice answer, but please see the update.
– nalzok
Apr 7 at 4:16
add a comment |
This is a nice answer, but please see the update.
– nalzok
Apr 7 at 4:16
This is a nice answer, but please see the update.
– nalzok
Apr 7 at 4:16
This is a nice answer, but please see the update.
– nalzok
Apr 7 at 4:16
add a comment |
In theory there is no reason you can't expose an SQL interface like this.
In practice SQL is far too powerful to be effectively limited to the security scope you want to expose.
Even if you allow only read access, a bad query can still hog resources.
Other languages such as graphQL are designed to be exposed. They are merely giving users a filter option on what they could already see.
The benefit of using these languages is that they have gone through all the things you would want to stop users doing in SQL and taken them off the table.
1
Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
– nalzok
Apr 7 at 3:14
But i can write a recursive SQL query that will lock your table and prevent other users from running any queries at all
– Ewan
Apr 7 at 11:30
4
the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
– Ewan
Apr 7 at 11:38
add a comment |
In theory there is no reason you can't expose an SQL interface like this.
In practice SQL is far too powerful to be effectively limited to the security scope you want to expose.
Even if you allow only read access, a bad query can still hog resources.
Other languages such as graphQL are designed to be exposed. They are merely giving users a filter option on what they could already see.
The benefit of using these languages is that they have gone through all the things you would want to stop users doing in SQL and taken them off the table.
1
Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
– nalzok
Apr 7 at 3:14
But i can write a recursive SQL query that will lock your table and prevent other users from running any queries at all
– Ewan
Apr 7 at 11:30
4
the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
– Ewan
Apr 7 at 11:38
add a comment |
In theory there is no reason you can't expose an SQL interface like this.
In practice SQL is far too powerful to be effectively limited to the security scope you want to expose.
Even if you allow only read access, a bad query can still hog resources.
Other languages such as graphQL are designed to be exposed. They are merely giving users a filter option on what they could already see.
The benefit of using these languages is that they have gone through all the things you would want to stop users doing in SQL and taken them off the table.
In theory there is no reason you can't expose an SQL interface like this.
In practice SQL is far too powerful to be effectively limited to the security scope you want to expose.
Even if you allow only read access, a bad query can still hog resources.
Other languages such as graphQL are designed to be exposed. They are merely giving users a filter option on what they could already see.
The benefit of using these languages is that they have gone through all the things you would want to stop users doing in SQL and taken them off the table.
edited Apr 6 at 17:34
answered Apr 6 at 13:54
EwanEwan
44.5k337100
44.5k337100
1
Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
– nalzok
Apr 7 at 3:14
But i can write a recursive SQL query that will lock your table and prevent other users from running any queries at all
– Ewan
Apr 7 at 11:30
4
the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
– Ewan
Apr 7 at 11:38
add a comment |
1
Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
– nalzok
Apr 7 at 3:14
But i can write a recursive SQL query that will lock your table and prevent other users from running any queries at all
– Ewan
Apr 7 at 11:30
4
the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
– Ewan
Apr 7 at 11:38
1
1
Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
– nalzok
Apr 7 at 3:14
Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
– nalzok
Apr 7 at 3:14
But i can write a recursive SQL query that will lock your table and prevent other users from running any queries at all
– Ewan
Apr 7 at 11:30
But i can write a recursive SQL query that will lock your table and prevent other users from running any queries at all
– Ewan
Apr 7 at 11:30
4
4
the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
– Ewan
Apr 7 at 11:38
the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
– Ewan
Apr 7 at 11:38
add a comment |
As others have mentioned, exposing SQL directly in the api is a very bad option. GraphQL, despite it's name, is not an abstraction for SQL, but for any data store or even other services.
If you are looking for an abstraction that is closer to SQL, you might want to have a look at odata (if you happen to work in .NET backends, though maybe other implementations exist).
add a comment |
As others have mentioned, exposing SQL directly in the api is a very bad option. GraphQL, despite it's name, is not an abstraction for SQL, but for any data store or even other services.
If you are looking for an abstraction that is closer to SQL, you might want to have a look at odata (if you happen to work in .NET backends, though maybe other implementations exist).
add a comment |
As others have mentioned, exposing SQL directly in the api is a very bad option. GraphQL, despite it's name, is not an abstraction for SQL, but for any data store or even other services.
If you are looking for an abstraction that is closer to SQL, you might want to have a look at odata (if you happen to work in .NET backends, though maybe other implementations exist).
As others have mentioned, exposing SQL directly in the api is a very bad option. GraphQL, despite it's name, is not an abstraction for SQL, but for any data store or even other services.
If you are looking for an abstraction that is closer to SQL, you might want to have a look at odata (if you happen to work in .NET backends, though maybe other implementations exist).
answered Apr 6 at 17:47
jannikbjannikb
613
613
add a comment |
add a comment |
if you want expose SQL like GraphQL, you will could need something like GraphQL, because you will need hide the important information and select what you want to show in the API, this for security.
GraphQl and SQL are different things, SQL is the language to query DataBase and GraphQL is only to manage the data from API,in API you will need make yours schemas to show and querys to manage it, etc.
in any API you will need make those things for simply security, but if you want something that is free data access maybe it would work, you know so many alternatives in the software world
add a comment |
if you want expose SQL like GraphQL, you will could need something like GraphQL, because you will need hide the important information and select what you want to show in the API, this for security.
GraphQl and SQL are different things, SQL is the language to query DataBase and GraphQL is only to manage the data from API,in API you will need make yours schemas to show and querys to manage it, etc.
in any API you will need make those things for simply security, but if you want something that is free data access maybe it would work, you know so many alternatives in the software world
add a comment |
if you want expose SQL like GraphQL, you will could need something like GraphQL, because you will need hide the important information and select what you want to show in the API, this for security.
GraphQl and SQL are different things, SQL is the language to query DataBase and GraphQL is only to manage the data from API,in API you will need make yours schemas to show and querys to manage it, etc.
in any API you will need make those things for simply security, but if you want something that is free data access maybe it would work, you know so many alternatives in the software world
if you want expose SQL like GraphQL, you will could need something like GraphQL, because you will need hide the important information and select what you want to show in the API, this for security.
GraphQl and SQL are different things, SQL is the language to query DataBase and GraphQL is only to manage the data from API,in API you will need make yours schemas to show and querys to manage it, etc.
in any API you will need make those things for simply security, but if you want something that is free data access maybe it would work, you know so many alternatives in the software world
answered Apr 6 at 17:29
Black HoleBlack Hole
1063
1063
add a comment |
add a comment |
protected by gnat Apr 7 at 8:49
Thank you for your interest in this question.
Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).
Would you like to answer one of these unanswered questions instead?
38
Little Bobby Tables.
– Philip Kendall
Apr 6 at 7:05
1
1. I can still DoS you with arbitrarily complicated SQL queries. 2. There's no chance a malicious actor would ever obtain a valid key...
– Philip Kendall
Apr 6 at 7:16
1
@PhilipKendall You are right, but using GraphQL (or REST or whatever) doesn't solve these problems either, right?
– nalzok
Apr 6 at 7:18
7
@nalzok: SQL is Turing-complete, which means it is impossible to validate statically.
– Jörg W Mittag
Apr 6 at 7:45
3
This is very simple to understand why it's a terrible idea. Implement it yourself. At some point, you will realise that your are investing the time mostly in 1 thing: security. Not too later you will feel somewhat upset because you are implementing a caped TOAD. Then you will realise how hard is mapping rows all over the system and you will try to reinvent the ORM wheel on both sides: client and server. By the time you give up, your PM will ask you for report: how is the users' service going? Is it done?"...
– Laiv
Apr 6 at 12:24