Dear Community,
I am working on a research project related to session understanding by LLMs, not just a single query or response.
We found that SDSS SkyServer provides SQL logs in addition to the following information:
| Name | Type | Description |
|---|---|---|
| theTime | datetime | the timestamp |
| webserver | varchar(64) | the url |
| winname | varchar(64) | the windows name of the server |
| clientIP | varchar(16) | client IP address |
| seq | int | sequence number to guarantee uniqueness of PK |
| server | varchar(32) | the name of the database server |
| dbname | varchar(32) | the name of the database |
| access | varchar(32) | The website DR1, collab,… |
| sql | varchar(7800) | the SQL statement |
| elapsed | real | the lapse time of the query |
| busy | real | the total CPU time of the query |
| [rows] | bigint | the number of rows generated |
| error | int | 0 if ok, otherwise the sql error #; negative numbers are generated by the procedure |
| errorMessage | varchar(2000) | the error message. |
But we are still looking for other log-based or session-based datasets for tasks such as user behavioral modeling, user intent classification, and so on.
I would be grateful if you could share any datasets for such tasks.