Commit 847eee6a authored by Riccardo Boero
Initial commit of README

# FACT Employment Official Statistics
Official labor statistics for the US and European countries. The data sources are the Bureau of Labor Statistics (BLS) and the Census Bureau in the US, and EUROSTAT in Europe. The datasets vary in frequency of observation (annual or quarterly), geographical scope, and industry detail.
---
## Use
1. Connect to the GitLab container registry with privileges for the FACT group:
> docker login -u FACT_token -p glpat-J5_QFJiGwLkfgqJsnU2H registry.git.nilu.no
2. Run this data service:
> docker run -t -i --name fact\_jobs -e MARIADB\_DATABASE=FACT_jobs -e MYSQL\_ROOT\_PASSWORD=devops -p 3308:3306 -d registry.git.nilu.no/fact/data/fact\_jobs:0.1
The container provides a MariaDB instance with the full employment database. It is reachable on port 3308 of localhost, and the root password is 'devops'.
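
As a minimal sketch, the connection settings above can be collected in one place and handed to any MariaDB/MySQL client. The example assumes the third-party PyMySQL driver (`pip install pymysql`), which is not part of this repository; `FACT_JOBS` and `count_rows` are illustrative names.

```python
# Connection settings taken from the `docker run` command above.
FACT_JOBS = {
    "host": "127.0.0.1",   # the container port is published on localhost
    "port": 3308,          # host port mapped to MariaDB's 3306
    "user": "root",
    "password": "devops",  # value of MYSQL_ROOT_PASSWORD
    "database": "FACT_jobs",
}

TABLES = {"LFS", "LODES8", "QCEW", "REA", "SBS"}


def count_rows(table, settings=FACT_JOBS):
    """Return the number of rows in one table (needs a running container)."""
    if table not in TABLES:
        raise ValueError(f"unknown table: {table}")
    import pymysql  # assumption: third-party driver, installed separately

    connection = pymysql.connect(**settings)
    try:
        with connection.cursor() as cursor:
            # The table name was checked against TABLES above.
            cursor.execute(f"SELECT COUNT(*) FROM {table}")
            return cursor.fetchone()[0]
    finally:
        connection.close()
```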
---
## Specifications
### Data size
```
+--------+-----------+----------+
| Table | Size (MB) | Rows (#) |
+--------+-----------+----------+
| LFS | 2.35 | 43278 |
| LODES8 | 2323.16 | 41592806 |
| QCEW | 6916.19 | 59684807 |
| REA | 22.52 | 418223 |
| SBS | 1.54 | 25938 |
+--------+-----------+----------+
```
### Data structure
![Database tables](FACT_jobs_db.png "Database tables")
### Data fields
#### LODES8 - LEHD Origin-Destination Employment Statistics, the U.S. Census
Data refers to **jobs**.
| Column | Description |
|---|---|
|GeoID| FIPS block id, 15 chars: STATE+COUNTY+TRACT+BLOCK |
|Year| 2002-2020|
|CNS01 | Number of jobs in NAICS sector 11 (Agriculture, Forestry, Fishing and Hunting)|
| CNS02 | Number of jobs in NAICS sector 21 (Mining, Quarrying, and Oil and Gas Extraction)|
| CNS03 | Number of jobs in NAICS sector 22 (Utilities)|
| CNS04 | Number of jobs in NAICS sector 23 (Construction)|
| CNS05 | Number of jobs in NAICS sector 31-33 (Manufacturing)|
| CNS06 | Number of jobs in NAICS sector 42 (Wholesale Trade)|
| CNS07 | Number of jobs in NAICS sector 44-45 (Retail Trade)|
| CNS08 | Number of jobs in NAICS sector 48-49 (Transportation and Warehousing)|
| CNS09 | Number of jobs in NAICS sector 51 (Information)|
| CNS10 | Number of jobs in NAICS sector 52 (Finance and Insurance)|
| CNS11 | Number of jobs in NAICS sector 53 (Real Estate and Rental and Leasing)|
| CNS12 | Number of jobs in NAICS sector 54 (Professional, Scientific, and Technical Services)|
| CNS13 | Number of jobs in NAICS sector 55 (Management of Companies and Enterprises)|
| CNS14 | Number of jobs in NAICS sector 56 (Administrative and Support and Waste Management and Remediation Services)|
| CNS15 | Number of jobs in NAICS sector 61 (Educational Services)|
| CNS16 | Number of jobs in NAICS sector 62 (Health Care and Social Assistance)|
| CNS17 | Number of jobs in NAICS sector 71 (Arts, Entertainment, and Recreation)|
| CNS18 | Number of jobs in NAICS sector 72 (Accommodation and Food Services)|
| CNS19 | Number of jobs in NAICS sector 81 (Other Services [except Public Administration])|
| CNS20 | Number of jobs in NAICS sector 92 (Public Administration)|
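
Since the block-level `GeoID` concatenates four FIPS codes, coarser geographies can be derived by slicing it. A minimal sketch, assuming the standard Census widths of 2 (state), 3 (county), 6 (tract), and 4 (block) characters and that `GeoID` is read as a string so leading zeros survive; the function name is illustrative and not part of the database:

```python
def split_block_geoid(geoid):
    """Split a 15-character FIPS block id into its components."""
    if len(geoid) != 15 or not geoid.isdigit():
        raise ValueError(f"expected 15 digits, got {geoid!r}")
    return {
        "state": geoid[:2],
        "county": geoid[2:5],
        "tract": geoid[5:11],
        "block": geoid[11:],
    }


parts = split_block_geoid("060371234561001")  # made-up id for illustration
county_fips = parts["state"] + parts["county"]  # 5-character county FIPS
```

The 5-character county prefix has the same form as a county-level FIPS code. Whether it matches the county-level `GeoID` values in QCEW exactly should be checked against the data.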
#### QCEW - Quarterly Census of Employment and Wages, the U.S. Bureau of Labor Statistics
| Column | Description |
|---|---|
|GeoID| FIPS: US, STATE, COUNTY |
|Year| 2000-2022|
|Agglvl_code |14 National, by NAICS Sector; 15 National, by NAICS 3-digit; 16 National, by NAICS 4-digit; 17 National, by NAICS 5-digit; 18 National, by NAICS 6-digit; 54 Statewide, NAICS Sector; 55 Statewide, NAICS 3-digit; 56 Statewide, NAICS 4-digit; 57 Statewide, NAICS 5-digit; 58 Statewide, NAICS 6-digit; 74 County, NAICS Sector; 75 County, NAICS 3-digit; 76 County, NAICS 4-digit; 77 County, NAICS 5-digit; 78 County, NAICS 6-digit |
|Q1_establishments| Number of establishments in quarter 1|
|Q1_disclosure| Percentage of establishments with disclosed information in quarter 1|
|Q1_avg_weekly_wage| Average weekly wage in quarter 1|
|Jan_jobs| Number of jobs in January|
|...|...|
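
The codes listed for `Agglvl_code` follow a regular pattern: the tens digit gives the geography and the units digit the NAICS detail. A small sketch of a decoder covering only the 15 codes listed above (the function and dictionary names are illustrative):

```python
GEOGRAPHY = {1: "National", 5: "Statewide", 7: "County"}
NAICS_DETAIL = {4: "Sector", 5: "3-digit", 6: "4-digit", 7: "5-digit", 8: "6-digit"}


def decode_agglvl(code):
    """Return (geography, NAICS detail) for an Agglvl_code listed in the table."""
    tens, units = divmod(int(code), 10)
    if tens not in GEOGRAPHY or units not in NAICS_DETAIL:
        raise ValueError(f"Agglvl_code {code} is not listed in this README")
    return GEOGRAPHY[tens], NAICS_DETAIL[units]


print(decode_agglvl(76))  # ('County', '4-digit')
```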
#### REA - Regional Economic Accounts, EUROSTAT
Data refers to **employed persons**.
| Column | Description |
|---|---|
|GeoID| NUTS 3 |
|Year| 1995-2021|
|Nace | Reduced level 1 NACE, Rev. 2|
|EmpTh | Thousands of Employed Persons|
#### LFS - Labour Force Survey, EUROSTAT
Data refers to **employed persons** of any sex and of any age above 15 years.
| Column | Description |
|---|---|
|GeoID| NUTS 0 - 2 char country code |
|Year| 2008-2021|
|Nace | Level 2 NACE, Rev. 2|
|EmpTh_Q1 | Thousands of Employed Persons in Quarter 1|
|EmpTh_Q2 | Thousands of Employed Persons in Quarter 2|
|EmpTh_Q3 | Thousands of Employed Persons in Quarter 3|
|EmpTh_Q4 | Thousands of Employed Persons in Quarter 4|
#### SBS - Structural Business Statistics, EUROSTAT
For now, the data include only source G (listed under Notes, EUROSTAT) and are hence limited to 2021.
| Column | Description |
|---|---|
|GeoID| NUTS 0 - 2 char country code |
|Year| 2008-2021|
|Nace | Level 4 NACE, Rev. 2|
|Enterprises| Number of enterprises |
|Employment| Persons employed - number |
|LaborCost| Unit labor cost per person employed, thousand euro|
---
## Notes
Data is selected, downloaded, and reorganized from multiple data sources.
### LODES
The U.S. Census Bureau regularly publishes the [Longitudinal Employer-Household Dynamics - LEHD](https://lehd.ces.census.gov/data/) and, within it, the [LEHD Origin-Destination Employment Statistics - LODES](https://lehd.ces.census.gov/data/lodes/).
The data considered here is the Workplace Area Characteristics (WAC) of LODES8, where the version number indicates the TIGER geographical boundary specification adopted (2020 Census blocks). Details are at https://lehd.ces.census.gov/data/lodes/LODES8/LODESTechDoc8.0.pdf.
The overall period is 2002-2020, but not all states are covered in every year. All 51 states are available only for the period 2011-2016.
### BLS
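The disclosure percentage refers to the number of establishments and is strongly influenced by the fact that data is reported by ownership code, without totals. Unsurprisingly, the more detailed the aggregation level, the lower the disclosure. The table below reports the average quarter 1 disclosure (on a 0-1 scale) by aggregation level.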
```
+-------------+--------------------------------+
| Agglvl_code | TRUNCATE(AVG(Q1_disclosure),4) |
+-------------+--------------------------------+
| 14 | 0.9999 |
| 15 | 0.9990 |
| 16 | 0.9992 |
| 17 | 0.9980 |
| 18 | 0.9975 |
| 54 | 0.9920 |
| 55 | 0.9536 |
| 56 | 0.8972 |
| 57 | 0.8534 |
| 58 | 0.7892 |
| 74 | 0.7723 |
| 75 | 0.6351 |
| 76 | 0.5094 |
| 77 | 0.4270 |
| 78 | 0.3971 |
+-------------+--------------------------------+
```
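
The column header suggests that the table comes from a grouped average over `QCEW`. The sketch below records a query of that shape (an assumption, since the exact statement is not given here) and mirrors the truncate-the-mean step in plain Python on made-up values.

```python
import math
from collections import defaultdict

# Assumed shape of the query behind the table above (not confirmed by the source).
DISCLOSURE_SQL = """
SELECT Agglvl_code, TRUNCATE(AVG(Q1_disclosure), 4)
FROM QCEW
GROUP BY Agglvl_code
ORDER BY Agglvl_code;
"""


def truncated_mean_by_level(rows, digits=4):
    """Average disclosure per Agglvl_code, truncated (not rounded) like SQL TRUNCATE."""
    groups = defaultdict(list)
    for agglvl_code, disclosure in rows:
        groups[agglvl_code].append(disclosure)
    factor = 10 ** digits
    return {
        code: math.trunc(sum(values) / len(values) * factor) / factor
        for code, values in sorted(groups.items())
    }


# Made-up rows of (Agglvl_code, Q1_disclosure), for illustration only.
sample = [(74, 1.0), (74, 0.5), (78, 0.25), (78, 0.5)]
print(truncated_mean_by_level(sample))
```

Truncation rather than rounding is used here only to match the `TRUNCATE` call in the column header.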
### EUROSTAT
#### REA
**A**. *Employment (thousand persons) by NUTS 3 regions (Level 1 NACE)*
https://ec.europa.eu/eurostat/databrowser/product/view/nama_10r_3empers
**B**. *Compensation of employees by NUTS 2 regions (Level 1 NACE)*
https://ec.europa.eu/eurostat/databrowser/product/view/nama_10r_2coe
**C**. *Employment (thousand hours worked) by NUTS 2 regions (Level 1 NACE)*
https://ec.europa.eu/eurostat/databrowser/product/view/nama_10r_2emhrw
#### LFS
**D**. *Employment by sex, age and detailed economic activity (from 2008 onwards, NACE Rev. 2 two digit level) - 1 000*
https://ec.europa.eu/eurostat/databrowser/product/view/lfsq_egan22d
**E**. *Employment by sex, age, economic activity and NUTS 2 regions (NACE Rev. 2 level 1) (1 000)*
https://ec.europa.eu/eurostat/databrowser/product/view/lfst_r_lfe2en2
#### SBS
**F**. *SBS data by NUTS 2 regions and NACE Rev. 2 (from 2008 onwards)*
https://ec.europa.eu/eurostat/databrowser/product/view/sbs_r_nuts06_r2
**G**. *Enterprises by detailed NACE Rev.2 activity and special aggregates (from 2021)*
https://ec.europa.eu/eurostat/databrowser/product/view/sbs_ovw_act
**H**. *SBS historical data 2011-2020 - Country, 4-digits*
Annual detailed enterprise statistics for industry (NACE Rev. 2, B-E)
https://ec.europa.eu/eurostat/databrowser/product/view/sbs_na_ind_r2
Annual detailed enterprise statistics for construction (NACE Rev. 2, F)
https://ec.europa.eu/eurostat/databrowser/product/view/sbs_na_con_r2
Annual detailed enterprise statistics for services (NACE Rev. 2 H-N and S95)
https://ec.europa.eu/eurostat/databrowser/product/view/sbs_na_1a_se_r2
Annual detailed enterprise statistics for trade (NACE Rev. 2 G)
https://ec.europa.eu/eurostat/databrowser/product/view/sbs_na_dt_r2
| Ref. | Vars | NUTS | Time | NACE | Notes |
|---|---|---|---|---|---|
| A | e | 3 | 1995-2021, annual | 1r | 11 industries |
| B | s | 2 | | 1r | |
| C | e | 2 | 1995-2021, annual | 1r | |
| D | e | 0 | 2008-2023, quarterly | 2 | |
| E | e | 2 | 2008-2022, annual | 1r | 10 industries |
| F | e,u,s | 2 | 2008-2020, annual | 2 (3 G) | no Ag |
| G | e,u,s | 0 | 2021, annual | 4 | no Ag |
| H | e,u,s | 0 | 2005-2020, annual | 4 | no Ag, split in parts |
>Vars: e=employment #; u=units/enterprises; s=salary/wages
Most up-to-date: D
Highest frequency: D
Most industry detail: G, H
Most space detail: A
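
The summary above can be reproduced from the comparison table. A minimal sketch, with NUTS and NACE levels transcribed from the table (reduced level 1, `1r`, is coded as 1, and F is coded at its general level 2); the structure and names are illustrative:

```python
# (NUTS level, NACE level) per source, transcribed from the comparison table.
SOURCES = {
    "A": (3, 1), "B": (2, 1), "C": (2, 1), "D": (0, 2),
    "E": (2, 1), "F": (2, 2), "G": (0, 4), "H": (0, 4),
}


def most_detailed(sources, axis):
    """Return the sources with the highest level on axis 0 (NUTS) or 1 (NACE)."""
    best = max(levels[axis] for levels in sources.values())
    return sorted(ref for ref, levels in sources.items() if levels[axis] == best)


print(most_detailed(SOURCES, axis=0))  # ['A']
print(most_detailed(SOURCES, axis=1))  # ['G', 'H']
```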
```
  ^ NUTS
  |
3 |  A
  |
2 |  B,C,E     F
  |
1 |
  |
0 |            D                   G,H
  |                                     NACE
  ----------------------------------------->
     1r        2         3         4
```
We select D as the main reference and aim to extend it with G and H (for NACE detail) and with A (for NUTS detail).
We do not consider B, C, E, and possibly F any further; these could instead be used for validation.
Note that only F, G, and H have information on units/enterprises and salaries.
```
  ^ NUTS
  |
3 |  REA(A)
  |
2 |
  |
1 |
  |
0 |            LFS(D)              SBS(G,H)
  |                                     NACE
  ----------------------------------------->
     1r        2         3         4
REA: Regional Economic Accounts
SBS: Structural Business Statistics
LFS: Labour Force Survey
```
### Authors
Riccardo Boero - ribo@nilu.no
### License