Purpose: This technical assessment evaluates your ability to design, build, and explain a data lake solution using the FY 2024 Federal Real Property Profile (FRPP) Public Dataset. The focus is on practical engineering tradeoffs and clear documentation.
The Federal Real Property Profile Management System (FRPP MS) is the federal government’s centralized inventory of real property under the custody and control of executive branch agencies. Due to national security and other authorized exclusions, some FRPP data is not available to the public.
Source: Harvested from GSA JSON sample csv data.
You are supporting a public data analytics team that needs a minimal FRPP data lake for exploration, reporting, and future GenAI-assisted analysis. Your task is to propose a clean architecture and build a small, working subset using the FY 2024 FRPP Public Dataset.
| Type | Requirement | Description | Expectation Fit |
|---|---|---|---|
| Required | US-REQ-01 | Ingest the FRPP Public Dataset (Excel/CSV) into S3 so it is available in the data lake. Cost details 💰 | REQ-EXP-03 |
| Required | US-REQ-02 | Catalog FRPP data in Glue and query it in Athena, including a curated view for utilization and asset status. | REQ-EXP-02, REQ-EXP-03 |
| Required | US-REQ-03 | Provide a minimal API (Lambda or container) that returns FRPP summaries by agency, state, or utilization. | REQ-EXP-03 |
| Required | US-REQ-04 | Produce summary metrics (counts, square feet, replacement value) by agency, bureau, and state. | REQ-EXP-02 |
| Required | US-REQ-05 | Document public exclusions, missing values, and data quality handling assumptions. | REQ-EXP-01, REQ-EXP-04 |
| Required | US-REQ-06 | Provide an optional narrative summary using Amazon Bedrock (or an LLM) based on query output. | REQ-EXP-03, REQ-EXP-05 |
| Required | US-REQ-07 | Publish dataset metadata (field list, update frequency, and licensing) in the README. | REQ-EXP-04 |
| Optional | REQ-OPT-01 | Provide an OpenSearch index for metadata search. | REQ-EXP-03 |
| Optional | REQ-OPT-02 | Add a geospatial summary (e.g., CBSA or state rollups) or a map-friendly export. | REQ-EXP-02 |
| Optional | REQ-OPT-03 | Provide a simple UI or CLI client for FRPP queries. | REQ-EXP-03 |
| Optional | REQ-OPT-04 | Add CI checks or unit tests. | REQ-EXP-05 |
Field list (CSV header) from the public metadata sample:
Reporting Agency, Reporting Agency Code, Reporting Bureau, Reporting Bureau Code, Real Property Unique Identifier, Using Agency, Using Agency Code, Using Bureau, Using Bureau Code, US/Foreign, Country Code, Country Name, State Code, State Name, County Code, County Name, City Code, City Name, Zip Code, Core-based Statistical Area, Core-based Statistical Area Code, Street Address, Installation Id, Installation Name, Sub Installation Id, Latitude, Longitude, Legal Interest Indicator, Legal Interest Code, Real Property Type, Real Property Type Code, Real Property Use, Real Property Use Code, Utilization, Utilization Code, Asset Status, Asset Status Code, Historical Status, Historical Status Code, Field Office, Field Office Code, Field Office Collocation, Field Office Collocation Code, Acres, Actual Sales Price, Age of Property, Reason Cannot Currently Be Disposed, Reason Cannot Currently Be Disposed Code, Cannot Currently Be Disposed of Date, CFO Agency, Congressional District, Consecutive Years Underutilized, Date Start of Underutilized, Determination to Dispose Date, Determine # of Contractors, Determine Number of Fed Contractors Code, Number of Federal Contractors, Determine # of Employees, Determine Number of Fed Employees Code, Number of Federal Employees, Disposition Date, Disposition Method, Disposition Method Code, Estimated Future Capital Expenditures, Estimated NPV (as of 6/11/2024), Excess Date, FASTA Disposal Exclusion, FASTA Disposal Exclusion Code, FASTA Disposal Exclusion Reason, FASTA Disposal Exclusion Reason Code, FiscalYear, Historical Capital Expenditures, Lease Annual Rent to Lessor, Leased Annual Maintenance Costs, Leased Annual Operations Costs, Lease Authority, Lease Authority Code, Lease Expiration Date, Lease Occupancy Date, Lease Start Date, Net Proceeds, Number of Days Excess, Number of Days Surplus, Number of Days Underutilized, Outgrant Indicator, Owned and Otherwise Managed Annual Operations Cost, Owned and Otherwise Managed Annual Maintenance Cost, Repair Needs, Replacement Value, Condition Index, Report of Excess Accepted Date, Report of Excess Submitted Date, Reduce the Footprint, Square Feet (Buildings), Square Feet Unit of Measure, SquareFeet Unit Of Measure Code, Structural Units, Surplus Declaration Date, Sustainability Status, Sustainability Code, Sustainability System Used Code, Sustainability System Used, Sustainability Square Footage, Sustainability Assessment Date, Unit Of Measure, Unit Of Measure Code, Year Asset Reported Underutilized, Year of Construction, Outgrant Indicator Value, Asset Height Range, Asset Height, Asset Height Range Above Mean Sea Level, Asset Height Above Mean Sea Level, Elevation Above Mean Sea Level, FOIA Exemption, FOIA Exemption Code, Statutory Citation
COMMERCE,13,NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION.,14,AKM13201,COMMERCE,13,NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION.,14,UNITED STATES,840,UNITED STATES,02,ALASKA,020,ANCHORAGE,0710,ELMENDORF AFB,99506,,0,,5771000,NWS ANCHORAGE AK (ELMENDORF) CAMPUS INSTALLATION,001624,61.2601000,-149.6901000,Owned,G,Building,35,Laboratories,74,Unutilized,5,Report of Excess Accepted,C,Not Evaluated,5,,,,,,67,,true,AL,5,9/30/2020,,Yes,Y,0,Yes,Y,0,,,$0.00,,5/9/2018,Yes,Y,On Military Installation,MIL,2024,$0.00,,,,,,,,2,584,,1,709,N,$7,296.00,$0.00,,$808,630.00,,5/9/2018,,N,1,824,Gross Square Feet,G,,,,,,,,2020,1958,No,Height > 0 feet and <= 30 feet above ground level,,>319.70441 and <=349.70441,,319.70441,No FOIA exemption,10,
Use AWS services where possible. If you cannot access AWS, mock the integrations and explain how they would map to real services.
/frpp/query endpoint.agency (optional), state (optional), utilization (optional), and limit (optional).summary, rows, sources, and status.model and promptVersion in the response.POST /frpp/query
| Field | Type | Description |
|---|---|---|
| agency | string | Reporting agency name or code (optional) |
| state | string | State code or name (optional) |
| utilization | string | Utilization value or code (optional) |
| limit | number | Max rows to return (optional) |
{
"agency": "COMMERCE",
"state": "AK",
"utilization": "Unutilized",
"limit": 10
}
{
"summary": {
"count": 3,
"totalSquareFeet": 11966,
"totalReplacementValue": 2565221
},
"rows": [
{"realPropertyId": "AKM13201", "state": "AK", "utilization": "Unutilized", "squareFeet": 1824}
],
"sources": ["s3://.../frpp_public_dataset_fy24_07022025.csv", "athena://query/123"],
"status": "ok"
}