Hey, Gedeon,
Some points;
1) It looks like the definition of Net area is Area Owned + Area Leased, which unfortunately muddles how much of the land they own:
. reg netcultarea ownedland leasedland
Source | SS df MS Number of obs = 4527
-------------+------------------------------ F( 2, 4524) = .
Model | 519745.073 2 259872.536 Prob > F = 0.0000
Residual | 1951.045 4524 .431265474 R-squared = 0.9963
-------------+------------------------------ Adj R-squared = 0.9963
Total | 521696.118 4526 115.266487 Root MSE = .65671
------------------------------------------------------------------------------
netcultarea | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
ownedland | .9992911 .0009141 1093.14 0.000 .9974989 1.001083
leasedland | .8267605 .006375 129.69 0.000 .8142623 .8392586
_cons | .039269 .0116594 3.37 0.001 .0164108 .0621272
------------------------------------------------------------------------------
Luckily it appears very few households lease land.
2) Are you sure "price of ___ land" isn't actually the total purchase price of the land rather than the per-unit price (which means these two price variables in their original form may be what you want)? Look:
. gen p = priceunirr/netirr
(2595 missing values generated)
. sum p, detail
p
-------------------------------------------------------------
Percentiles Smallest
1% 40 0
5% 89.28571 0
10% 120 0 Obs 1932
25% 250 0 Sum of Wgt. 1932
50% 500 Mean 1419.346
Largest Std. Dev. 3937.299
75% 1250 33333.33
90% 3000 50000 Variance 1.55e+07
95% 5000 50000 Skewness 12.29906
99% 18666.67 100000 Kurtosis 240.6135
But of course it's odd that the "price" variable is defined for more households than the net land variable...I suppose you should try it both ways and see what happens...
-@